• hexaflexagonbear [he/him]
    hexbear
    10
    10 months ago

    To be fair, PDFs are a pain in the ass. I wrote a Python script that extracts the text, but I feel like there's information I'm missing. Anyone have any tips?

    • InevitableSwing [none/use name]
      hexbear
      7
      10 months ago

      > To be fair, PDFs are a pain in the ass.

      They are an Adobe invention. I wouldn't expect anything less. I guess it could be worse though. Imagine if Microsoft had invented them.

      "Unable to open file. Download ms-pdf-pdf-upgrade-pdf.pdf for instructions on how to upgrade!" Clippy happily says.

    • buckykat [none/use name]
      hexbear
      5
      10 months ago

      me using a LibreOffice terminal command to convert pptx to pdf so I can annotate it in Xournal++
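
      For anyone who wants to script that kind of conversion, here is a rough sketch of one way it could look in Python. It assumes LibreOffice is installed and reachable on PATH as `soffice` or `libreoffice`. The function names `build_convert_command` and `convert_pptx_to_pdf` are made up for illustration, and this is not necessarily the exact command used above.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(pptx_path, out_dir, binary="soffice"):
    """Build the argument list for a headless LibreOffice pptx -> pdf conversion."""
    return [
        binary,
        "--headless",            # run without opening the GUI
        "--convert-to", "pdf",   # target format
        "--outdir", str(out_dir),
        str(pptx_path),
    ]


def convert_pptx_to_pdf(pptx_path, out_dir="."):
    """Run the conversion and return the path where the PDF is expected to land."""
    # Assumption: the launcher is called soffice or libreoffice on this system.
    binary = shutil.which("soffice") or shutil.which("libreoffice")
    if binary is None:
        raise FileNotFoundError("LibreOffice (soffice) not found on PATH")
    subprocess.run(build_convert_command(pptx_path, out_dir, binary), check=True)
    # LibreOffice keeps the input file's base name and swaps the extension.
    return Path(out_dir) / (Path(pptx_path).stem + ".pdf")
```

      Calling `convert_pptx_to_pdf("slides.pptx", "out")` should leave `out/slides.pdf` ready to open in Xournal++, assuming the conversion succeeds.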