• MonkderVierte@lemmy.ml
    link
    fedilink
    arrow-up
    3
    ·
    5 hours ago

    Selecting text doesn’t work in most multi-column pdfs and good OCR cost money. And if the original source is lost and you want an exact copy in word, the OCR tools need to be really good at guessing whitespace-to-line ratio, because pdf is only an output format and not a processing format.

    For most other converting needs, there’s pandoc, imagemagick and ffmpeg.