Greg Lopp on 22 Feb 2005 18:15:21 -0000 |
Gregson Helledy wrote: I have a .pdf file which I'd like to convert to text. How about... $ apt-get install gs-common $ pdf2ps $FILE.pdf $FILE.ps $ ps2ascii $FILE.ps $FILE.txt What did the .pdm look like? Perhaps the text of your .pdf did not survive that translation.I apt-got a package called gocr (and gocr-gtk, a frontend). gocr wants .pbm files, so I converted the .pdf to .pbm with ImageMagick, then used gocr on it.
|
|