I am working on publishing a large number of public domain documents on the web. We have them as PDFs and as OCR'd text files. The OCR files serve as a means to search the documents for key words, but they are fairly ugly, and the PDF's give an accurate picture of each page, but are fairly large. Unfortuanately, each doc = one file and the files are fairly large (10 megs up to about 60 megs). Needless to say (write) that takes a while with a 56k modem. So, does anyone know if any of the existing Perl PDF modules will pull a specific page from a PDF? Are there any examples of this being done? Any clues would be appreciated.
Thanks,
keep the rudder amid ship and beware the odd typo
Thanks,
keep the rudder amid ship and beware the odd typo