I am working on publishing a large number of public domain documents on the web. We have them as PDFs and as OCR'd text files. The OCR files serve as a means to search the documents for key words, but they are fairly ugly, and the PDF's give an accurate picture of each page, but are fairly large. Unfortuanately, each doc = one file and the files are fairly large (10 megs up to about 60 megs). Needless to say (write) that takes a while with a 56k modem. So, does anyone know if any of the existing Perl PDF modules will pull a specific page from a PDF? Are there any examples of this being done? Any clues would be appreciated.
keep the rudder amid ship and beware the odd typo
Red Flag Submitted
Thank you for helping keep Tek-Tips Forums free from inappropriate posts. The Tek-Tips staff will check this out and take appropriate action.
Reply To This Thread
Posting in the Tek-Tips forums is a member-only feature.