Hello,
I have approximately 10 PDF files (5-10 MB each) that I would like to embed an index in. The files say they were produced by Acrobat Distiller 9.0.0 and are PDF version 1.5 (Acrobat 6.x) or PDF version 1.6 (Acrobat 7.x).
When I search the documents after either embedding an index or creating a full index with Catalog, none of my search terms are found (the only search terms that bring up results are those in the document's properties page).
Both the embedded index and the Catalog index creation steps complete without errors.
When I try the Recognize Text Using OCR command, I get:
"Acrobat could not perform recognition (OCR) on this page because:
This page has graphics other than images or text on it. It cannot be captured."
Is there an easy way to convert these documents (approx. 300-400 pages each) back into text so I can see my search results?
Thanks.
Something to try.
First, use Acrobat's Print Setup dialog to configure what you need for your PDFs.
If all of these PDFs have, or can be given, the same page size, you only need to do this once.
In the Print Setup dialog, configure Paper Size and Orientation.
Then, click on the "Properties" button.
In the Adobe PDF Document Properties dialog, view [i]each[/i] tab.
Be sure each shows desired settings. Edit as appropriate.
If you are using a custom PostScript page size then, under the Paper/Quality tab,
be sure to click on the "Advanced" button.
Doing so presents the Adobe PDF Converter Advanced Options dialog.
At the right of the "Paper Size" line, click the button. In the dialog that appears,
enter the appropriate custom page size.
In the Adobe PDF Document Properties dialog's Adobe PDF Settings tab,
select the desired Distiller Job Option.
With that done -
File > Print
This opens the Print dialog.
At the bottom-left region of the dialog, click on the "Advanced" button.
In the Advanced Print Setup dialog, observe the "Print as Image" entry (upper right region).
Select this and select the desired dpi value.
Click OK.
Back in the Print dialog, confirm shown settings are what you want.
Click OK.
This produces a new PDF from the one open in Acrobat.
This PDF's content is all image.
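Since every page is now a bitmap, the dpi value chosen in the Advanced Print Setup dialog determines how many pixels each page carries. Here is a rough sketch for comparing candidate dpi values before printing 300-400 pages. The US Letter page size (8.5 x 11 in) and 3 bytes per pixel (24-bit color) are assumptions on my part, not something taken from your files, and the figure is for uncompressed pixel data, so the PDF actually written should be smaller.

```python
def raster_size(width_in, height_in, dpi):
    """Return (width_px, height_px) for a page rendered at the given dpi."""
    return round(width_in * dpi), round(height_in * dpi)


def uncompressed_megabytes(width_in, height_in, dpi, pages, bytes_per_pixel=3):
    """Estimate raw pixel data in MB (10**6 bytes) for a whole document.

    bytes_per_pixel=3 assumes 24-bit RGB; use 1 for 8-bit grayscale.
    """
    width_px, height_px = raster_size(width_in, height_in, dpi)
    return width_px * height_px * bytes_per_pixel * pages / 1_000_000


if __name__ == "__main__":
    # Assumed page size: US Letter. Substitute your own dimensions in inches.
    page_w, page_h = 8.5, 11
    for dpi in (150, 300, 600):
        w, h = raster_size(page_w, page_h, dpi)
        mb = uncompressed_megabytes(page_w, page_h, dpi, pages=400)
        print(f"{dpi} dpi: {w} x {h} px per page, about {mb:,.0f} MB raw for 400 pages")
```

300 dpi is a common starting point for OCR. Going higher quadruples the pixel count each time the dpi doubles, so try one short page range first.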
Try OCR on this PDF.
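If you want a quick check outside Acrobat that OCR actually added text to the file, the sketch below scans a PDF's streams for text-showing operators (Tj / TJ inside a BT block). It uses only the Python standard library and is a crude heuristic, not a PDF parser: it only understands Flate-compressed or uncompressed streams, it ignores encryption, and the function names and the file path passed on the command line are my own placeholders.

```python
import re
import sys
import zlib

# A stream body sits between "stream" + EOL and "endstream".
STREAM_RE = re.compile(rb"(?<!end)stream\r?\n(.*?)endstream", re.DOTALL)
# A string or array operand followed by a text-showing operator.
TEXT_OP_RE = re.compile(rb"[)\]>]\s*T[jJ]\b")


def looks_like_text(data, sample_size=2048, threshold=0.9):
    """True if the start of the data is mostly printable ASCII.

    Page content streams are operator text; image data is mostly binary.
    """
    sample = data[:sample_size]
    if not sample:
        return False
    printable = sum(1 for b in sample if 32 <= b <= 126 or b in (9, 10, 13))
    return printable / len(sample) >= threshold


def has_text_operators(pdf_bytes):
    """Heuristic: does any stream contain text-showing operators?"""
    for match in STREAM_RE.finditer(pdf_bytes):
        raw = match.group(1)
        try:
            data = zlib.decompressobj().decompress(raw)
        except zlib.error:
            data = raw  # not Flate data; inspect the bytes as they are
        if not looks_like_text(data):
            continue  # most likely image or font data, skip it
        if b"BT" in data and TEXT_OP_RE.search(data):
            return True
    return False


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], "rb") as handle:
        found = has_text_operators(handle.read())
    print("text operators found" if found else "no text operators found")
```

Run it on the image-only PDF before and after OCR. If it reports text after OCR, the file has something for search and the index to pick up. If it still reports none, the recognition step did not write a text layer.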
Be well...