No luck using Batch Sequence to OCR files in folder

jurban
Registered: Sep 30 2008
Posts: 5

Hello,

I am attempting to use the "Recognize Text Using OCR" batch sequence to OCR several documents in a folder, but I keep getting the same result: nada. I have the PDF output style set to "searchable image" and downsample images set to Low (300 dpi).

The sequence processes, but after each file it lists the following information in a separate window:

Under "optimization results" the command indicates the document was being saved, but there is no Output. Following the Output section is this statement:

"Settings which allow retaining the document's PDF version, cannot be processed. Please select a different option."

I've searched this forum for someone with the same problem, but having found no one else, I'm guessing I'm missing something basic here.

Thanks for your help.

My Product Information:
Acrobat Pro 8.0, Macintosh

daka630
Expert
Registered: Mar 1 2007
Posts: 1420
Hi,
What is the original, effective resolution of the scanned images?
If your initial effective resolution is, say, 250 ppi, a downsample to a higher value is somewhat problematic.
Extract some test pages and try using the other downsample values (600, 150, 72) to see what happens.
Are results from 600 similar to those at 300?
What happens at 150? 72?

n.b., Adobe, fairly consistently, has stated in various documents that the minimum recommended value is 300 ppi for acceptable OCR of textual content. NARA targets 600 ppi.
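
To make the resolution point concrete, here is a minimal sketch of the arithmetic. It is not anything Acrobat exposes; the function names and the sample scan (2125 pixels across an 8.5 inch page) are made up for illustration. Effective resolution is just pixels divided by physical size, and a downsample target only makes sense if it is below that value.

```python
def effective_ppi(pixel_width: int, page_width_inches: float) -> float:
    """Effective resolution = pixels across the page / physical width in inches."""
    if page_width_inches <= 0:
        raise ValueError("page width must be positive")
    return pixel_width / page_width_inches


def downsample_is_meaningful(source_ppi: float, target_ppi: float) -> bool:
    """Downsampling only discards pixels, so the target must be below the source."""
    return target_ppi < source_ppi


if __name__ == "__main__":
    # Hypothetical scan: 2125 pixels across an 8.5 inch wide page.
    ppi = effective_ppi(2125, 8.5)
    print(f"effective resolution: {ppi} ppi")
    # The four downsample values offered in the OCR settings discussed above.
    for target in (600, 300, 150, 72):
        print(target, downsample_is_meaningful(ppi, target))
```

For that hypothetical 250 ppi scan, only the 150 and 72 targets would actually remove pixels; 300 and 600 are above what the image contains.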

Be well...


jurban
Registered: Sep 30 2008
Posts: 5
daka630,

So of course, after I posted my question last night, I found something that worked. It turns out that de-selecting "PDF Optimizer" in the Output Options solved the problem. Not being much of an Acrobat expert, though, I'm not sure why this solved the problem. Several of the PDFs in my batch are pretty old files, and their resolution is not incredible (think dot matrix). Since the batch I am running includes almost 5000 files, the sequence is still running. Once it completes, I look forward to tinkering with your suggestions to see if I can't advance my Acrobat knowledge (and accept your answer).

Thanks for taking the time to address my question. Cheers,

Jon

daka630
Expert
Registered: Mar 1 2007
Posts: 1420
jurban,
PDF Optimizer - should have thought of that; I have had occasion to put it to good use. I was too focused on the OCR end of it, and the Batch Sequence usage sort of sailed over my head.

btw - the KB TechNote below is chock-a-block with useful information associated with scanning/OCR/PDF.
http://kb.adobe.com/selfservice/viewContent.do?externalId=317791

Be well...
