These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

OCR Guidelines to Produce Searchable PDF - Acrobat 9.0 Pro

Shiloh Jones
Registered: Mar 6 2009
Posts: 3
Answered

I have several thousands of documents to scan, index and produce searchable PDF - on a continuing basis. I have been given to understand that the OCR capability of 9.0 is limited to 50 pages per document.

If this is accurate, what is the recommended OCR engine (AABBYY, Omnipage,etc) for large groups of documents.

Shiloh

My Product Information:
Acrobat Pro 9.0, Windows
daka630
Expert
Registered: Mar 1 2007
Posts: 1420
G'day Shiloh,

I've been unable to find anything that cites a page limit associated with Acrobat 9's OCR engine
in the on line help (for Standard | Pro | Extended) or the product comparison information .

When I used Acrobat 4.x there was no page limits associated with OCR.
With Acrobat 5.x I had to download and install the Paper Capture Plug-In which imposed a 50 page limit on each OCR run.
When I used Acrobat 6.x Professional there was no page limits associated with OCR.
Currently using Acrobat 7.x Professional, Acrobat 8.x Professional, and Acrobat (8.x) 3D there are no page limits associated with OCR.

Cetainly other OCR engines might better fit your needs.
In addition to AABBY and Omnipage, you might consider AdLib Server.
Also, while perhaps not as "cutting edge" as some; Adobe's Capture product may be
something to consider.

An advantage of a server based product is that you do not have a dedicate computer to run the OCR job(s) which would be the case if you were using Acrobat's OCR engine.

Be well...

Be well...

Shiloh Jones
Registered: Mar 6 2009
Posts: 3
G'day daka,

Thanks for the response. Following are (dated) comments found on PlanetPDF site:

QUOTE …. prior to Acrobat 6, Adobe allowed you to perform "paper capture" with Acrobat only up to 50 pages. If you have Acrobat 4 or 5, you've got a 50-page limit (although, of course, there are ways to work around it.) I think that Adobe still offers the Capture Server product for large scale scanning and OCR work. It's meant for use in a high-volume production environment, such as a litigation support vendor. In my experience, in government at least, people were leery of using it because you paid by the page. That is, you could buy a 100,000-page license and then you have to fill 'er up again for the next 100,000. The most significant difference in the earlier versions is that, until Acrobat 6, you could only OCR 50 pages at a time. END QUOTE.

Comments do not address 9.0 but led me to assume there was an OCR limit in 9.0. Nevertheless, when I attempt to use 9.0 OCR Text Recognition, Acrobat shuts down with error screen while processing several PDF files of 100 pages plus.

I have used Capture 3.0 and grew tired of it's license 'dongle' and SLOW speed. Seems I need a more robust (and more expensive) OCR engine.

Regards,
daka630
Expert
Registered: Mar 1 2007
Posts: 1420
Bon dia Shiloh,

fwiw
I too have read the PlantPDF post re: page limit for OCR with Acrobat 4.x
Found it odd as for a time I ran 4 to 6 machines with licenses for Acrobat 4.x
for 5 days a week, 8 hours a day to OCR large scanned documents that were in PDF.
Never encountered a page limit.
Only issue was resource allocation on the Windows boxes in use.

Re: Capture - Capture Cluster Edition - no dongle. No page limit.

Be well..

Be well...