These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

How to use Acrobat in server mode to run OCR on a flow of PDF images?

Vladimir
Registered: Oct 13 2009
Posts: 2
Answered

I would like to build a small application that runs like a black box :
Folder In : Scanned documents 200 DPI (with PDF format)
Folder Out : Same document + the info that comes from the Acrobat text recognition tool (= text reput in place with a transparent font to overlap the letters that are matched...)

I can do this manually but I would like to script or even to automate this process on a server (Windwos or Linux, does not matter...).
- Does someone know what scripting tools I can use?
- Is there a command line version of Acrobat?
- The menu command asks to chose the main OCR language... Can I skip this with the script command (even if OCR performance is degraded)?

My target is not to reach perfect OCR performance, but just enough to be able to index my documents with any local HDD search engine.

Thanks for your help

lkassuba
ExpertTeam
Registered: Jun 28 2007
Posts: 3636
Acrobat isn't designed (i.e., not multi-threaded) nor licensed for use as a server product. For a server-based product that has these features you'll want to look into [url=http://www.adobe.com/products/livecycle/pdfs/91011467_lces2_pdfgenerator_ds_ue.pdf]LiveCycle PDF Generator[/url].

If it's just a small process you would like to run on the desktop, try creating a batch sequence to automate the process. Here is a link to a [url=http://www.acrobatusers.com/tutorials/scan-and-batch-ocr-using-acrobat-professional]video tutorial[/url] on how to do this.

Lori Kassuba is an AUC Expert and Community Manager for AcrobatUsers.com.