I would like to build a small application that runs like a black box :
Folder In : Scanned documents 200 DPI (with PDF format)
Folder Out : Same document + the info that comes from the Acrobat text recognition tool (= text reput in place with a transparent font to overlap the letters that are matched...)
I can do this manually but I would like to script or even to automate this process on a server (Windwos or Linux, does not matter...).
- Does someone know what scripting tools I can use?
- Is there a command line version of Acrobat?
- The menu command asks to chose the main OCR language... Can I skip this with the script command (even if OCR performance is degraded)?
My target is not to reach perfect OCR performance, but just enough to be able to index my documents with any local HDD search engine.
Thanks for your help
If it's just a small process you would like to run on the desktop, try creating a batch sequence to automate the process. Here is a link to a [url=http://www.acrobatusers.com/tutorials/scan-and-batch-ocr-using-acrobat-professional]video tutorial[/url] on how to do this.
Lori Kassuba is an AUC Expert and Community Manager for AcrobatUsers.com.