These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

OCR Regognition from Downloaded pdf files

KappJay
Registered: Jan 11 2011
Posts: 3
Answered

I am using Adobe Acrobat Pro 9.4.1 on a Mac running OSX 10.6.6. I scan in many documents to pdf format and use the OCR and Clear Scan options to make them searchable. It all works well, except when I download a file from some financial companies.
 
The credit card companies I deal with do not create files that I can convert to OCR or at least I have not found a way to do that. In one case the option to convert using OCR recognition is grayed out and in other cases it fails because the document is in "renderable pdf text". Some of these documents were created using Adobe Acrobat 5.0.
 
I would like to be able to directly download these files and make them searchable as opposed to letting the paper come to my house then scanning it in.
 
Have you had a similar experience? If so, were you able to convert these documents somehow? Thanks in advance for your advice.

Jay

My Product Information:
Acrobat Pro 9.4, Macintosh
gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4308
A lot depends upon how the PDF was created and the quality of the PDF.

George Kaiser

rbogie
Registered: Apr 28 2008
Posts: 432
Accepted Answer
if the document contains "renderable text" it is incompatible with OCR because renderable text IS searchable. OCR is applied to scanned (image) PDF.
mansoorindia
Registered: Jan 21 2011
Posts: 1
I have one PDF file with the information type in Embedded Subset of True Type Fonts (Hindi) viz. CDAC_GISTSurekh (TT) and CDAC_GistSurekh-Bold (TT).

I would like to perform OCR and would like use in my word/excell document of the above said PDF.

I used ReadIRIS Pro/PaperPort, but i cant get results.. i mean .. not able to use that data in Word/Excell in Hindi.

please help me.. how it is possible.

plz. do mail me mansoorindia [at] gmail [dot] com