These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

Cleaning up rough text from scanned papers

Oceanking
Registered: Dec 5 2008
Posts: 2

Is there a way to "clean up" rough text from scanned papers and books? For example, some older scanned pdf's I have have jaggedy text as a result of older scanning technology. This makes the documents harder to read. The text is good enough that it is recognized as text and not an image, so it seems possible to convert the rough text to a regular font that is clean. I have been able to do this by converting the PDf to a word document using ABBYY PDF Transformer, but this often creates strange artifacts in the images,etc.

Does anyone know if Acrobat can do this, or how it can be done?

Thanks for helping!

Oceanking

daka630
Expert
Registered: Mar 1 2007
Posts: 1420
Oceanking,
Perhaps this will help.
With Acrobat Professional 8 or 9 you have the "Examine Document" feature.
With this you can delete the existing OCR text.
Run OCR again. This would be using the "newer" technology found in Acrobat 8 or 9.
Searchable Image gives you the OCR text and keeps the image (tweeked somewhat).
Searchable Image (Exact) gives you the OCR text and the image (no tweeks).
Formatted Text & Graphics or Clear Scan gives you the OCR text which replaces the image.
This third option lets you correct suspect (spelling) words.

The OCR output from all three may be exported to a word processor where additional editing and formatting can be accomplished.
This might be a change of font type, etc.

Be well...

Be well...