These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

Editing hidden text behind PDF image of historic documents

HistoricArchives
Registered: Dec 17 2009
Posts: 2

I need to edit the hidden text behind a scanned PDF image of a document. The image must remain as an “exact” copy of the original scanned document.

I used Adobe Acrobat Pro (versions 7 and 9) to make PDF images of old typed documents from the 1940’s. When I open those images and run OCR in version 9, then examine the hidden (invisible) text layer behind the image, there are errors. For example, the word “book” has been picked-up by the OCR as the word “look.” I need to change the “l” to a “b” in order to make the PDF accurate when it is searched at a later date.

I have checked many user forums. Most people imply that hidden text can be viewed, but NOT edited in Acrobat Pro 7 and 9. (Hidden text can be viewed in Version 9 by selecting “Document” “Examine Document” and then clicking on the “+” symbol next to “Hidden Text,” then clicking “Show preview.”) Some say to use Adobe Capture 3.0 to edit hidden text. Others say to use Photoshop or Illustrator to edit hidden text (I think these folks were confused, in that Photoshop and Illustrator would be used, logically, to edit the image on top of the hidden text). Yet another person seemed to say that Adobe added a hidden text editor to Acrobat 8, but took it away in Acrobat 9. (I don't have 8, so I can't verify this.)

The closest answer I was able to find involved using the Text Touch Up Tool on top of the image to edit hidden text behind it, but when you do that you are typing “blind.” In other words, you highlight a spot on the image (top layer) where you THINK the error MIGHT be, and you type the correction without being able to see what you are typing over. Then, you go back to the “Examine Document” procedure to see if you “hit” your mark, and if not, you redo it until you do “hit” your mark. With the number of documents and corrections that we have, that process is too labor intensive and thus a budget breaker.

If I have to buy more software to edit hidden text, my preference would be to buy a genuine Adobe product because I have experienced problems in the past switching back and forth between Adobe products and other PDF manipulation software, such as PaperPort.

Can anyone answer any of these questions:

(1) Is there a way in Acrobat versions 7, 8 or 9 to edit hidden text, and if so, how?

(2) What Adobe software (other than Acrobat) will edit hidden text behind an image?

(3) Assuming no Adobe product will edit hidden text behind an image, is there any non-Adobe products that will do that?

Thank you!

My Product Information:
Acrobat Pro 9.2, Windows
daka630
Expert
Registered: Mar 1 2007
Posts: 1420
Two threads at the Adobe Forums Acrobat Windows forum that may be of interest.

[url=http://forums.adobe.com/thread/540646?tstart=0]OCR and hidden text in PDF scans of historic documents[/url]
[url=http://forums.adobe.com/thread/540565?tstart=0]Can OCR'd text be edited[/url]

Be well...

Be well...

daor79
Registered: Nov 24 2009
Posts: 2
daka630 wrote:
Two threads at the Adobe Forums Acrobat Windows forum that may be of interest.[url=http://forums.adobe.com/thread/540646?tstart=0]OCR and hidden text in PDF scans of historic documents[/url]
[url=http://sieuthi77.com/giavang]Can OCR'd text be edited[/url]

Be well...
Select the "Tools>Advanced Editing>TouchUp Text Tool", then select the text that you want to edit. Right-click on the selection and bring up the properties dialog. Then select black as the stroke and fill color. This will make the text visible. Just edit your text, but don't get confused by the image in the foreground. Once you are done, set the two colors back to "no color" and you are done.