I need to edit the hidden text behind a scanned PDF image of a document. The image must remain as an “exact” copy of the original scanned document.
I used Adobe Acrobat Pro (versions 7 and 9) to make PDF images of old typed documents from the 1940s. When I open those images and run OCR in version 9, then examine the hidden (invisible) text layer behind the image, there are errors. For example, the word “book” has been picked up by the OCR as the word “look.” I need to change the “l” to a “b” so that the PDF is accurate when it is searched at a later date.
I have checked many user forums. Most people imply that hidden text can be viewed, but NOT edited, in Acrobat Pro 7 and 9. (Hidden text can be viewed in version 9 by selecting “Document” > “Examine Document,” clicking on the “+” symbol next to “Hidden Text,” and then clicking “Show preview.”) Some say to use Adobe Capture 3.0 to edit hidden text. Others say to use Photoshop or Illustrator to edit hidden text (I think these folks were confused, in that Photoshop and Illustrator would be used, logically, to edit the image on top of the hidden text). Yet another person seemed to say that Adobe added a hidden text editor to Acrobat 8, but took it away in Acrobat 9. (I don't have 8, so I can't verify this.)
The closest answer I was able to find involved using the TouchUp Text tool on top of the image to edit the hidden text behind it, but when you do that you are typing “blind.” In other words, you highlight a spot on the image (top layer) where you THINK the error MIGHT be, and you type the correction without being able to see what you are typing over. Then you go back to the “Examine Document” procedure to see if you “hit” your mark, and if not, you redo it until you do. With the number of documents and corrections that we have, that process is too labor-intensive and thus a budget breaker.
If I have to buy more software to edit hidden text, my preference would be to buy a genuine Adobe product because I have experienced problems in the past switching back and forth between Adobe products and other PDF manipulation software, such as PaperPort.
Can anyone answer any of these questions:
(1) Is there a way in Acrobat versions 7, 8 or 9 to edit hidden text, and if so, how?
(2) What Adobe software (other than Acrobat) will edit hidden text behind an image?
(3) Assuming no Adobe product will edit hidden text behind an image, are there any non-Adobe products that will do that?
Thank you!
[url=http://forums.adobe.com/thread/540646?tstart=0]OCR and hidden text in PDF scans of historic documents[/url]
[url=http://forums.adobe.com/thread/540565?tstart=0]Can OCR'd text be edited[/url]
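For anyone comfortable with scripting, here is a rough sketch of why the hidden layer can be corrected without touching the scan at all. In a PDF, OCR text is normally drawn with text rendering mode 3 (the "3 Tr" operator, meaning invisible), while the scanned image is a separate object. The function below is only an illustration, not an Acrobat feature: the name fix_hidden_text is made up, it works on a page content stream that has already been extracted and decompressed by some other tool, and it assumes the text is stored as plain literal strings shown with the Tj operator. Real OCR output may use TJ arrays, q/Q state saves, or font-specific encodings, none of which this handles.

```python
import re

# Matches either a "<n> Tr" render-mode operator or a "(literal) Tj" show-text operator.
# Simplification: nested parentheses inside a literal string are not supported.
TOKEN = re.compile(rb"(?P<mode>\d+)\s+Tr\b|\((?P<text>(?:\\.|[^\\()])*)\)\s*Tj\b")


def fix_hidden_text(stream: bytes, wrong: bytes, right: bytes) -> bytes:
    """Replace `wrong` with `right` only inside invisible (render mode 3) text."""
    mode = 0  # the PDF default text rendering mode is 0 (filled, visible)
    out = []
    pos = 0
    for m in TOKEN.finditer(stream):
        out.append(stream[pos:m.start()])  # copy everything between tokens unchanged
        token = m.group(0)
        if m.group("mode") is not None:
            mode = int(m.group("mode"))  # remember the current rendering mode
        elif mode == 3:
            # Rewrite only the string operand, leaving the operator as it was.
            new_text = m.group("text").replace(wrong, right)
            token = (stream[m.start():m.start("text")]
                     + new_text
                     + stream[m.end("text"):m.end()])
        out.append(token)
        pos = m.end()
    out.append(stream[pos:])
    return b"".join(out)


# Hypothetical content stream: one invisible OCR line and one visible line.
sample = b"BT /F1 12 Tf 3 Tr (the look is old) Tj ET\nBT 0 Tr (look here) Tj ET"
print(fix_hidden_text(sample, b"look", b"book").decode("ascii"))
```

Only the invisible string is changed; the visible text and the page image are left alone. Getting the content stream out of the file and writing it back still needs a PDF library or tool, which is outside this sketch.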
Be well...