These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

Converting Word to text searchable PDF

BKalker
Registered: Feb 8 2011
Posts: 3

Hello people-who-are-smarter-than-I,
 
I have done a significant amount of searching here, but I can't seem to get the answer I need.
 
Using: Adobe Acrobat X Pro
 
When clicking "Recognize Text>In this File" I get the error that Acrobat could not perform recognition (OCR) or this page because:
This page contains renderable text
 
It seems the only way possible for me to convert a 35 page all text word document in to a text searchable PDF:
Create the PDF, then convert to TIFF, THEN OCR again.
 
This makes this document grainy, the logos are pixilated and this looks like garbage!
 
There has to be a cleaner solution that I just haven't found yet.
 
Please help me? Please?

My Product Information:
Acrobat Pro 10.0, Windows
gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4307
How exactly and specifically was the PDF created?

Printing to the Acrobat Printer, using PDF Maker in MS Office and processing a PS file type through distiller all create a text search-able PDF.

Exporting to PDF from OpenOffice.org also creates a searchable PDF.

Can you post a link to a sample file?

Open the Adobe JavaScript console and run the following code:

var cnt=0;
for (var p = 0; p < this.numPages; p++)
cnt += getPageNumWords(p);
console.println("There are " + cnt + " words in this doc.");

The Acrobat JavaScript Console (Your best friend for developing Acrobat JavaScript) by Thom Parker.

George Kaiser

BKalker
Registered: Feb 8 2011
Posts: 3
gkaiseril wrote:
How exactly and specifically was the PDF created?Printing to the Acrobat Printer, using PDF Maker in MS Office and processing a PS file type through distiller all create a text search-able PDF.

Exporting to PDF from OpenOffice.org also creates a searchable PDF.

Can you post a link to a sample file?

Open the Adobe JavaScript console and run the following code:

var cnt=0;
for (var p = 0; p < this.numPages; p++)
cnt += getPageNumWords(p);
console.println("There are " + cnt + " words in this doc.");

The Acrobat JavaScript Console (Your best friend for developing Acrobat JavaScript) by Thom Parker.
Thanks George. You'll have to excuse my non-technical knowledge, but I thank you in advance for your patience!

Here is a link to the sample document. It is produced in Word. (click Download Now to open the document) http://www.4shared.com/document/DQ8tKIwG/ADV_Part_2A_5_16_11_Final__2_.php

I have tried print PDF'ing with both Adobe and CutePDF, then opening in Acrobat X. Same error message.
I have tried uploading and creating it in Acrobat from the raw Word doc. Same error message.

Thanks again.

gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4307
Both the "Convert PDF" and printing from MS Word using the Adobe PDF printer create a search-able PDF.

Exactly what is the wording of the error you are getting?

There are thousands of error messages, so saying an error message is of absolutely no help.

George Kaiser

BKalker
Registered: Feb 8 2011
Posts: 3
BKalker wrote:
When clicking "Recognize Text>In this File" I get the error that Acrobat could not perform recognition (OCR) or this page because:
This page contains renderable text

gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4307
Have you tried to perform a search using Acrobat's "Find", + "F", command?The message means the PDF is has text and can not be processed through the OCR process.

George Kaiser