How to create a searchable PDF file

By December 13, 2011

 



Have you ever opened a PDF file only to find that none of the information is searchable? In this tutorial, you'll learn how to convert a scanned document into a fully searchable PDF file using Acrobat X or XI. You'll also learn how to apply OCR, which stands for Optical Character Recognition. To apply OCR, the original scanner resolution must be at least 72 dpi or higher. In the Recognize Text dialog, you can set various capabilities like OCR language and PDF Output Style. There are three different PDF Output Styles you can select.

View transcript

How to create a searchable PDF file

Lori KassubaDecember 13, 2011

Have you ever opened a PDF file only to find that none of the information is searchable? Like this particular file, I’m unable to find any words within the document because it’s just a scanned image. However, if you have Acrobat X Std. or Pro it’s easy to change your document into a fully searchable file when I click OK on this dialog. Or, I can access the same command from the Tools pane, Recognize Text panel and select the In This File command.

To apply OCR, which stands for Optical Character Recognition, the original scanner resolution must be at least 72 dpi or higher. In the Recognize Text dialog you can set various capabilities like OCR language and PDF Output Style. There are three different PDF Output Styles you can select, I’m going to go ahead and use Searchable Image. This will ensure that the text is searchable and selectable, for cut and paste operations. This particular PDF Output style will also place an invisible text layer over the original image.

At this point you can also downsample your file to reduce the size if necessary.

The RecognizeText command now will convert your image to searchable text so that you can now search and find words in your document. Your scanned image is now fully searchable and much more usable. The Recognize Text command can also be run on multiple files or folders using the In Multiple Files command located under the Recognize Text panel.


Was this tutorial helpful?

Please Log in to provide feedback on this tutorial.

Rate this tutorial

Please Log in to rate this tutorial.

Rating:

Did you know?

  • You can ask a question and get an answer from one of our experts.
  • You can search our database of over 800 tutorials by product and/or topic.
  • You can leave a comment below for the author of this tutorial.

Products covered:

Acrobat X ProAcrobat X StandardAcrobat X Suite

Related topics:

Scanning & OCR

Top Searches:

Create PDF, Scan to PDF, Electronic signatures, OCR PDF

6 comments

Lori Kassuba

6, 2014-07-01 01, 2014

Hi Bell,

You can set the primary OCR language but it is limited to those listed under the Recognize Text General Settings. There is a list of these languages in this discussion:
http://answers.acrobatusers.com/acrobat-xI-supports-languages-OCR-q67192.aspx

Thanks,
Lori

Bell

3, 2014-06-30 30, 2014

Is it possible to use this OCR for scanned papers, if on the paper there are small symbols (like small images, drawn with black, exactly like the letters) and the text is written not only horizontally, but also in the vertical direction?
Thank you!

Lori Kassuba

11, 2014-06-25 25, 2014

Hi Amanda,

This error appears if the PDF document already contains some type of editable text. So, perhaps the PDF is a mix of scanned images and others. If you really need to OCR a specific page, you could export that page as a TIFF image and then run OCR and replace the page again. However, you might lose a bit a quality especially if the page contains any vector information.

Thanks,
Lori

Amanda

2, 2014-06-23 23, 2014

Thanks for the video.

I am currently trying to make a 200 page PDF searchable and am finding that some pages pop up with the error that Adobe cannot perform OCR because “the pages contain renderable text”.

Is there a way to fix this?

Thanks.

Joe

11, 2014-04-16 16, 2014

Spot on! Thanks!

Inderpal Singh

1, 2013-07-05 05, 2013

Thanks for video.

Lori Kassuba

9, 2013-04-24 24, 2013

Yes, just be sure to run the Recognize Text command on your images to create searchable text.

Thanks,
Lori

gary

3, 2013-04-18 18, 2013

I scanned my book pages as a JPEG and then converted it to PDF. Will I still be able to convert it to a searchable PDF file?

Pam

3, 2012-02-21 21, 2012

My text become blurry sometimes after optimizing my scanned document.
After I run “Optimize Scanned PDF” a scanned document, the text becomes blurry when it was clear before I ran the “Optimize Scanned PDF”.

Any suggestions?

Pam,

Try modifying your settings in the Optimize Scanned PDF dialog in the compression area - it sounds like you’ve lost some quality here.

Lori

Leave a reply:

Have an urgent question? Post your question to our Ask an Expert forum for a faster response.

Fields marked with * are required.

Download
Acrobat XI trial

Get the trial now

Learn how to
edit PDF.

Get started