These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

Convert scanned pdf doc to editable Word

gyy
Registered: Dec 20 2007
Posts: 6

I have a doc which was scanned to pdf.
I need to convert this pdf to an editable Word doc. There are images- a straight 'Save As' to Word gave me an output of only un-edtiable Word.
I completed the OCR Operation. However, when I 'Save As' to convert to Word, i get an error msg saying it cannot complete the task. How can i convert this OCR op completed doc to editable Word?

My Product Information:
Acrobat Pro 7.0.0, Windows
pddesigner
Registered: Jul 9 2006
Posts: 858
Hi gyy,

If you don’t have the original file from which a PDF was created, you can save the PDF as a Word document that you can then edit in Word.

Click Export in the Tasks toolbar, and then choose Word Document.
Click Settings to set conversion options.

Note: When you save a PDF to Word format, the resulting file isn’t equivalent to a file created in Word; some coding information may be lost.

My favorite quote - "Success is the ability to go from one failure to another with no loss of enthusiasm.

gyy
Registered: Dec 20 2007
Posts: 6
Your response does not answer my question - I actually mentioned in my orig question that when the orig PDF doc is converted to Word, it only comes as an image. Can you pls advise further?
pddesigner
Registered: Jul 9 2006
Posts: 858
Review these Acrobat articles:

[url=http://kb.adobe.com/selfservice/viewContent.do?externalId=325262&sliceId=2]http://kb.adobe.com/selfservice/viewContent.do?externalId=325262&sliceId=2[/url][url=http://www.adobe.com/designcenter/acrobat/articles/acr7ct_createpdf.php]http://www.adobe.com/designcenter/acrobat/articles/acr7ct_createpdf.php[/url]

[url=http://www.adobe.com/designcenter/acrobat/articles/acr7qtsuspects.php]http://www.adobe.com/designcenter/acrobat/articles/acr7qtsuspects.php[/url]

++++++++++++++++++++++++++++++

To convert a pdf image document to a searchable image or to text and graphics, do the following:

1. With the scanned document open, select Document>Recognize Text Using OCR>Start.Acrobat will present you with the Recognize Text dialog box.

2. Choose the radio button corresponding to the pages you want converted to a searchable image.
Most of the time, you will probably choose All pages.

3. Click the Edit button.
Acrobat will present you with the Recognize Text Settings dialog box, on the next page.

4. In the PDF Output Style popup menu, select the type of conversion you want Acrobat to perform. Your options are:

• Searchable Image (Exact) - Acrobat creates a searchable image using a very accurate algorithm to convert the text.

• Searchable Image (Compact) - Acrobat creates a searchable image, but uses a faster and slightly less-accurate ocr method.

• Formatted Text & Graphics - Acrobat replaces each full-page image with real text and line art, doing its best to match the original font.5. Select a language to be used in the ocr conversion.
Acrobat 7 provides a list of a dozen or so languages from which you may choose.

6. Pick a resolution to which Acrobat can reduce the image when doing the ocr.
When Acrobat performs ocr, it reduces the resolution of the image to speed up the processing. The lower the downsampled resolution, the faster the processing, but the less-accurate will be the character recognition. I’d leave this at 600 dpi, myself.

6. Click the OK button to return to the Recognize Text dialog box.

7. Click the OK button to start the ocr process.

Acrobat will process the bitmapped pages.
If you chose to make a searchable image, the resulting pdf pages will look the same as they did before, but now all of the text tools will work on the apparently bitmapped text.

You can search the text, select it with the Select tool, and copy the selection. Of course, what the text tools are really operating on is the underlying text layer.
If you chose to convert the document to text and graphics, you will be looking at a document with real text and line art.

My favorite quote - "Success is the ability to go from one failure to another with no loss of enthusiasm.

daka630
Expert
Registered: Mar 1 2007
Posts: 1420
The error message is the key.
Find out what caused it. Then you can fix it.

Be well...

ingber
Registered: Oct 27 2008
Posts: 5
I upgraded to Acrobat 9 Pro Extended. However, the .pdf to .doc/.rtf is still quite terrible under Acrobat --at least for me. I create very good pdf documents from groff, but many publishers now require .doc submissions. Acrobat (to be fair, as well as just about every other tool I've tried -- but those aren't as expensive :)!) do not do a good job at all. Equations (simple Greek fonts?) and placement and fonts are not at all well converted. A sample example is
http://ingber.com/smni06_ppi.pdf .

Lester

http://www.ingber.com

TDaMoose
Registered: Jan 8 2009
Posts: 7
I started with a scanned .pdf file which looked to me like an image.. Then I selected Recognize Text Using OCR. The result was a document with one or more characters surrounded by a red box. I have no Idea what that means. I then needed to copy the page or document to a Word for Mac 2004 or 2008 document in order to edit and search it. I tried to use Select All but, only a portion of the document was selected. There were a series of areas each surrounded by a light blue box. When I chose i boxed area and used Select all, only the text within the box was selected. But, I could not select the entire page of the document. When I try to select any portion of the document, I get the standard vertical cursor but I cannot use it to drag and select.

My questions are:

1) What do the red outline boxes surrounding characters mean?

2) How do i select all so I can then transfer the text to a word document?

Thank you,

Tom Casselman
cassco [at] c-fourth [dot] com
rbogie
Registered: Apr 28 2008
Posts: 432
instead of exporting the OCR'd pdf to Word, experiment with this: convert the pdf to TIFF. open TIFF in MS Office Document Imaging (an app that comes with MS office); on tools menu apply "send text to MS Word"
gsmith60
Registered: Feb 15 2009
Posts: 1
Hi. I'm using Acrobat 6. Like the folks posting before, I'm trying to convert a scanned PDF document into editable Word or RTF.

The Acrobat version I'm running doesn't seem to offer the options suggested above by "pddesigner", ie. I see no option along the lines of:

Document>Recognize Text Using OCR>StartInstead, under Documents, I see Paper Capture.

Suggestions?

Thanks.
daka630
Expert
Registered: Mar 1 2007
Posts: 1420
gsmith60,
Use "Paper Capture"; this provides OCR.
Note, that with Acrobat 6, there is a 50 page limit.
You may have to process your PDF in increments.

Be well...

Be well...

nanolab
Registered: Nov 5 2009
Posts: 2
You can try to use this [url=http://www.newocr.com/]online OCR[/url] service. It's works good for scanned images with resolution > 150 dpi.
avatarchan008
Registered: Dec 25 2009
Posts: 1
you can try the Tweak PDF to Word Converter 3.0, it is a little program to convert PDF to editable Word for easier editing. The converter enjoys an exact conversion with the output Word retaining intact of all the original features of the PDF, including layout, image positioning, font, graphics, hyperlinks, etc. Encrypted PDF files can be converted to Word, too. here the link:http://www.tweakpdf.com/pdf-to-word.php
TonyPotter
Registered: Feb 1 2010
Posts: 85
Try to use OCR Terminal, which is an online OCR service that converts scanned images into searchable text. Register an account and you can convert 20 pages for free every month.
http://www.ocrterminal.com/
And for the normal PDF, if you want to convert it to Word or Text, try to use AnyBizSoft PDF to Word Converter, which is totally free. The function is superb, have a try!
http://www.anypdftools.com/pdf-to-word.php#201

I will try my best to help you in PDF converison fields, objectively and Neutral.

debrah.h48
Registered: Feb 5 2010
Posts: 2
Get these 59 great features at your fingertips when you license Able2Extract Professional:

1. Convert PDF to Excel and retain the row/column structure of the PDF table within Excel.
2. One click of a button converts PDF files into formatted excel spreadsheets quickly and easily.
3. Convert Image (scanned) PDFs into formatted Excel spreadsheets.
4. The PDF to Excel conversion options supports a variety of different languages, including English, French, Spanish and German, to name just a few.
5. The PDF to Excel conversion is done effortlessly through our easy to use user interface.
6. Converting from PDF to Excel will preserve your page layout. You can view all the PDF pages in one Excel workbook or, in the alternative, one PDF page per Excel workbook.
7. The PDF to Excel conversion feature of Able2Extract is not a plug-in. It does not require any Adobe® software product, such as Acrobat®, to view and convert the PDF.
8. The PDF to Excel conversion output supports most Windows and Office platforms, including - 98/ME/NT/2000/XP.
9. An option in the PDF to Excel conversion lets users select whether to replicate the font of the PDF or not.
10. PDF to Excel converters can choose the custom Excel feature and designate their own columns.
11. Custom PDF to Excel conversions can be saved with the new template saving capability.
12. Our PDF viewer is included so you can see what you are converting from PDF to Excel!
13. Convert to Excel and it will save as an XLS file.
14. Get more control from the Custom PDF to Excel conversion option.
15. Convert PDF to Word and preserve the original layout of your PDF in an editable Word document.
16. Convert Image (scanned) PDFs to editable Word documents.
17. Several PDF to Word conversion options are available. Complex PDF to Word conversion - preserve the originality of converted the PDF to Word by identifying paragraphs, text labels, graphics, tables, and flow of columns etc. and then replicating it within Microsoft Word. Simple PDF to Word conversion - converts text from the PDF to Word document without the graphics.
18. PDF to Word variety – convert PDF to Rich Text Format (RTF) or Word, it is up to you! The PDF to RTF feature allows users to convert large documents faster than the PDF to Word option.
19. PDF to Word Efficiency and Selectivity. Pinpoint selection ability. Since you can see what you select, there is no need to transfer whole documents or even whole pages at a time. Take one line of text if you want!
20. The PDF to Word conversion is processed at a very high speed.
21. PDF to Word conversion size options – Convert the whole document, a range of pages, one page or a portion of a page – it is your choice!
22. PDF to Word output supports most Windows and Office platforms - 98/ME/NT/2000/XP.
23. Our PDF viewer is included so you can see what you pages or portions of a page you are converting from PDF to Word!
24. Choose your PDF to Word output format – Both .Doc and .RTF output formats are supported.
25. Convert PDF to PowerPoint (PPT) and improve your presentations.
26. See what you are converting. Our proprietary PDF viewer lets you view your PDFs just as you would with Adobe Acrobat Reader.
27. Retain PDF graphics in HTML.
28. Text from PDF is available in HTML.
29. Convert PDF to Image formats
30. Convert PDF to TIFF
31. Convert PDF to JPEG
32. Convert PDF to GIF
33. Convert PDF to BMP
34. Convert PDF to PNG
35. Our PDF viewer lets you view your PDF documents at different sizes by zooming in and zooming out.
36. Rotate landscaped PDFs to portrait view for easier viewing and converting
37. Convert HTML pages into Excel spreadsheets for easy data analysis.
38. Convert HTML to editable Word documents
39. HTML can be converted to Text with Able2Extract Professional
40. Convert Text documents into formatted Excel spreadsheets.
41. Convert Text documents to .doc and .rtf format
42. Text files can be transported into HTML files.
43. Simple PDF to Text (.txt) conversion comes included.
44. PDF to HTML conversion for use on web pages comes included.
45. Select your PDF using a variety of options, including: using the mouse, by selecting all on page, by selecting a page range.
46. Able2Extract Professional converts both PDF and XPS documents
47. Converts XPS to Word
48. Convert XPS to Excel
49. Convert XPS to PowerPoint
50. Our XPS conversion has the ability to handle and convert scanned XPS documents
51. Control over Image based and non-image based conversions. Perfect for situations in which the PDF for conversion is a blend of image PDF and native PDF pages.
52. Get pinpoint conversion accuracy that allows you to convert any portion of a page that your require. No need to convert a whole page at a time if it is not required.
53. Convert PDF to the DWG (drawing), the format used for storing two and three dimensional design data and metadata. Used in CAD programs.
54. PDF to DXF conversion. Autodesk DXF (drawing interchange format) is the format adopted by Autodesk to ensure data interoperability between Autodesk and other formats.
55. PDF to ODT format for use in Open Office Writer (the MS Word equivalent)
56. PDF to ODS format for us in Open Office Calc (the MS Excel equivalent)
57. PDF to ODP for use in Open Office Impress (the MS PowerPoint equivalent)
58. Easy batch conversion of PDF documents into other output formats such as Word, Excel and PowerPoint and the other available conversion formats.
59. Right click attachment PDF conversion integration in MS Outlook.
honymumu
Registered: Mar 10 2010
Posts: 1
Why not use advanced pdf to word 5.0 ,it costs little. it is a pretty useful tool to convert PDF files to Word Documents,Converted quality is fairly good: with text content, page layouts, images,and text hyperlinks preserved.Anyway,i am using it,you can try to test wether it true or not.
http://www.blog.advancedpdfconverter.com/
caribbean pirate
Registered: Mar 18 2010
Posts: 1
Convert PDF to GIF and other image formats, it is your best choice to take this PDF to GIF converter. This PDF to GIF converter is specially designed for PDF lovers to convert your PDF files to GIF and other image formats like JPEG, PNG, BMP, PCX, TIFF, etc.

http://convertpdftogif.net/
Brandon
Registered: Mar 19 2010
Posts: 3
Thanks for links.
bolivarbill
Registered: May 18 2010
Posts: 3
I am using Adobe Pro 8 and have tried to change a large format pdf into a jpeg. But I get an error that says the file is too wide to export. Does any one have a solution.
gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4307
[url=www.openoffice.org/]OpenOffice.org[/url] has a plug in that allows the import of a PDF into an OpenOffice.org document.

George Kaiser

rbogie
Registered: Apr 28 2008
Posts: 432
zing wrote:
I'm trying to convert a scanned PDF document into editable Word or RTF.
the answer is in this thread, see post of 2009-01-10 03:52:34: instead of exporting the OCR'd pdf to Word, experiment with this: convert the pdf to TIFF. open TIFF in MS Office Document Imaging (an app that comes with MS office); on tools menu apply "send text to MS Word"
Velly
Registered: Jun 3 2010
Posts: 3
Thanks!!!
omay
Registered: Jun 5 2010
Posts: 1
hi,gyy.I think you require a converter. I'm using a pdf to word converter from http://www.pdftojpeg.biz/convert_pdf_word.htm which can support batch conversion. Moreover, free trial for 30 days!Amazing!

PDF to JPEG Converter http://www.pdftojpeg.biz/ can easily convert PDF to JPEG and to many other image formats