These forums are now Read Only. If you have an Acrobat question, ask questions and get help from one of our experts.

Pulling Data from PDF's

furniturealliance
Registered: Nov 7 2008
Posts: 4

I have a 10,000 page document. All pages are laid out exactly the same. There is an address field on all pages with an address. I need some way of pulling each address from each page in the document into a single file or just extrapolation in general.

If anyone knows of a way to do this using Acrobat Professional 8 or with any program really, this information would be greatly useful.

Thankyou.

My Product Information:
Acrobat Pro 8.0, Windows
thomp
Expert
Registered: Feb 15 2006
Posts: 4411
Do you mean that there is an actual form field for the address, or that there is just a consistent location on the page where the address appears.

If there is an actual form field then yes, of course, this data can be easily extracted.

But if you mean the address is just in a specific location, it takes a little bit more work, but it can be done. There is a function in Acrobat for getting words from the page, "doc.getPageNthWord()" and a companion function for getting the word's location, "doc.getPageNthWordQuad()". Together these functions can be used to extract information from a specific location on the document.

Not a trivial scripting Job.

Thom Parker
The source for PDF Scripting Info
[url=http://www.pdfScripting.com]pdfscripting.com[/url]

The Acrobat JavaScript Reference, Use it Early and Often
[url=http://www.adobe.com/devnet/acrobat/]http://www.adobe.com/devnet/acrobat/[/url]

Thom Parker
The source for PDF Scripting Info
www.pdfscripting.com
Very Important - How to Debug Your Script

furniturealliance
Registered: Nov 7 2008
Posts: 4
I mean that there is a consistent location on the page.

How would you suggest I go about the scripting? This is really my first time jumping into something of this nature with Professional Acrobat.

Thanks for responding.
thomp
Expert
Registered: Feb 15 2006
Posts: 4411
Scripting the extraction of the addresses from the PDF will save you unbelievably enormous amounts of time. What might take you weeks to do by tedius hand transcription, will take only a couple of hours with JavaScript, and all you have to do is push a button.

So, it is well worth the effort to spend a few days writing the script. However, if you are not already a programmer you need to hire a programmer. This is not one of those easy copy and paste programming task, and requires a bit of knowledge about PDF page geometry.

Pop me a note and I can give you a recomendation.

Thom Parker
The source for PDF Scripting Info
[url=http://www.pdfScripting.com]pdfscripting.com[/url]

The Acrobat JavaScript Reference, Use it Early and Often
[url=http://www.adobe.com/devnet/acrobat/]http://www.adobe.com/devnet/acrobat/[/url]

Thom Parker
The source for PDF Scripting Info
www.pdfscripting.com
Very Important - How to Debug Your Script

try67
Expert
Registered: Oct 30 2008
Posts: 2399
furniturealliance, if you're looking for such a script, contact me by email.

- AcrobatUsers Community Expert - Contact me personally at try6767 [at] gmail [dot] com
Check out my custom-made scripts website: http://try67.blogspot.com