Answered
Hi,
I'm using Acrobat Pro 9 exporting a PDF-file whith empty pages to text. The empty pages are left out in the text file - there is no page-brake inserted into the text file for those pages. Therefore I can not use the text-file as a search index in another program, since it has another page count than the original PDF :-(
Isn't there any way to tell Acrobat to keep the empty pages upon export?
Does anyone know another export format which containes the text and page format in one file?
Why is the XML-Export not containing any page-brakes?
Thanks for any advice ;-)
Griddie
XML is a data format. The PDF is decomposed into parts and organized in the XML tags so that it can be inteligently extracted, rebuilt, or repurposed at some later time. I'm pretty sure there is a way to tell which page a piece of text was on.
Thom Parker
The source for PDF Scripting Info
[url=http://www.pdfScripting.com]pdfscripting.com[/url]
The Acrobat JavaScript Reference, Use it Early and Often
[url=http://www.adobe.com/devnet/acrobat/javascript.php]http://www.adobe.com/devnet/acrobat/javascript.php[/url]
Then most important JavaScript Development tool in Acrobat
[url=http://www.pdfscripting.com/public/34.cfm#JSIntro][b]The Console Window (Video tutorial)[/b][/url]
[url=http://www.acrobatusers.com/tutorials/2006/javascript_console][b]The Console Window(article)[/b][/url]
Thom Parker
The source for PDF Scripting Info
www.pdfscripting.com
Very Important - How to Debug Your Script