
Acrobat X: Combine multiple local html files into pdf but exclude html header, footer

fionamayer@roge...
Registered: Feb 17 2011
Posts: 7

How can you 'Combine files into PDF' using HTML pages and get Acrobat to exclude the headers and footers from each HTML page? Or is there a scripted way to create a PDF file from a list of HTML pages but only use selected sections? I'd like to set up a batch file to generate PDFs of text-heavy web content for client distribution (without headers/footers being reproduced on every page). A batch file would enable the PDF to be recreated whenever the web info is updated. Any ideas?

My Product Information:
Acrobat Pro 10.0.1, Windows
gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4307
I would start by omitting the headers and footers when the PDFs are created from the web pages. You need to check each application used to create the PDF to make sure it is not adding a header or footer.

After the fact, use the Redaction tool. This is a multi-step process so make sure you complete all the actions.

George Kaiser

fionamayer@roge...
Registered: Feb 17 2011
Posts: 7
I'm using Acrobat X. The header/footer is part of the web page content. I just want to pull in the content of the page, so exclude the banner/header. There are too many pages to do this manually. Redaction just blacks it out.
gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4307
The Redaction tool has many options available. One can select the color (including no color) or the cover text to use. There are a number of ways to identify the material to redact. You can even identify the area to redact and use JavaScript to mark that area on every page, then apply redaction to the marked areas for the entire document. See "Automating Redaction with Acrobat JavaScript" by Thom Parker.

George Kaiser

daka630
Expert
Registered: Mar 1 2007
Posts: 1420
Something that may help, using Acrobat X:

File > Create > PDF from Web Page
The Create PDF from Web Page dialog displays.
Click the "Settings" button.
The Web Page Conversion Settings dialog displays.
Untick "Place headers and footers on new page".

Be well...

fionamayer@roge...
Registered: Feb 17 2011
Posts: 7
All good points, but they refer to the header/footer of the printed page, not the coded web page. For instance, go to the http://www.latimes.com/ main page. I'm referring to the header: "Los Angeles Times" with the date and menu underneath. The footer is the "Los Angeles Times" with links to all the smaller newspapers at the bottom of the web page. I want to exclude this information from the PDF when it's created. What I'm asking is, is there a way to extract the content between the header and the footer automatically using Acrobat, through scripting or some other method? Another simplistic example: you have HTML5 pages coded up as . Is there some way to direct Acrobat X to exclude from all pages therefore just leaving the which is the main content. Hope that explains it better... ; )
fionamayer@roge...
Registered: Feb 17 2011
Posts: 7
LOL, the forum strips the angle brackets around code, so the line should read: HTML5 pages coded up as "header" "article" "footer" ... and therefore just leaving the "article", which is the main content. Sorry!
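
If Acrobat cannot be told to skip those elements, one possible workaround is to pre-process the local HTML files before handing them to 'Combine files into PDF'. The following is only a rough sketch in Python (not an Acrobat feature). It assumes the pages really do wrap their banner and bottom links in HTML5 "header" and "footer" elements and that the files are UTF-8. The names strip_header_footer and clean_folder are made up for illustration.

```python
from html.parser import HTMLParser
from pathlib import Path

# Assumption: the unwanted banner and bottom links live in these HTML5 elements.
SKIP_TAGS = {"header", "footer"}


class HeaderFooterStripper(HTMLParser):
    """Re-emit the HTML it is fed, omitting everything inside SKIP_TAGS."""

    def __init__(self):
        # Keep entities such as &amp; untouched so they can be written back as-is.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.depth = 0  # greater than 0 while inside a skipped element

    def _emit(self, text):
        if self.depth == 0:
            self.out.append(text)

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.depth += 1
        else:
            self._emit(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self.depth = max(0, self.depth - 1)
        else:
            self._emit(f"</{tag}>")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are written back once, unchanged.
        if tag not in SKIP_TAGS:
            self._emit(self.get_starttag_text())

    def handle_data(self, data):
        self._emit(data)

    def handle_entityref(self, name):
        self._emit(f"&{name};")

    def handle_charref(self, name):
        self._emit(f"&#{name};")

    def handle_comment(self, data):
        self._emit(f"<!--{data}-->")

    def handle_decl(self, decl):
        self._emit(f"<!{decl}>")


def strip_header_footer(html):
    """Return the HTML string with header and footer elements removed."""
    parser = HeaderFooterStripper()
    parser.feed(html)
    parser.close()
    return "".join(parser.out)


def clean_folder(src_dir, dst_dir):
    """Write a stripped copy of every .html file in src_dir into dst_dir."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    for path in sorted(Path(src_dir).glob("*.html")):
        cleaned = strip_header_footer(path.read_text(encoding="utf-8"))
        (dst / path.name).write_text(cleaned, encoding="utf-8")
```

Calling clean_folder("site", "site_clean") (both folder names are placeholders) would leave cleaned copies in the second folder, and those copies could then be combined in Acrobat as usual. Re-running it whenever the web content changes would cover the batch requirement. Pages that mark up their banner with plain div elements instead would need a different rule.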
gkaiseril
Expert
Registered: Feb 23 2006
Posts: 4307
Have you read Thom Parker's tutorial about using JavaScript to redact data?

George Kaiser

Merlin
Acrobat 9 Expert Team
Registered: Mar 1 2006
Posts: 766
fionamayer [at] rogers [dot] com wrote:
I want to exclude this information from the pdf when it's created. What I'm asking is, is there a way to extract the content between the header and the footer automatically using Acrobat
Search for "Convert part of a web page to PDF" on this page:
http://help.adobe.com/en_US/Acrobat/9.0/Professional/WS58a04a822e3e50102bd615109794195ff-7f60.w.php

It was made for you.
;-)