I have a number of scanned and OCR'd pdf documents. Due to an unusual requirement, I need to ensure that a particular name is not included in the index. For example - let's assume the name is "Smith". I want the entire document to be indexed and searchable (including the pdf search - but primarily google search) - but a search for the word "Smith" must be unsuccessful. Is there any way to accomplish this (short of modifying every occurance in the original documents)?
To exclude a word from a Catalog index go into Acrobat Preferences prior to building the Catalog index.
For the category "Catalog" - click the "Stop Words..." button.
In the Stop Words dialog, enter the Word(s). "Add" each.
This builds list of words to exclude from the index.
However, any user of Adobe Reader or Acrobat may perform a "Find" that would show the presence of the word(s).
As to Google search of a PDF.
If the PDF can be cataloged by Google, then all content is cataloged.
To assure a word or a text string will not be available use of the Redaction tool may be called for.
The redaction tool, used properly, will fully remove the word or text string selected.
Thus, nothing for a search engine to harvest.
Be well...
Be well...