We're trying to figure out how to distinguish between the content of different PDFs.
We've identified three types:
1. PDFs created with a "distiller" of some kind.
2. PDFs that are image only.
3. PDFs that are images with OCR text.
Depending on the type of PDF, we want to route the files through different parts of our production system.
Presumably we'll need to look at the internals of the PDFs to figure this out.
Thanks.
Ira
If the latter, Document properties will show you what made the file. Text that's been OCRed can be selected with the multi-select tool, whereas unprocessed images of text can't.
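
If you want to automate the same checks, here is a rough Python sketch using only the standard library. It scans the raw bytes of the file for a /Producer entry, font resources, and image XObjects. The function name classify_pdf_bytes, the category labels, and the default distiller_hints are my own assumptions for illustration, not anything from your system. It will miss entries stored in compressed object streams or written as hex/UTF-16 strings, and a distilled file with embedded figures from an unrecognized producer will come out as "image+ocr", so treat it as a first-pass filter only. A proper PDF library would be more reliable.

```python
import re

# Hypothetical labels for the three kinds of PDF described in the question.
DISTILLED = "distilled"
IMAGE_ONLY = "image-only"
IMAGE_WITH_OCR = "image+ocr"
UNKNOWN = "unknown"

# Patterns matched against the raw (uncompressed) parts of the file.
PRODUCER_RE = re.compile(rb"/Producer\s*\(([^)]*)\)")
FONT_RE = re.compile(rb"/Font\b")
IMAGE_RE = re.compile(rb"/Subtype\s*/Image\b")


def classify_pdf_bytes(data, distiller_hints=("distiller",)):
    """Guess the PDF type from raw bytes (heuristic, not a real parser).

    distiller_hints is an assumed list of lowercase substrings that mark a
    Producer string as a distiller; adjust it to the tools you actually see.
    """
    match = PRODUCER_RE.search(data)
    producer = match.group(1).decode("latin-1").lower() if match else ""

    has_font = FONT_RE.search(data) is not None
    has_image = IMAGE_RE.search(data) is not None

    # A recognized distiller in the Producer field wins outright.
    if any(hint in producer for hint in distiller_hints):
        return DISTILLED
    # Images but no fonts: nothing selectable, so a bare scan.
    if has_image and not has_font:
        return IMAGE_ONLY
    # Images plus fonts: assumed to be a scan with an OCR text layer.
    if has_image and has_font:
        return IMAGE_WITH_OCR
    # Fonts and no images: text-native, so treated as distilled.
    if has_font:
        return DISTILLED
    return UNKNOWN


def classify_pdf_file(path):
    """Read a file from disk and classify it with the heuristic above."""
    with open(path, "rb") as handle:
        return classify_pdf_bytes(handle.read())
```

Calling classify_pdf_file on each incoming file gives you a label you can route on, with the "unknown" results sent for a manual look.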