When using the ROL feature, I prefer to use the Select tool to read a particular paragraph instead of letting it read through the entire page. For some files, ROL reads only a single word or part of a word when I click the paragraph with the Select tool. I suspect the problem comes from the file not having been tagged. Tagging it sometimes solves the problem, but not consistently. Does anyone have a solution to this problem? Thanks.
Yes. When ROL is asked to parse a single line, it will stop at certain characters in that line.
Some examples of what I've encountered are:
--| a nonbreaking space
--| a discretionary hyphen
Such characters, in a PDF, would have come from the authoring file where the content author inserted them.
This is typically not a problem when you have ROL parse the entire page.
For a scanned image of text, OCR has to be applied first. If Searchable Image or Searchable Image (Exact)
is used, then depending on the quality of the source paper's imprint, the OCR'd text can contain some funky stuff at times.
These can be speed humps for ROL when it parses a single line.
If using either of the OCR methods above, a quick check is to run OCR, save the PDF, and then use Save As to export a text file.
The text file's content is highly indicative of what you'll get from ROL (with the "funky" characters being potential speed humps).
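If eyeballing the exported text file is tedious, a short script can point at the suspect characters. This is only a rough sketch under some assumptions of mine: that the export is UTF-8, and that the nonbreaking space and discretionary hyphen show up in it as U+00A0 and U+00AD. The function names and the catch-all for control and private-use characters are my own, not anything Acrobat provides.

```python
import unicodedata

# Characters mentioned above, mapped to the code points they are assumed
# to have in the exported text file.
SUSPECTS = {
    "\u00a0": "NO-BREAK SPACE",
    "\u00ad": "SOFT HYPHEN (discretionary hyphen)",
    "\ufffd": "REPLACEMENT CHARACTER (undecodable byte)",
}

# Unicode categories treated as "funky": control, format, private use,
# unassigned. This catch-all is a guess at what OCR debris looks like.
ODD_CATEGORIES = {"Cc", "Cf", "Co", "Cn"}


def find_suspect_chars(text):
    """Return (line, column, description) for each suspect character."""
    hits = []
    for line_no, line in enumerate(text.split("\n"), start=1):
        for col, ch in enumerate(line, start=1):
            if ch in SUSPECTS:
                hits.append((line_no, col, SUSPECTS[ch]))
            elif ch not in "\t\r" and unicodedata.category(ch) in ODD_CATEGORIES:
                hits.append((line_no, col, f"U+{ord(ch):04X}"))
    return hits


def scan_file(path):
    """Print every suspect character found in the text file at `path`."""
    # UTF-8 is an assumption; bytes that fail to decode become U+FFFD.
    with open(path, encoding="utf-8", errors="replace") as handle:
        for line_no, col, what in find_suspect_chars(handle.read()):
            print(f"line {line_no}, column {col}: {what}")
```

Calling `scan_file()` with the path of the exported text file lists each hit by line and column. A clean result does not guarantee ROL will behave, but any hits are good candidates for the places where single-line reading stops.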
Be well...