I have a PDF with hidden text which contains some hebrew characters. I tried to highlight all the hidden text using "ctrl A", then copy and pasted it to a word document/notepad. Unfortunately, it generated some extra/unwanted spaces in the word document/notepad i.e "Adobe" becomes "A d o b e".
I examined the hidden text in the PDF and I did not find any of these spaces. Can somebody help me why this is happening and How should I correct it?
"Hidden text" connotes OCR. If so, what you've copied-pasted is what is.
Clean up post processing in Word or NotePad, for the most part manually, becomes necessary.
If the "hidden" text is actually renderable text but with a font color of white on a white background what you have reflects how the source file to the PDF was authored. Again, going back to a text editor, with your PDF, will require some measure of post processing by you.
Be well...
Be well...