Hi all,
I have historic documents (German and English) that I want to OCR so that the text is searchable *without* changing its appearance. I've tried with previous versions of Acrobat where it did not quite work and thought I give Acrobat XI (Windows 7) a try.
I use "searchable image" in the correct language ("Clearscan" and "exact" are not useful here). The ocr'ed text is in a hidden layer. Since an old-fashioned font is used, the ocr result is expectedly faulty. So I need to correct those results which is where the problems arise. The little sub-menu allows me to look for "problem areas" which are then marked in red. The individual entries can then be corrected one by one. However, these changes do not always seem to be transferred to the hidden layer. This is evident either from trying a search for the term (ctrl f), or exporting to Word. Both yield the original, not the corrected, ocr result.
Second problem: Once I mark a problem area as solved, there is no way to access that word, other than starting all over again.
Third problem: The keyboard shortcuts in the submenu don't always work.
Improving the scan quality is no solution because some older characters have no equivalent anyway.
The only solution seems to me to access the hidden text in some way and edit directly there. I did not find any mentioning of that in Acrobat's help or the forums, however. So I expect it's still not possible?
(As an aside, I'd like to submit the problem to Adobe but don't know how)