Option to eliminate words with mixed number and letters in OCR
Provide a check off that functions to prevent the OCR from mixing numbers and symbols in words.
For example: Custodian should not be read Cu&t0d1an - only letters should be recognized in words.

4 comments
-
Adminrishusha (Admin, Adobe) commented
Hi Brian,
Thanks for suggestion. Given the complexity of OCR engine in Acrobat we will dig deeper into the feasibility of the proposed feature.
Thanks
Rishabh -
Adminrishusha (Admin, Adobe) commented
Hi Jon,
OCR is getting better day by day and we are working to make it work with almost all the real life documents we come up with and obtain from various resources.
This is an ongoing process as we can't check on ALL the documents in the world. Thanks for your interest in reporting about the quality of OCR. We will try our best to make it better.
Thanks
-
Jon Salternate commented
Can we just say it? The OCR is no longer anywhere near as good as it use to be.
-
Jeffrey commented
After OCR, would like spell check feature to review OCR errors beyond correct recognized text feature. OCR has errors. Also, would like to have the option to tell OCR to search but selectively remove symbols from OCR responses. many documents have no "~" or "|" (shift backslash) also would like to eliminate the search for British pound symbol. The user would then selectively help the OCR to eliminate some responses.