OCR Scan showing odd artifacts, numbers clipping into one another, and spaces and/or dashes in the underlining.
We have to use Editable Text and Image for our national database to add our header on the top of pdf documents filed in our database.
Regardless of scanner used, OCR is not is not seeing the underlines as a solid line. It looks like a bunch of underscores. This results in the numbers above it to become distorted. Sometimes the amounts are not readable.
here are times the line comes through looking like this: _----------__
Are settings are 300 dpi, black and white document, OCR on / Editable Text and Images, front side. We have changed the dpi to higher (600), with slightly better results, but the scans take twice as long and do not always guarantee a good scan.
This problem exists on Windows 7 and Windows 10 due to the change Adobe made in your OCR process from Adobe XI Professional. Adobe support worked on my Windows 10 machine, saw it freeze my pc, he uninstalled Adobe DC and reinstalled and OCR does not work well with scans to searchable text.
Thank you for reporting the issue. Can you please provide us the details of the following:
1. Acrobat Version - Are you on reader or DC? Please share the screenshot of the Acrobat version screen
a. Go to help -> About Acrobat ... and take a screenshot
2. Operating system (Mac/Win)
3. Input PDF Files