A few questions about OCR

Discussion for the End User use of OCR in PDF-XChange Editor and Viewer


Willy Van Nuffel
User
Posts: 2386
Joined: Wed Jan 18, 2006 12:10 pm

A few questions about OCR

Post by Willy Van Nuffel »

Hello,

I have a few questions about OCR in PDF-XChange Editor:

1) Via "OCR Page(s)..." it is possible to click OK when no language has been selected.
In contrast, via "Enhance Scanned Pages" it is NOT possible to click OK when only the "Recognize text" check box is active and NO language is selected.
Is this "by design"?

2) Sometimes you have to OCR a page containing first names, last names, or words that are not in any of the available dictionaries. There may be a lot of "special characters". Let us say that a large part of the Unicode character set must be recognized as correct.
Is there a way to tell the OCR feature that all these characters must be treated as correct, even though they do not form words found in a dictionary?

Best regards
TrackerSupp-Daniel
Site Admin
Posts: 8544
Joined: Wed Jan 03, 2018 6:52 pm

Re: A few questions about OCR

Post by TrackerSupp-Daniel »

Hi Willy,
OCR is intended to function "without a language" selected; in that case it should essentially default to English. It is a good point, though, that the new function does not allow the same handling. Perhaps we should look at changing it.

On to names: in most cases names come out quite well, despite not being "words". As an example, just the other day I had a user who was OCR'ing an old Russian phone book page, and it came out perfectly. So while there is no way to expand the dictionary, I believe that when no word matches what it is interpreting, it does exactly that and defaults to showing what is present.
Dan McIntyre - Support Technician
Tracker Software Products (Canada) LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com