Training launguage

Forum for the PDF-XChange Editor - Free and Licensed Versions

Moderators: TrackerSupp-Daniel, Tracker Support, Paul - Tracker Supp, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan

Post Reply
homedweller
User
Posts: 1
Joined: Tue Apr 17, 2018 11:26 am

Training launguage

Post by homedweller »

Hello,
Now I'm using the newest version of PDF ExChange Editor plus. I was very happy to see that there are a lot of new languages by OCR. I often use and need the Latin language, but I was disappointed when I scanned a Latin page. After using OCR I copied one line after another, but the results were not satisfying. Is it possible to train that language to get better results?


Cheers,
homedweller
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17824
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Training launguage

Post by Tracker Supp-Stefan »

Hello homedweller,

Welcome to our forums, and thanks for the post!
I am afraid that no - it is not possible to train the OCR engine :(

Regards,
Stefan
User avatar
Ovg
User
Posts: 461
Joined: Tue Sep 05, 2017 4:56 pm

Re: Training launguage

Post by Ovg »

2homedweller

Try new Convert->Enhance Scanned Pages
Capture.PNG
Quality of OCR MUCH better.
It's impossible to lead us astray for we don't care even to choose the way.
PDF-XChange PRO, 10.1.1 (Build 381) / W7 SP1 x64
User avatar
TrackerSupp-Daniel
Site Admin
Posts: 8439
Joined: Wed Jan 03, 2018 6:52 pm

Re: Training launguage

Post by TrackerSupp-Daniel »

Hi everyone,
I think the main issue here is that Latin is not a Default OCR language, though it should work nicely with English selected, possibly try adding some of the other language to the mix, it may improve those results a bit.
Let us know how the new enhanced scan goes with a few more languages selected!
Dan McIntyre - Support Technician
Tracker Software Products (Canada) LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com
Willy Van Nuffel
User
Posts: 2347
Joined: Wed Jan 18, 2006 12:10 pm

Re: Training language

Post by Willy Van Nuffel »

EDIT

Questions moved to a new topic:
viewtopic.php?f=63&t=30862
Last edited by Willy Van Nuffel on Fri Apr 27, 2018 7:00 pm, edited 1 time in total.
User avatar
Ovg
User
Posts: 461
Joined: Tue Sep 05, 2017 4:56 pm

Re: Training launguage

Post by Ovg »

I made quick test with OCR Latin text - it seems not so bad

pdf file was created from png image and now contains text layer.
Attachments
Capture.PNG
Test.PNG
Latin_Test.pdf
(1.35 MiB) Downloaded 80 times
It's impossible to lead us astray for we don't care even to choose the way.
PDF-XChange PRO, 10.1.1 (Build 381) / W7 SP1 x64
User avatar
TrackerSupp-Daniel
Site Admin
Posts: 8439
Joined: Wed Jan 03, 2018 6:52 pm

Re: Training launguage

Post by TrackerSupp-Daniel »

After downloading the Latin Language package, I've found the same results as Ovg here.
Can you confirm that you have the Latin OCR language pack installed?0
https://www.pdf-xchange.com/pdf-xchange-viewer-ocr
Thank you!
Dan McIntyre - Support Technician
Tracker Software Products (Canada) LTD

+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com
Post Reply