I opened KiraTemperament.pdf → Convert → OCR Page(s) → I applied settings from 3.2 item → OK → in file with OCR layer I tried to find “темперамента” word.
Thank you for the report, You are quite correct in point 2 that the issue is related, for that matter it is the exact same issue as you described, and as the others described in the "duplicate post" that another user there mentioned.
Our "Default" OCR engine is indeed the tesseract engine, and unfortunately, if the engine itself is having these sorts of issues, there is nothing we can do from our end until it is resolved over there first.
Kind regards,
Dan McIntyre - Support Technician
Tracker Software Products (Canada) LTD
+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com
That's correct and the current release still uses the Tesseract engine for the free, default OCR. The Enhanced OCR licenses Lead Tools' technology. If you have a valid license with active maintenance, for the Editor, then you can try the Enhanced OCR for 30 days. You can switch under File --> Preferences --> OCR. If you don't have the Enhanced OCR as an option and you have a valid license, you would need to re-install Version 8 and make sure that the option to install the Enhanced OCR plugin is selected.
Thanks,
If posting files to this forum, you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded.
Thank you.
Best regards
Will Travaglini
Tracker Support (Europe)
Tracker Software Products Ltd.
http://www.tracker-software.com
Thanks for that - While there is definitely and absolutely a bug in the Tesseract engine itself, a change in the results does strongly suggest an issue in our software (unless the Tesseract libraries have changed and I'm not aware). Is there any difference at all in your OCR settings between Version 7 Build 328.1 and Version 8 Build 332?
Also, have you tried using Medium accuracy? If not, please do and see if that helps.
Thanks,
If posting files to this forum, you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded.
Thank you.
Best regards
Will Travaglini
Tracker Support (Europe)
Tracker Software Products Ltd.
http://www.tracker-software.com
2 examples; compare KiraFullOCRVersion7.pdf and KiraFullOCRVersion8.pdf for more.
3.1. “темперамента”
7:
8:
3.2. “Гремиславы”
7:
8:
4. Reasons of using PDF-XChange Editor
See my detailed Software Recommendations answer. I recommended PDF-XChange Editor precisely because I didn't had big problems with selected areas. But I can't see reasons, why users should prefer version 8, if it has the same problems as another Tesseract-based tools.
I am unsure what you would like me to say here. Much like our software has occasional updates, so too does the tesseract engine, it is entirely possible that the V7 uses an older version of the engine from before this issue was introduced. We will look into this and see if there is anything to be done from our end, but it does currently still look very likely that the issue is on the Engine side, not from our software.
I cannot say that using the tesseract engine in our software would offer anything superior to using the exact same version of the engine in any other application, I can however say that the other features our application, which can be used on the document for editing both before and after running OCR are obvious benefits. Otherwise, if you are looking for advanced functions and majorly noticeable benefits in the OCR alone, you would need to look into our new EOCR plugin, which uses LeadTools OCR engine instead of tesseract.
We will endeavour to have this issue resolved as soon as possible, but we are likely going to be stuck waiting for a new version of the tesseract engine to be made available.
Kind regards,
Dan McIntyre - Support Technician
Tracker Software Products (Canada) LTD
+++++++++++++++++++++++++++++++++++
Our Web site domain and email address has changed as of 26/10/2023.
https://www.pdf-xchange.com
Support@pdf-xchange.com