Recognition quality compared to Abbyy Finereader

Discussion for the End User use of OCR in PDF-XChange Editor and Viewer

Moderators: TrackerSupp-Daniel, Tracker Support, Paul - Tracker Supp, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan

Post Reply
User avatar
Jensen Head
User
Posts: 412
Joined: Mon Sep 13, 2021 8:12 am

Recognition quality compared to Abbyy Finereader

Post by Jensen Head »

I really like PDF-XChange Editor as a PDF editor, a set of utilities and an OCR tool. Moreover, Abbyy does not have a tool to batch add an invisible text layer to PDF documents (only batch recognition with full merging of all layers of source documents). However, the recognition quality of the engine used by Tracker Software is noticeably worse than the one currently used by Abbyy.

https://drive.google.com/drive/folders/1CjVs87-ppL9gbG-OUD9yNyLLJZFMDM1T

Do you plan to improve the recognition algorithms used in your products in the near future?
User avatar
Paul - Tracker Supp
Site Admin
Posts: 6835
Joined: Wed Mar 25, 2009 10:37 pm
Location: Chemainus, Canada
Contact:

Re: Recognition quality compared to Abbyy Finereader

Post by Paul - Tracker Supp »

Hi Jensen Head,

I will have one of the guys who read Cyrillic look at these. I cannot see much difference between them.

As I am sure you know, we use Abbyy libraries, and Abbyy do not allow third parties access to their latest and greatest, we are always a bit behind.

We are keen to improve where we can. Do you want to illustrate where you would like to see the improvement?

please and thanks
Best regards

Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17824
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Recognition quality compared to Abbyy Finereader

Post by Tracker Supp-Stefan »

Hello Jensen Head,

I did take a look at your samples - and indeed the file you provide shows some incorrect recognition, however I did get a perfect result using our Enhanced OCR (ABBYY based):
image.png
So can you please make sure that you were using the Enhanced OCR, and share the settings you tried in there?
I got the above result with these settings:
Stefan
image1.png
Kind regards,
User avatar
Jensen Head
User
Posts: 412
Joined: Mon Sep 13, 2021 8:12 am

Re: Recognition quality compared to Abbyy Finereader

Post by Jensen Head »

I re-converted the scanned page to PDF and OCR with the following settings:
_
2021-11-08_16-38-07.png
_
The text copied from the resulting document differs from the document obtained in Abbyy Finereader by only a few spaces (extra spaces at the end of lines were in the abbyy document). I am at a loss to guess what was the reason for the low quality last time. I may have chosen the wrong set of languages.Or, as you suggested, the enhanced OCR mode has been disabled. Be that as it may, I am grateful for your help. The question is closed.
Last edited by Jensen Head on Tue Nov 09, 2021 8:28 am, edited 1 time in total.
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17824
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Recognition quality compared to Abbyy Finereader

Post by Tracker Supp-Stefan »

Hello Jensen Head,

Glad to hear that you now managed to get almost identical results!
We would also consider this closed, but if you have any other questions - you can always start a new topic!

Kind regards,
Stefan
User avatar
Jensen Head
User
Posts: 412
Joined: Mon Sep 13, 2021 8:12 am

Re: Recognition quality compared to Abbyy Finereader

Post by Jensen Head »

ABBYY states on its "ABBYY FineReader Engine. The most comprehensive OCR SDK for software developers. Integrate AI-powered OCR features into your applications" page that the latest OCR engine version available for third-party applications is ABBYY FineReader Engine 12. I assume that in the latest versions of PDF-XChange in the "Enhanced" mode, FineReader Engine 12 is used. Which version of FineReader Engine used in FineReader PDF 16, and what are their significant differences for the user (if any)?
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17824
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Recognition quality compared to Abbyy Finereader

Post by Tracker Supp-Stefan »

Hello Jensen Head,

We use the FineReader version our license agreement with ABBYY allows.
I am not aware of what version they use in their own products - but it is slightly newer than the one we have access to.
Given that the Enhanced OCR is embedded in our own software - the differences will come down to recognition rate (as we do create our own UI - and e.g. Fine Reader 15 and 16 might have UI differences that are not relevant for the comparison with our EOCR). There would likely be improvements in some languages - but this is usually the less frequently used ones, and European languages are usually quite good already.

Kind regards,
Stefan
Post Reply