Can you directly call many online OCRs, such as Baidu OCR.

Discussion for the End User use of OCR in PDF-XChange Editor and Viewer

Moderators: TrackerSupp-Daniel, Tracker Support, Paul - Tracker Supp, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan

Post Reply
softschool
User
Posts: 5
Joined: Thu Jan 19, 2023 11:45 am

Can you directly call many online OCRs, such as Baidu OCR.

Post by softschool »

Can PDF-XChange Editor directly call many online OCRs, such as Baidu OCR, to replace the pre-installed OCR in the software? Because some SHX English and Chinese font PDFs generated by AutoCAD, the recognition effect of using the preset OCR is terrible, and the ABBYY FineReader PDF is also very poor, but the effect of using Baidu OCR is very good, and the accuracy is almost 95%!

PDF Documentation for Experiments
https://drive.google.com/file/d/1525PS8Sth97vWBed4TWhx9ieTUA9dtD-/view?usp=share_link
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17910
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Can you directly call many online OCRs, such as Baidu OCR.

Post by Tracker Supp-Stefan »

Hello softschool,

No - I am afraid that because OCR is quite a resource heavy process - it has to be performed on your machine.
If you need to use an online tool - you can use the Editor to export images of the original PDF pages (with a customizable resolution), and then pass those to the online OCR.

Have you tried the different settings options in the Editor's OCR window - are all of them producing bad results?
Do you have a sample file you could share?

Kind regards,
Stefan
softschool
User
Posts: 5
Joined: Thu Jan 19, 2023 11:45 am

Re: Can you directly call many online OCRs, such as Baidu OCR.

Post by softschool »

https://drive.google.com/file/d/1525PS8Sth97vWBed4TWhx9ieTUA9dtD-/view?usp=share_link

You can try this PDF document, neither PDF-XChange nor ABBYY FineReader can recognize it well, I know Baidu OCR can recognize it well, but it is not an application or a web page, it is just a free or paid cloud service!

This is the Chinese introduction page

Baidu OCR general text recognition (high precision version with location)
https://ai.baidu.com/ai-doc/OCR/tk3h7y2aq
User avatar
Tracker Supp-Stefan
Site Admin
Posts: 17910
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Can you directly call many online OCRs, such as Baidu OCR.

Post by Tracker Supp-Stefan »

Hello softschool,

As you said - it is a cloud tool - so there is a server somewhere that has enough processing power to OCR the content you upload to this service. We do not have plans to include another OCR engine in our products for now, and I am sorry if the ones available are not getting correct recognition of your files!

Kind regards,
Stefan
softschool
User
Posts: 5
Joined: Thu Jan 19, 2023 11:45 am

Re: Can you directly call many online OCRs, such as Baidu OCR.

Post by softschool »

https://drive.google.com/file/d/1525PS8Sth97vWBed4TWhx9ieTUA9dtD-/view?usp=share_link

This is a PDF document generated by AutoCAD, and because it uses SHX fonts, even ABBYY cannot recognize it. Can you contact ABBYY to add recognition of this font to improve the recognition of your OCR engine?

Because there are so many PDF documents in this AutoCAD format, our translation industry often has to face this kind of PDF documents. It is very difficult to translate and typesetting. The most troublesome thing is the work of converting OCR into real text. I hope you can provide The solution for this!
User avatar
Paul - Tracker Supp
Site Admin
Posts: 6897
Joined: Wed Mar 25, 2009 10:37 pm
Location: Chemainus, Canada
Contact:

Re: Can you directly call many online OCRs, such as Baidu OCR.

Post by Paul - Tracker Supp »

Hi softschool,

that is indeed a heavy OCR job! I will pass this on to the team to see what they think we or ABBYY can do.

warm regards
Best regards

Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com
Post Reply