Searching words in Searchable PDF's using non-English languages

Forum for the PDF-XChange Editor - Free and Licensed Versions

Moderators: TrackerSupp-Daniel, Tracker Support, Vasyl-Tracker Dev Team, Sean - Tracker, Paul - Tracker Supp, Chris - Tracker Supp, Tracker Supp-Stefan, Ivan - Tracker Software

Post Reply
elhanan
User
Posts: 4
Joined: Thu Oct 10, 2019 8:12 am

Searching words in Searchable PDF's using non-English languages

Post by elhanan » Sat Oct 12, 2019 6:51 am

Hello there I'm trying pdf Xchange editor, I downloaded OCR Language from the given link in this website. The problem is I already installed the OCR language but pdf Xchange editor is not recognizing the pdf's language. When I select some words and paste it in other places like for example, in "find" input field , it shows symbols not the actual selected word. I attached an Image. Please help
Attachments
xchange.png
(5.16 KiB) Not downloaded yet

User avatar
Will - Tracker Supp
Site Admin
Posts: 6819
Joined: Mon Oct 15, 2012 9:21 pm
Location: London, UK
Contact:

Re: Searching words in Searchable PDF's using non-English languages

Post by Will - Tracker Supp » Sun Oct 13, 2019 9:04 pm

Hi elhanan,

Thanks for the email - This is usually due to a poor quality image in the document that is being OCR'd. Can you please send a sample document?

Thanks,
If posting files to this forum, you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded.
Thank you.

Best regards

Will Travaglini
Tracker Support (Europe)
Tracker Software Products Ltd.
http://www.tracker-software.com

elhanan
User
Posts: 4
Joined: Thu Oct 10, 2019 8:12 am

Re: Searching words in Searchable PDF's using non-English languages

Post by elhanan » Mon Oct 14, 2019 7:48 am

Actually I scanned the paper with "searchable pdf" option and "300" resolution. I don't think the problem is related to image quality, first of all I scanned to Searchible PDF format, next English words work fine, only my language doesn't work. I think I've missed some configuration in xchange-editor settings since am new to the software.

I've downloaded "Afrikaans" OCR language set from setting, but still doesn't work.
Attachments
xchange.zip
(2.67 KiB) Downloaded 9 times

User avatar
Radi - Tracker Supp
Site Admin
Posts: 401
Joined: Tue Mar 03, 2015 12:46 pm

Re: Searching words in Searchable PDF's using non-English languages

Post by Radi - Tracker Supp » Mon Oct 14, 2019 11:49 am

Hello elhanan,

It seems like you attached the wrong file. Could you please provide a sample PDF file in which we can see the issue?

Also, please do send us a screenshot of your OCR settings.

If the file is confidential, please do not post it on the forum. Instead, send it to us via e-mail at support@tracker-software.com with a link to this forum topic.

Regards,
Radi

elhanan
User
Posts: 4
Joined: Thu Oct 10, 2019 8:12 am

Re: Searching words in Searchable PDF's using non-English languages

Post by elhanan » Mon Oct 14, 2019 2:23 pm

Ok Here is the attached pic of my OCR Setting, since am using Amharic Language I have downloaded Afrikaans OCR language pack from the downloads. And I send the PDF through the email you provided since I didn't want to show it publicly.

Thank you so much for your help, and if the issue fixed I will use the software since it is the only software that seems to support my language.
Attachments
OCR setting.zip
(736.1 KiB) Downloaded 17 times

User avatar
Tracker Supp-Stefan
Site Admin
Posts: 13663
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Searching words in Searchable PDF's using non-English languages

Post by Tracker Supp-Stefan » Mon Oct 14, 2019 2:43 pm

Hello elhanan,

Thanks for the e-mail and the settings here.
The problem is that Afrikaans is an Indo-European language (with 90-ish percent Dutch origin) written with the Latin Alphabet, while Amharic appears to be a semitic language - and the alphabets used are quite different, so it is not unexpected that trying to use the Afrikaans language pack does not produce the desired results in your document, and I am afraid that we do not seem to have your language available in our OCR tool!

Regards,
Stefan

elhanan
User
Posts: 4
Joined: Thu Oct 10, 2019 8:12 am

Re: Searching words in Searchable PDF's using non-English languages

Post by elhanan » Tue Oct 15, 2019 6:53 am

:( :( Ok then but why did you added the word Amharic in Afrikaans OCR language pack?? Anyways thanks

User avatar
Tracker Supp-Stefan
Site Admin
Posts: 13663
Joined: Mon Jan 12, 2009 8:07 am
Location: London
Contact:

Re: Searching words in Searchable PDF's using non-English languages

Post by Tracker Supp-Stefan » Tue Oct 15, 2019 8:38 am

Hello Elhanan,

Apologies - but I am not sure I fully understand where that is listed! Can you please e.g. share a screenshot where Amharic is listed in the Afrikaans language pack?

Thanks,
Stefan

Post Reply