Page 1 of 1

Searching words in Searchable PDF's using non-English languages

Posted: Sat Oct 12, 2019 6:51 am
by elhanan
Hello there I'm trying pdf Xchange editor, I downloaded OCR Language from the given link in this website. The problem is I already installed the OCR language but pdf Xchange editor is not recognizing the pdf's language. When I select some words and paste it in other places like for example, in "find" input field , it shows symbols not the actual selected word. I attached an Image. Please help

Re: Searching words in Searchable PDF's using non-English languages

Posted: Sun Oct 13, 2019 9:04 pm
by Will - Tracker Supp
Hi elhanan,

Thanks for the email - This is usually due to a poor quality image in the document that is being OCR'd. Can you please send a sample document?

Thanks,

Re: Searching words in Searchable PDF's using non-English languages

Posted: Mon Oct 14, 2019 7:48 am
by elhanan
Actually I scanned the paper with "searchable pdf" option and "300" resolution. I don't think the problem is related to image quality, first of all I scanned to Searchible PDF format, next English words work fine, only my language doesn't work. I think I've missed some configuration in xchange-editor settings since am new to the software.

I've downloaded "Afrikaans" OCR language set from setting, but still doesn't work.

Re: Searching words in Searchable PDF's using non-English languages

Posted: Mon Oct 14, 2019 11:49 am
by Radi - Tracker Supp
Hello elhanan,

It seems like you attached the wrong file. Could you please provide a sample PDF file in which we can see the issue?

Also, please do send us a screenshot of your OCR settings.

If the file is confidential, please do not post it on the forum. Instead, send it to us via e-mail at support@pdf-xchange.com with a link to this forum topic.

Regards,
Radi

Re: Searching words in Searchable PDF's using non-English languages

Posted: Mon Oct 14, 2019 2:23 pm
by elhanan
Ok Here is the attached pic of my OCR Setting, since am using Amharic Language I have downloaded Afrikaans OCR language pack from the downloads. And I send the PDF through the email you provided since I didn't want to show it publicly.

Thank you so much for your help, and if the issue fixed I will use the software since it is the only software that seems to support my language.

Re: Searching words in Searchable PDF's using non-English languages

Posted: Mon Oct 14, 2019 2:43 pm
by Tracker Supp-Stefan
Hello elhanan,

Thanks for the e-mail and the settings here.
The problem is that Afrikaans is an Indo-European language (with 90-ish percent Dutch origin) written with the Latin Alphabet, while Amharic appears to be a semitic language - and the alphabets used are quite different, so it is not unexpected that trying to use the Afrikaans language pack does not produce the desired results in your document, and I am afraid that we do not seem to have your language available in our OCR tool!

Regards,
Stefan

Re: Searching words in Searchable PDF's using non-English languages

Posted: Tue Oct 15, 2019 6:53 am
by elhanan
:( :( Ok then but why did you added the word Amharic in Afrikaans OCR language pack?? Anyways thanks

Re: Searching words in Searchable PDF's using non-English languages

Posted: Tue Oct 15, 2019 8:38 am
by Tracker Supp-Stefan
Hello Elhanan,

Apologies - but I am not sure I fully understand where that is listed! Can you please e.g. share a screenshot where Amharic is listed in the Afrikaans language pack?

Thanks,
Stefan