PDF-XChange - Tracker PDF Viewer - TIFF-XChange - Image-XChange - XMF-XChange - Raster-XChange - Support


 
skycats
User
Topic Author
Posts: 4
Joined: Fri Jan 26, 2018 3:51 am

PDF-XChange Editor OCR Image for Chinese is not working

Fri Jan 26, 2018 4:21 am

PDF-XChange Editor
Version: 7.0.323.2
Download: Zip Installer (32/64 bit) and OCR Chinese Languages Pack

When I use OCR Image, it produces strange characters, like this:
1.png


This is the result when I copy the recognized text:
2.png


test image
Test Image.png
 
Tracker Supp-Stefan
Site Admin
Posts: 12737
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Re: PDF-XChange Editor OCR Image for Chinese is not working

Fri Jan 26, 2018 1:34 pm

Hello skycats,

Thanks for your post and the sample file.
Although the image quality is not ideal, I managed to get this result (for the first three lines of the main text):
本书的写柞缂起几年前找学习 WPFn 固为找是从 Windows Forms 开发转来做 wPF 开发的. 学习过程中
遇到帷多新概念 新特性, 其中包括 Data Binding. 路由鬟伴 命令~ 各种模板等- 我的工作风格足对于每个
新知识. 一定先把它理孵进伽 涮丑白再应用于项目中. 不然总感觉仗用起未不放心1 于是就对照已有的英

I can't tell for certain whether this result is correct, but to me it looks nearly right.

I used "Medium" quality and "Preserve original content and add text layer on top".

Regards,
Stefan
 
skycats
User
Topic Author
Posts: 4
Joined: Fri Jan 26, 2018 3:51 am

Re: PDF-XChange Editor OCR Image for Chinese is not working

Fri Jan 26, 2018 6:51 pm

Thank you for your reply.
After seeing your results, I ran some tests.
In the end, I found that this is a bug.

I printed a PDF file using another tool; see the attachment:
test.pdf
(The upper part is text; the lower part is a screenshot.)

When I use OCR Page, it works fine:
TIM截图20180127024630.png

我是测试文本0 我是测试文本O 我是测试文本〇
I am test text. I am test text. I am test text.


When I use OCR Image, it does not work:
TIM截图20180127024921.png

332%iflfliitfidio fi%ifilflifiilfio fiEifilflifiY$o
I am test text. I am test text. I am test text.


:D These strange characters look like a character encoding error.
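
One quick way to put a number on the difference between the two results above is to measure what share of the recognized characters are Chinese ideographs. The snippet below is only a rough sketch: `cjk_ratio` is a hypothetical helper written for this illustration, not part of PDF-XChange Editor, and it only counts the basic CJK Unified Ideographs block (U+4E00 to U+9FFF), so characters such as 〇 (U+3007) are not counted.

```python
def cjk_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters that fall in the
    basic CJK Unified Ideographs block (U+4E00..U+9FFF)."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    cjk = sum(1 for c in chars if "\u4e00" <= c <= "\u9fff")
    return cjk / len(chars)


# The two first-line results quoted in this post.
ocr_page_result = "我是测试文本0 我是测试文本O 我是测试文本〇"
ocr_image_result = "332%iflfliitfidio fi%ifilflifiilfio fiEifilflifiY$o"

print(f"OCR Page:  {cjk_ratio(ocr_page_result):.2f}")
print(f"OCR Image: {cjk_ratio(ocr_image_result):.2f}")
```

A ratio near zero for a page that is known to contain Chinese text suggests that recognition did not run with a Chinese language model at all, rather than that it simply misread a few glyphs.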
 
TrackerSupp-Daniel
Site Admin
Posts: 999
Joined: Wed Jan 03, 2018 6:52 pm

Re: PDF-XChange Editor OCR Image for Chinese is not working

Fri Jan 26, 2018 7:55 pm

By any chance, are you using English as the OCR language? If so, you can download additional OCR languages here:
https://www.tracker-software.com/pdf-xchange-viewer-ocr
I believe this is the most likely cause, as my test with English as the selected language produced results similar to yours:
xxasaqfiwmfimfiFméiia wm Eli‘J-fiiAk Windows Forms afiifiusk WPF fiiéfi. Ermifi'?
ifif'HtlifTM-fir. fifilk’i. fiq’flie‘; Data Binding. 33-113344'. ¢¢~ fi-‘fi'fii‘iyf- fifilfl'mfixfififfi“?
1154mm. —-2;i4wazmmsm mullaifirflm—flmfifiv. mmafimtmnflkmfiru, fiitfiflfiflfifi‘li
xvi—NH" MSDNifi-‘fifiibfi‘ifiiflfi. fi-fifififi. fixixkfinififiikfirfli. rkfikfiifiéalfifi. Lit
w—+mm.wmmgmfinfifiiizékflmfimfififimhufi.kfififififinfigiim#&m.
fijhi—$f¥3fifi. fifiTskvufriél-Eiukfi. ii$4$trfiz$¢mtfl7Mllfifiliiafigf¥~—«iii/xii
:1: WPF».

Conversely, with the Chinese language pack installed and selected:
本书的写柞缂起几年前找学习 WPFn 固为找是从 wmdows FOnns开发转来做 wPF 开发的. 学习过程中
遇到帷多新概念 新特性, 其中包括 Data Binding. 路由冪伴 命令~ 各种糢槌等- 我的工作风格足对于每个
新知识. 一定先把它理解透仳 涮白再疸用于唒目屯 不然惡感党仗用起耒不放此 于是訧吋照已有的薑
文弔箝和 MSDN達一研完違些知蚔軋 每-冇所得, 都喜吹韓成榑客炭裊在岡上 一耒供大蒙擘刁奉未 二耒
做一小租懶 防止以后遺羸 博客炭表之后收刲很多煥者的反慵和蚊軌 大窣希塱我能把這些文章蝙撰成蹠
形成一本岸刁杖枕 于是我下泱心幵始葛違本扎 遑木牁的名宇也魷躚了系刊搏客文章的名丰ˍ((深入減
出 WPF».
Significantly fewer errors.
Daniel McIntyre
 
skycats
User
Topic Author
Posts: 4
Joined: Fri Jan 26, 2018 3:51 am

Re: PDF-XChange Editor OCR Image for Chinese is not working

Sat Jan 27, 2018 5:36 pm

Please see post #3 above.
I have already installed the Chinese language pack.
You can download test.pdf and try my steps yourself.
(Click an image to see the GIF.)
action1.gif


action2.gif
 
Tracker Supp-Stefan
Site Admin
Posts: 12737
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Re: PDF-XChange Editor OCR Image for Chinese is not working

Mon Jan 29, 2018 1:55 pm

Hi skycats,

Thanks for the report.
I have now managed to reproduce the issue and have created a ticket in our internal system so that our developers can investigate and fix it:
#4218: Editor 323.2: OCR image does not work the same as Document -> OCR Pages...

In the meantime, please use Document -> OCR Pages as a workaround!

Regards,
Stefan
 
Sasha - Tracker Dev Team
User
Posts: 3313
Joined: Fri Nov 21, 2014 8:27 am

Re: PDF-XChange Editor OCR Image for Chinese is not working

Mon Jan 29, 2018 3:11 pm

This has been fixed, and the fix will be available in the next release.

Cheers,
Alex