PDF-XChange Editor OCR Image for Chinese is not working
PDF-XChange Editor
Version: 7.0.323.2
Download: Zip Installer (32/64 bit) and OCR Chinese Languages Pack
When I use OCR Image, it produces strange characters. This is what I get when I copy the text from my test image.
- Tracker Supp-Stefan
- Site Admin
- Posts: 13119
- Joined: Mon Jan 12, 2009 8:07 am
- Location: London
Re: PDF-XChange Editor OCR Image for Chinese is not working
Hello skycats,
Thanks for your post and the sample file.
While the image quality is not ideal I managed to get this result (for the first three lines of the main text):
本书的写柞缂起几年前找学习 WPFn 固为找是从 Windows Forms 开发转来做 wPF 开发的. 学习过程中
遇到帷多新概念 新特性, 其中包括 Data Binding. 路由鬟伴 命令~ 各种模板等- 我的工作风格足对于每个
新知识. 一定先把它理孵进伽 涮丑白再应用于项目中. 不然总感觉仗用起未不放心1 于是就对照已有的英
I can't quite tell if it's the correct result - but to me it looks almost as if it is.
I used "Medium" quality and "Preserve original content and add text layer on top".
Regards,
Stefan
Re: PDF-XChange Editor OCR Image for Chinese is not working
Thank you for your reply.
After seeing your results, I ran some tests and found that this is a bug.
I printed a PDF file using another tool (the source text is shown first; the original post also included a screenshot).
When I use OCR Page, it works fine:
Code: Select all
我是测试文本0 我是测试文本O 我是测试文本〇
I am test text. I am test text. I am test text.
When I use OCR Image, I get this:
Code: Select all
332%iflfliitfidio fi%ifilflifiilfio fiEifilflifiY$o
I am test text. I am test text. I am test text.

- TrackerSupp-Daniel
- Site Admin
- Posts: 1770
- Joined: Wed Jan 03, 2018 6:52 pm
Re: PDF-XChange Editor OCR Image for Chinese is not working
By any chance, are you using English as the OCR language? If so, you can download additional OCR languages here:
https://www.tracker-software.com/pdf-xchange-viewer-ocr
I believe this is the most likely cause, as my test with English as the selected language netted results similar to yours:
xxasaqfiwmfimfiFméiia wm Eli‘J-fiiAk Windows Forms afiifiusk WPF fiiéfi. Ermifi'?
ifif'HtlifTM-fir. fifilk’i. fiq’flie‘; Data Binding. 33-113344'. ¢¢~ fi-‘fi'fii‘iyf- fifilfl'mfixfififfi“?
1154mm. —-2;i4wazmmsm mullaifirflm—flmfifiv. mmafimtmnflkmfiru, fiitfiflfiflfifi‘li
xvi—NH" MSDNifi-‘fifiibfi‘ifiiflfi. fi-fifififi. fixixkfinififiikfirfli. rkfikfiifiéalfifi. Lit
w—+mm.wmmgmfinfifiiizékflmfimfififimhufi.kfififififinfigiim#&m.
fijhi—$f¥3fifi. fifiTskvufriél-Eiukfi. ii$4$trfiz$¢mtfl7Mllfifiliiafigf¥~—«iii/xii
:1: WPF».
Conversely, with the Chinese character pack installed:
本书的写柞缂起几年前找学习 WPFn 固为找是从 wmdows FOnns开发转来做 wPF 开发的. 学习过程中
遇到帷多新概念 新特性, 其中包括 Data Binding. 路由冪伴 命令~ 各种糢槌等- 我的工作风格足对于每个
新知识. 一定先把它理解透仳 涮白再疸用于唒目屯 不然惡感党仗用起耒不放此 于是訧吋照已有的薑
文弔箝和 MSDN達一研完違些知蚔軋 每-冇所得, 都喜吹韓成榑客炭裊在岡上 一耒供大蒙擘刁奉未 二耒
做一小租懶 防止以后遺羸 博客炭表之后收刲很多煥者的反慵和蚊軌 大窣希塱我能把這些文章蝙撰成蹠
形成一本岸刁杖枕 于是我下泱心幵始葛違本扎 遑木牁的名宇也魷躚了系刊搏客文章的名丰ˍ((深入減
出 WPF».
Significantly fewer errors.
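The difference between the two results above can also be checked automatically. The following is a minimal sketch, not part of PDF-XChange Editor, and it assumes the OCR text has already been copied out as a plain string. The function names `cjk_ratio` and `looks_like_wrong_language` and the 0.3 threshold are hypothetical choices for illustration only. It measures the share of CJK ideographs in the output, which should be high when Chinese text is recognized with the Chinese pack and close to zero when the English engine was applied.

```python
def cjk_ratio(text: str) -> float:
    """Return the fraction of non-whitespace characters that fall in the
    CJK Unified Ideographs block (U+4E00 to U+9FFF)."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    cjk = sum(1 for c in chars if "\u4e00" <= c <= "\u9fff")
    return cjk / len(chars)


def looks_like_wrong_language(text: str, threshold: float = 0.3) -> bool:
    """Heuristic: flag OCR output that contains too few CJK ideographs to be
    plausible Chinese text. The threshold is an assumed value, not a tuned one."""
    return cjk_ratio(text) < threshold


# Sample strings taken from the test results posted earlier in this thread.
ocr_pages_result = "我是测试文本0 我是测试文本O 我是测试文本〇"
ocr_image_result = "332%iflfliitfidio fi%ifilflifiilfio fiEifilflifiY$o"

print(looks_like_wrong_language(ocr_pages_result))  # False
print(looks_like_wrong_language(ocr_image_result))  # True
```

A check like this only tells whether the right script came out; it says nothing about how many individual characters were misrecognized.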
Daniel McIntyre
Support Technician
Tracker Software Products (Canada) LTD
Sales: +1 (250) 324-1621
Fax: +1 (250) 324-1623
Re: PDF-XChange Editor OCR Image for Chinese is not working
Please see post #3.
I have installed the Chinese character pack.
You can download test.pdf and follow my steps (shown in the attached GIF) to reproduce the problem.
- Tracker Supp-Stefan
Re: PDF-XChange Editor OCR Image for Chinese is not working
Hi skycats,
Thanks for the report.
I have now managed to reproduce the issue and have created a ticket in our internal system so that our developers can investigate and fix it:
#4218: Editor 323.2: OCR image does not work the same as Document -> OCR Pages...
In the meantime, please use Document -> OCR Pages as a workaround!
Regards,
Stefan
- Sasha - Tracker Dev Team
- User
- Posts: 3827
- Joined: Fri Nov 21, 2014 8:27 am
Re: PDF-XChange Editor OCR Image for Chinese is not working
This has been fixed, and the fix will be available in the next release.
Cheers,
Alex
Join us at Google+:
https://plus.google.com/+PDFXChangeEditorTS
Subscribe at:
https://www.youtube.com/channel/UC-TwAMNi1haxJ1FX3LvB4CQ