PDF-XChange - Tracker PDF Viewer - TIFF-XChange - Image-XChange - XMF-XChange - Raster-XChange - Support

 
plonz
User
Topic Author
Posts: 1
Joined: Sun Oct 26, 2014 8:06 pm

Correcting OCR errors?

Sun Oct 26, 2014 8:23 pm

Hi,

Is it possible to correct an OCR error in a PDF document for a single word that was not recognized correctly, without rerunning OCR on the whole PDF?

I'm using PDF-XChange Viewer 2.5.

Thanks
 
Tracker Supp-Stefan
Site Admin
Posts: 12025
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Re: Correcting OCR errors?

Tue Oct 28, 2014 5:29 pm

Hello plonz,

Not in the Viewer, but if you download the PDF-XChange Editor:
http://www.tracker-software.com/product ... nge-editor
you will be able to make such corrections. The OCR result is a layer of invisible text placed on top of the original image, and the Editor can modify this text. It becomes visible while you edit it, but you can make it invisible again afterwards.

Regards,
Stefan
 
pdfcoder
User
Posts: 47
Joined: Tue Aug 27, 2013 6:10 am

Re: Correcting OCR errors?

Tue Apr 28, 2015 4:28 pm

I did try this feature...

But:
If I edit the text, for example by copying it to an external text editor, modifying it, and pasting it back, searching for keywords no longer highlights the search hit correctly in the PDF display.

How can I display the text layer directly in the PDF Editor?

How can I correct OCR scan errors in the text layer while keeping the relation between the PDF display layer and the PDF text layer, for searching purposes?

Thx...
 
Patrick-Tracker Supp
Site Admin
Posts: 1546
Joined: Thu Mar 27, 2014 6:14 pm
Location: Vancouver Island

Re: Correcting OCR errors?

Tue Apr 28, 2015 4:49 pm

Hello pdfcoder,

Thanks for the post. If you wish to edit a scanned PDF, you will need to use the Editor. See the steps below to get this working properly. Please note that this will only work well on pages that contain no images (such as pictures) that the OCR does not recognize as text.

OCR works by placing an invisible text layer on top of the existing image layer. Because this document is scanned and actually contains no text, you will need to OCR the document and then remove the underlying image. First, please go to Documents --> OCR Pages.

Choose All pages (or your preferred page range). When you are satisfied, click OK.

Once the document is OCR'd, you can edit the document, though it is a bit arduous. First, you will need to turn the invisible text placed by the OCR into visible text, then remove the underlying picture. You will then be able to edit the text.

Change the text color by selecting the parent item in the Content pane (View --> Other Panes --> Content), then changing the values through the Properties pane (View --> Other Panes --> Properties Pane).

Then remove the underlying picture through the Content pane (View --> Other Panes --> Content): select the image, then press the Delete key to remove it.

Now you are left with a document that contains only text objects.
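
For anyone curious what the visible/invisible switch means inside the file: in the PDF format, text is hidden by the text rendering mode operator `Tr`, where mode 3 means "neither fill nor stroke" (invisible) and mode 0 means normal filled text. OCR layers are commonly written with mode 3. The Python sketch below is only an illustration under that assumption, not how the Editor works internally. The function name `set_text_render_mode` is made up for this example, and it expects an already decoded (uncompressed) page content stream. The regex is deliberately naive and would also rewrite a `3 Tr` that happens to appear inside a text string.

```python
import re

# Matches the PDF text rendering mode operator with a one-digit operand, e.g. "3 Tr".
# The lookbehind avoids matching the tail of a longer number such as "12 Tr".
_TR_PATTERN = re.compile(rb"(?<![\w.])(\d)\s+Tr\b")


def set_text_render_mode(content: bytes, old_mode: int = 3, new_mode: int = 0) -> bytes:
    """Rewrite every `<old_mode> Tr` in a decoded content stream to `<new_mode> Tr`.

    Hypothetical helper for illustration only. Mode 3 is invisible text,
    mode 0 is ordinary filled text.
    """
    def swap(match):
        if int(match.group(1)) == old_mode:
            return b"%d Tr" % new_mode
        return match.group(0)  # leave other rendering modes untouched

    return _TR_PATTERN.sub(swap, content)


# A tiny hand-written content stream with invisible text, as an OCR layer might contain.
sample = b"BT /F1 12 Tf 3 Tr 72 700 Td (Hello) Tj ET"

visible = set_text_render_mode(sample)            # invisible -> visible
hidden_again = set_text_render_mode(visible, 0, 3)  # visible -> invisible
```

Here `visible` contains `0 Tr` in place of `3 Tr`, and `hidden_again` is identical to `sample`, which mirrors making the text visible for editing and invisible again afterwards.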

I hope this helps!

Cheers,

Patrick Charest
Tracker Support North America
 
pdfcoder
User
Posts: 47
Joined: Tue Aug 27, 2013 6:10 am

Re: Correcting OCR errors?

Wed Apr 29, 2015 7:13 am

Hello Patrick,
thanks for the hint, I found it.

BUT:
In my opinion it would be much more helpful if the text could be edited in the Content pane on the left side ("Inhalt", marked in red).

When editing in the viewer window on the right, text changes break the layout because of differing format settings. It's really stressful...
Attachment: edit-content.zip (screenshot, 16.53 KiB)
 
Tracker Supp-Stefan
Site Admin
Posts: 12025
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Re: Correcting OCR errors?

Wed Apr 29, 2015 11:07 am

Hello pdfcoder,

Thanks for the suggestion. We will keep it in mind for future improvements.

Regards,
Stefan
