How to hide the image of scanned page after OCR

HansS
User
Posts: 6
Joined: Wed Mar 18, 2015 10:19 pm

How to hide the image of scanned page after OCR

Post by HansS »

1. I tested PDF-XChange Editor v5.5 and it seems to be good for e.g. simple editing tasks. I also tried to scan pages. Of course I want to correct errors, but found no easy way to do so? How can I temporarily hide the picture and temporarily set the text colour to black?

2. In many cases it should be possible to select at least one area before OCR, not simply do OCR for the whole page and then delete unwanted parts by hand.

3. There is obviously some problem e.g. in German text. The warning about not supported functions in test mode shows "<a id="1">" etc. as text (see attached screen shot), but it should obviously show links to help instead.
Patrick-Tracker Supp
Site Admin
Posts: 1645
Joined: Thu Mar 27, 2014 6:14 pm
Location: Vancouver Island

Re: How to hide the image of scanned page after OCR

Post by Patrick-Tracker Supp »

Hello HansS,

Thank you for the post.
HansS wrote:
Of course I want to correct errors, but found no easy way to do so?
Once the document is OCR'd, you can edit the document, though it is a bit arduous. First, you will need to turn the invisible text placed by the OCR into visible text, then remove the underlying picture. You can then Edit the text. Change the text color through the properties toolbar (CTRL+E), and remove the underlying picture through the contents pane (View-->Other Panes-->Content).
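
For readers curious about what "invisible text" means at the file level: OCR tools commonly write the recognized text with PDF text rendering mode 3 (neither filled nor stroked), which is set by the `Tr` operator. The Python sketch below assumes that is the case here (this thread does not confirm how PDF-XChange writes its OCR layer) and that you already have a page's decoded content stream as bytes. The function name `show_ocr_text` is made up for illustration. It rewrites `3 Tr` to `0 Tr` (normal filled text) and sets the fill colour to black with `0 g`.

```python
import re

# Matches the operand/operator pair "3 Tr" (text rendering mode 3 = invisible).
# The lookbehind keeps operands such as "13" or "0.3" from matching.
_INVISIBLE_TR = re.compile(rb"(?<![\w.])3(\s+)Tr\b")


def show_ocr_text(content: bytes) -> bytes:
    """Return the content stream with invisible text switched to black fill.

    Assumption: the OCR text was written with rendering mode 3. This is a
    plain pattern replacement, not a PDF parser, so it is only a sketch.
    """
    # "0 g" sets the fill colour to black; "0 Tr" selects filled text.
    return _INVISIBLE_TR.sub(
        lambda m: b"0 g 0" + m.group(1) + b"Tr", content
    )
```

For example, `show_ocr_text(b"BT 3 Tr /F1 12 Tf 72 700 Td (Hello) Tj ET")` returns `b"BT 0 g 0 Tr /F1 12 Tf 72 700 Td (Hello) Tj ET"`. Because the replacement is purely textual, it would also change a literal "3 Tr" that happens to sit inside a text string, so a real tool should tokenize the stream properly.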
HansS wrote:
2. In many cases it should be possible to select at least one area before OCR, not simply do OCR for the whole page and then delete unwanted parts by hand.
This is not possible at this time.
HansS wrote:
3. There is obviously some problem e.g. in German text. The warning about not supported functions in test mode shows "<a id="1">" etc. as text
I have notified our German translator, though please note he is a volunteer.
HansS wrote:
(see attached screen shot)
There is no attachment to your post. If attaching something, you will need to archive it in a zip folder.

I hope this helps!
If posting files to this forum, you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded.
Thank you.

Cheers,

Patrick Charest
Tracker Support North America
HansS
User
Posts: 6
Joined: Wed Mar 18, 2015 10:19 pm

Re: How to hide the image of scanned page after OCR

Post by HansS »

Thanks for the info.
Patrick-Tracker Supp wrote:
Once the document is OCR'd, you can edit the document, though it is a bit arduous. First, you will need to turn the invisible text placed by the OCR into visible text, then remove the underlying picture. You can then Edit the text. Change the text color through the properties toolbar (CTRL+E), and remove the underlying picture through the contents pane (View-->Other Panes-->Content).
1. So this surely needs some improvements to be really usable, especially since in many cases users want to restore the image after making the corrections. At least a simple option to hide/show the picture should be implemented in an upcoming version.

2. I miss a context menu in the content pane, e.g. with the delete function (and hide/show in the future).
Patrick-Tracker Supp wrote:
There is no attachment to your post.
I tried a *.png (which is normally no problem and can also be displayed inline in other forums) and did not notice that it was not accepted. Next try with this post.
Attachments
MWSnap 2015-03-17 22_36_28.zip
Screen shot of missing links in German text.
(14.43 KiB) Downloaded 83 times
Patrick-Tracker Supp
Site Admin
Posts: 1645
Joined: Thu Mar 27, 2014 6:14 pm
Location: Vancouver Island

Re: How to hide the image of scanned page after OCR

Post by Patrick-Tracker Supp »

Hello HansS

Thank you for the post. Currently, editing scanned documents is not supported beyond what I mentioned previously. Rest assured that improvements are coming in due time for this.
HansS wrote:
2. I miss a context menu in the content pane, e.g. with the delete function (and hide/show in the future).
This is indeed something that should be added. I have created an internal development ticket and official feature request. Please reference RT#2989 when requesting an update to this issue.

As for the translation issue, we are still waiting to hear back from our German translation specialist 'BigMike'.

I hope that helps!

Cheers,

Patrick Charest
Tracker Support North America
BigMike
User
Posts: 307
Joined: Wed Nov 07, 2007 10:07 am

Re: How to hide the image of scanned page after OCR

Post by BigMike »

Hi :)

I hope I've fixed all relevant items in the language file. Please try the latest version of the German language files.

@Patrick: Stefan replied to my mail and forwarded the fixed translation files ;)
Tracker Supp-Stefan
Site Admin
Posts: 17948
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Re: How to hide the image of scanned page after OCR

Post by Tracker Supp-Stefan »

Hi Mike,

Indeed I did forward the files to the relevant people!
Thanks once again for your hard work!

Cheers,
Stefan
Markt-a1b
User
Posts: 45
Joined: Sat Sep 07, 2019 7:10 pm

Re: How to hide the image of scanned page after OCR

Post by Markt-a1b »

Patrick-Tracker Supp wrote: Thu Mar 19, 2015 9:10 pm
... Currently, editing scanned documents is not supported beyond what I mentioned previously. Rest assured that improvements are coming in due [time] for this.
As a new user of PDF-XChange Pro and its Enhanced OCR module, I am trying to temporarily hide the underlying image while the text is edited, and then make it visible again. Has the above-referenced capability since been added to the program? Thanks!
Tracker Supp-Stefan
Site Admin
Posts: 17948
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Re: How to hide the image of scanned page after OCR

Post by Tracker Supp-Stefan »

Hello Markt-a1b,

The Enhanced OCR is capable of removing the original image and replacing it directly with visible text, and you can then fine-tune this text using e.g. the "Edit Content" tool.

You can still make a "Searchable image" with the EOCR, and then manually toggle the visibility of the underlying image while editing the (at first invisible) layer of text that will be added on top.

If you want - you can now also move the original image into a 'layer' of its own - so that you can then turn on/off the visibility of that layer quickly.

Regards,
Stefan
Markt-a1b
User
Posts: 45
Joined: Sat Sep 07, 2019 7:10 pm

Re: How to hide the image of scanned page after OCR

Post by Markt-a1b »

Thank you, Stefan! You wrote, "If you want - you can now also move the original image into a 'layer' of its own - so that you can then turn on/off the visibility of that layer quickly."

I am able to first create a new layer, and then cut the image from its original layer in order to paste it into the new one. However, if there is a command to directly "Move selection to a new layer" or similar, I cannot find it.

Tabs Guide > View Tab > Editing Panes > Layers:

https://help.pdf-xchange.com/pdfxe8/layers-pane_ed.html?zoom_highlightsub=hide+layers

Thanks again,

MarkT
Tracker Supp-Stefan
Site Admin
Posts: 17948
Joined: Mon Jan 12, 2009 8:07 am
Location: London

Re: How to hide the image of scanned page after OCR

Post by Tracker Supp-Stefan »

Hello Mark,

I am afraid that I just found out that we can currently only add annotations to a Layer. Adding base content, while supported by the PDF specification, is considerably more complex, and as such we do not yet support that.
For annotations - you can directly create a layer from e.g. a selected annotation, or in the properties of an annotation you can move it to a layer that you created earlier.
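
To illustrate what "supported by the PDF specification" refers to: the specification lets a span of page content be tied to a layer (an optional content group) by wrapping it in a marked-content section tagged `/OC`. The Python sketch below shows only that content-stream step, applied to an already decoded stream. The function name, the image name `Im0` and the property name `MC0` are hypothetical. A complete implementation would also have to create the optional content group object, list it under the page's `/Properties` resources and register it in the catalog's `/OCProperties`, none of which is attempted here.

```python
import re


def wrap_image_in_layer(content: bytes, image_name: bytes, ocg_name: bytes) -> bytes:
    """Wrap each "/<image_name> Do" paint operation in an /OC marked-content section.

    Sketch only: it edits the content stream text and does not create the
    optional content group or the resource entries the names refer to.
    """
    paint_image = re.compile(rb"/" + re.escape(image_name) + rb"\s+Do\b")
    return paint_image.sub(
        # "/OC /<name> BDC ... EMC" marks the enclosed content as optional.
        lambda m: b"/OC /" + ocg_name + b" BDC " + m.group(0) + b" EMC",
        content,
    )
```

For example, `wrap_image_in_layer(b"q 612 0 0 792 0 0 cm /Im0 Do Q", b"Im0", b"MC0")` returns `b"q 612 0 0 792 0 0 cm /OC /MC0 BDC /Im0 Do EMC Q"`. A viewer that honours optional content would then show or hide the scanned image according to the state of the group that `MC0` maps to.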

Regards,
Stefan