OCR Duplicates the Images in the Document!

Posted: Thu Aug 17, 2017 8:58 am
by jamiestroud
Whenever I use PDF-Tools (v6) to run OCR on a PDF, any images in that document get duplicated. This makes only a trivial difference to the size of the PDF, but it makes an annoying difference for text-to-audio software: it reads each page that has image text on it TWICE (it essentially reads the foreground image object, then the background image object).

At first glance, I didn't even notice that it was duplicating the images, since the duplicate image object is layered exactly on top of the original. I was able to figure it out while editing, though. I can fix the problem by deleting the foreground image objects. Fortunately, the OCR results are still retained after deleting them, and the text-to-audio software stops reading pages twice afterward. However, it's a lot of extra work.

Why does this happen??? Can it be prevented???

Re: OCR Duplicates the Images in the Document!

Posted: Thu Aug 17, 2017 9:02 am
by Will - Tracker Supp
Hi Jamie,

Thanks for the post - I believe that this may have been a known issue in an older release of Tools 6. Please make sure that you are using Version 6 Build 322.7:
https://www.pdf-xchange.com/downloads

Thanks,

Re: OCR Duplicates the Images in the Document!

Posted: Fri Aug 18, 2017 7:26 am
by jamiestroud
Ah, kk, got it, thanks!

On an unrelated note, logging in through this link:
https://www.pdf-xchange.com/forum3 ... mode=login
doesn't work for me.

I've only been able to log in through the home page of this site:
https://www.pdf-xchange.com/

I quadruple-checked that I was entering my username and password correctly. I was, I'm sure of it.

Re: OCR Duplicates the Images in the Document!

Posted: Fri Aug 18, 2017 8:20 am
by Will - Tracker Supp
Hi Jamie,

What happens when you try to log in via that first link?

How did you log in? Was it via this:
Image

Cheers,

Re: OCR Duplicates the Images in the Document!

Posted: Sun Sep 03, 2017 5:11 am
by jamiestroud
It says that the user-name isn't found.

It's through this link:
login.png

Re: OCR Duplicates the Images in the Document!

Posted: Sun Sep 03, 2017 5:14 am
by jamiestroud
Also, I'm using the newest version of PDF-Tools, and it's duplicating objects in the PDFs again. This time, even after I delete all the duplicated objects, a text-to-audio reader still reads pages twice. This is even more problematic than before.

Re: OCR Duplicates the Images in the Document!

Posted: Tue Sep 05, 2017 10:30 pm
by Patrick-Tracker Supp
Hi Jamie,

Could you please send us some sample documents to test with, as well as their source files?

Thank you!