Hi,
I have a PDF document with an OCR text layer and a webpage with a PDFXchange ActiveX component on it.
When I try to copy and paste the following line "Op donderdag 7 augustus 2014 is er uiteindelijk door mij, verbalisant, een aangifte " from the document, it displays the text as:
"Op donderdag 7 augustus 2014 i s er u i t e i n d e l i j k door mij, v e r b a l i s a n t , een a a n g i f t e ". There is some weird character spacing.
I am using version 2.5.207.0 of the PDF-X OCR SDK, I have also tried it with version 2.5.309.0 (same result)
If I copy and paste the same text from the PDFXchange Editor (version 5.5.309), it is pasted correctly without the extra spacing.
Can you tell me why this is happening and can you fix it?
Difference in text copy-paste between ActiveX and Editor
Moderators: TrackerSupp-Daniel, Tracker Support, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan
-
- User
- Posts: 14
- Joined: Tue Sep 23, 2014 7:00 am
-
- Site Admin
- Posts: 17949
- Joined: Mon Jan 12, 2009 8:07 am
- Location: London
Re: Difference in text copy-paste between ActiveX and Editor
Hello divspirit,
The Editor has more advanced text copying algorithms in it - and will result in better copy/pasting than what the Viewer could do.
The AX is a bit older and as such carries the older text copying code.
The spaces are most likely really there - to allow for the OCR text layer to "match" the underlying image, but the Editor manages to selectively remove them. The Editor SDK is not yet available but we are working on it.
Regards,
Stefan
The Editor has more advanced text copying algorithms in it - and will result in better copy/pasting than what the Viewer could do.
The AX is a bit older and as such carries the older text copying code.
The spaces are most likely really there - to allow for the OCR text layer to "match" the underlying image, but the Editor manages to selectively remove them. The Editor SDK is not yet available but we are working on it.
Regards,
Stefan
-
- User
- Posts: 14
- Joined: Tue Sep 23, 2014 7:00 am
Re: Difference in text copy-paste between ActiveX and Editor
Hi Stefan,
Thanks for your reply. I have used Pdfxplorer to view the OCR layer and the character spacing is indeed there.
Do you know when the editor SDK will become available?
Kind Regards,
Auke
Thanks for your reply. I have used Pdfxplorer to view the OCR layer and the character spacing is indeed there.
Do you know when the editor SDK will become available?
Kind Regards,
Auke
-
- Site Admin
- Posts: 17949
- Joined: Mon Jan 12, 2009 8:07 am
- Location: London
Re: Difference in text copy-paste between ActiveX and Editor
Hi Auke,
I believe that most of it is ready, and we need to prepare the documentation and sample projects but I still do not have any precise date for you.
Regards,
Stefan
I believe that most of it is ready, and we need to prepare the documentation and sample projects but I still do not have any precise date for you.
Regards,
Stefan
-
- User
- Posts: 14
- Joined: Tue Sep 23, 2014 7:00 am
Re: Difference in text copy-paste between ActiveX and Editor
Hi Stefan,
Thanks for your quick reply!
Hopefully the SDK will be released soon, please keep me posted.
Kind Regards,
Auke
Thanks for your quick reply!
Hopefully the SDK will be released soon, please keep me posted.
Kind Regards,
Auke
-
- Site Admin
- Posts: 17949
- Joined: Mon Jan 12, 2009 8:07 am
- Location: London
Re: Difference in text copy-paste between ActiveX and Editor
Hi Auke,
Yep - we will make clear announcements once the Editor SDK is available.
Regards,
Stefan
Yep - we will make clear announcements once the Editor SDK is available.
Regards,
Stefan
-
- User
- Posts: 14
- Joined: Tue Sep 23, 2014 7:00 am
Re: Difference in text copy-paste between ActiveX and Editor
Hi Stefan,
Is there any news about the release date of the editor SDK?
Is there maybe a beta version available for test purposes?
Kind regards,
Auke
Is there any news about the release date of the editor SDK?
Is there maybe a beta version available for test purposes?
Kind regards,
Auke
-
- Site Admin
- Posts: 17949
- Joined: Mon Jan 12, 2009 8:07 am
- Location: London
Re: Difference in text copy-paste between ActiveX and Editor
Hello Auke,
We've decided against releasing a beta version - as without documentation it will be pretty unusable by yourself, and will put a lot of strain on us to guide you through all the available methods, so a beta will not be available before the documentation is complete.
As for the actual release of the SDK - it will be in the new year, but I don't have an exact date yet.
Regards,
Stefan
We've decided against releasing a beta version - as without documentation it will be pretty unusable by yourself, and will put a lot of strain on us to guide you through all the available methods, so a beta will not be available before the documentation is complete.
As for the actual release of the SDK - it will be in the new year, but I don't have an exact date yet.
Regards,
Stefan