How to convert OCR scanned PDF into *.rtf or *.doc?

Forum for the PDF-XChange Editor - Free and Licensed Versions

Moderators: TrackerSupp-Daniel, Tracker Support, Paul - Tracker Supp, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan

mattad
User
Posts: 143
Joined: Sat Nov 29, 2008 10:37 am

How to convert OCR scanned PDF into *.rtf or *.doc?

Post by mattad »

I loaded a *.pdf file into PDF XChange Editor and clicked menu

Document--->OCR pages....

The pdf is successfully scanned.

But what next?

How can I save the just inspected PDF as *.doc or *.rtf file?

Is this possible at all?

How do I specify the output directory?

Matt
User avatar
Will - Tracker Supp
Site Admin
Posts: 6815
Joined: Mon Oct 15, 2012 9:21 pm
Location: London, UK

Re: How to convert OCR scanned PDF into *.rtf or *.doc?

Post by Will - Tracker Supp »

Hi Matt,

Thanks for the post - I'm afraid that this cannot currently be done with our software, but we hope to see this implemented within the next few builds of PDF-Tools 6.

Thanks,
If posting files to this forum, you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded.
Thank you.

Best regards

Will Travaglini
Tracker Support (Europe)
Tracker Software Products Ltd.
http://www.tracker-software.com
mattad
User
Posts: 143
Joined: Sat Nov 29, 2008 10:37 am

Re: How to convert OCR scanned PDF into *.rtf or *.doc?

Post by mattad »

Thank you for reply.

I am unsure if you are aware of the important difference between real OCR and just converting a pdf to *.RTF/*.DOC

OCR is much more sophiticated since recognizing and diffentiating between e.g. a "G" and a "6" from
a scanned page/article in an image file is much more demanding.

On the other side when you have already a pdf then all the text inside is precisely given.
You just need to convert it to *.doc format.

So I suggest to introduce as a first step a "stupid" pdf-to-doc only feature.

Personally I need this much more often than real OCR from scanned image.

Matt
Willy Van Nuffel
User
Posts: 2395
Joined: Wed Jan 18, 2006 12:10 pm

Re: How to convert OCR scanned PDF into *.rtf or *.doc?

Post by Willy Van Nuffel »

Like Will said, the conversion feature from PDF to DOC is not yet implemented in the new version V6 of PDF-Tools.

But, if you really are interested in a very basic conversion option, then you can still download PDF-Tools version 4 via the Tracker Software download page, and - at PDF Tools - clicking "Previous Builds":
https://www.pdf-xchange.com/version/pdf-tools

In that "old" version, there is an option to convert PDF to RTF (Rich Text Format).

RTF is a format that can be read by Microsoft Word and other text processing applications.
User avatar
Will - Tracker Supp
Site Admin
Posts: 6815
Joined: Mon Oct 15, 2012 9:21 pm
Location: London, UK

Re: How to convert OCR scanned PDF into *.rtf or *.doc?

Post by Will - Tracker Supp »

Thanks Willy :)
If posting files to this forum, you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded.
Thank you.

Best regards

Will Travaglini
Tracker Support (Europe)
Tracker Software Products Ltd.
http://www.tracker-software.com