I am so happy you added OCR! Previously, to get a word searchable pdf I had to use a slow scanner w/ OCR software. Now I can use our speedy copier/printer/scanner and scan a document in seconds rather than minutes, but the resulting file is not searchable. Your OCR feature makes it so! Capt. Picard would be happy
A question: I've generally found that searchable pdf's created using OCR software are much smaller than the same document scanned as an image (e.g., 80k vs 400k). I was hoping your OCR feature might make the file smaller but it doesn't seem to be the case. Is this there anyway to make the file smaller?
OCR file size
Moderators: TrackerSupp-Daniel, Tracker Support, Paul - Tracker Supp, Vasyl-Tracker Dev Team, Chris - Tracker Supp, Sean - Tracker, Ivan - Tracker Software, Tracker Supp-Stefan
-
- User
- Posts: 381
- Joined: Mon Jun 13, 2011 5:10 pm
Re: OCR file size
There are a couple of options which can reduce file size in the resulting PDF.
See the two attached screen shots for the location of these settings in the OCR dialog.
1. Preserve original content and add text layer - this keeps the original image intact and adds text on top of it. In some cases this results in a lower file size, since the alternative replaces the current page with a slightly altered copy (which may be, in some cases, larger).
2. Image quality: lower DPI means lower file size. This may result in loss of image quality from the original scan, but in many cases it won't be noticeable. If you scan at higher DPI for more accurate OCR (e.g. 150 or 300 DPI), then you can use this setting to reduce the image size after OCR and consequently reduce file size. In a test I ran here, a 300 DPI input document which used 14 MB of disk space was reduced to 1.5 MB with image quality set to 72 DPI. Be sure to select "Convert page content to image only ... ", in the PDF Output field, to enable this option.
We have also developed some new output features, but you will have to wait until our next major release which will be coming soon.
Hope this helps
-Walter
See the two attached screen shots for the location of these settings in the OCR dialog.
1. Preserve original content and add text layer - this keeps the original image intact and adds text on top of it. In some cases this results in a lower file size, since the alternative replaces the current page with a slightly altered copy (which may be, in some cases, larger).
2. Image quality: lower DPI means lower file size. This may result in loss of image quality from the original scan, but in many cases it won't be noticeable. If you scan at higher DPI for more accurate OCR (e.g. 150 or 300 DPI), then you can use this setting to reduce the image size after OCR and consequently reduce file size. In a test I ran here, a 300 DPI input document which used 14 MB of disk space was reduced to 1.5 MB with image quality set to 72 DPI. Be sure to select "Convert page content to image only ... ", in the PDF Output field, to enable this option.
We have also developed some new output features, but you will have to wait until our next major release which will be coming soon.
Hope this helps
-Walter
- Attachments
-
- preserve-original-content.zip
- (61.78 KiB) Downloaded 262 times
-
- image-quality.zip
- (71.14 KiB) Downloaded 248 times
-
- User
- Posts: 5
- Joined: Tue Nov 20, 2012 6:00 pm
Re: OCR file size
Thanks so much! My document recipients will be happy
- Paul - Tracker Supp
- Site Admin
- Posts: 6897
- Joined: Wed Mar 25, 2009 10:37 pm
- Location: Chemainus, Canada
- Contact:
Re: OCR file size
Best regards
Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com
Paul O'Rorke
Tracker Support North America
http://www.tracker-software.com