Document scanning

This Forum is for the use of Clarion For Windows - Software Developers requiring help and assistance for Tracker Software's PDF-Tools SDK of Library DLL functions(only) - Please use the PDF-XChange Drivers API SDK Forum for assistance with all PDF Print Driver related topics.

Moderators: Tracker Support, TrackerSupp-Daniel, Chris - Tracker Supp, Vasyl-Tracker Dev Team, Sean - Tracker, Tracker - Clarion Support, John - Tracker Supp, Tracker Supp-Stefan, Ivan - Tracker Software, Support Staff, moderators

Post Reply
bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Document scanning

Post by bramkip » Tue May 30, 2006 3:26 pm

Hi,

Is there a way to scan documents with a sheetfeeder and save every page with a different documentname?

Is there maybe a sample app for that?

Thank you.

Bram Kip

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Tue May 30, 2006 4:23 pm

Hi Bram,

we have not created a method for that as it is a fairly unusual request - I would suggest that what you do is scan all to one file and then extract each page to a seperate file named as you require - this would I think also be quicker and more reliable too !
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Tue May 30, 2006 7:51 pm

Hi Craig,

I have to scan many invoices and they has to be saved in separate files (each invoice in a file). You understand that I want to scan them with a sheetfeeder.

How can I separate the different pages from one pdf file then, as you suggest. Do you have an example app?

Thanks

Bram

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Wed May 31, 2006 9:57 am

Hi Bram,

There is not an exact example matching your needs - but if you look on page 102 of the manual (PDFToolsCW35.pdf) there are 2 examples detailed showing (amongst other functionality) how to extract pages from a PDF file and create a new PDF file - these should give you a headstart on the process.

One is legacy and the other ABC

pdftbx30.app and legtbx30.app

HTH
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Wed May 31, 2006 1:42 pm

Hi,

In the pdftbx35.app I only see an example of extracting imaging and text not a complete page. Can you give me a hand on this?

Or better, can you make an example app to archive this, it is very important for me to have separate files. I want to pay for the work you have to put in it.

Thanks.

Bram

Tracker - Clarion Support
Site Admin
Posts: 1412
Joined: Wed Jun 30, 2004 4:45 pm
Location: Maryland, USA
Contact:

Post by Tracker - Clarion Support » Wed May 31, 2006 7:34 pm

Hi Bram!

Actually John has been answering you up to now.

I just want to understand - are you wanting to scan a stack of pages and make one PDF per page, or is it more complicated than that?
Craig Ransom
Tracker Software - Clarion Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Thu Jun 01, 2006 9:12 am

Hi Craig,

Correct, I have to put a 50 pages in the sheetfeeder and then scan the pages to different pdf files.

Thanks

Bram

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Fri Jun 02, 2006 10:43 am

Hi Bram,

Craig is working on this now and hopes to have available Monday of next week.
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Fri Jun 02, 2006 4:23 pm

Hi Craig, John

This would really be great! I'm very stucked with it. In about a few weeks I have to scan a lot of paper (about 100000 sheets). Page by page is not really an option :-)

No problem to pay for the hours works you put in it.

Thanks again.

Bram

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Fri Jun 02, 2006 4:47 pm

Hi Bram,

hopefully Craig can sort this by Monday :)

We rarely charge for such work - we prefer to take such requests and make updates for the benefit of all. The issue is rarely the cost unless the request is so unusual - but sometimes time is the problem.

It just so happens that Craig was able to slot this in between 2 jobs :)
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

Tracker - Clarion Support
Site Admin
Posts: 1412
Joined: Wed Jun 30, 2004 4:45 pm
Location: Maryland, USA
Contact:

Post by Tracker - Clarion Support » Sun Jun 04, 2006 11:49 pm

Hi Bram!

We're in the "home stretch" on this one.

I expect to deliver it in a "beta" mode Monday PM - working code but not documented and docs by Tuesday PM.

What I have done is to separate the PDF "building" functionality into a PDFBuilderClass that you could use "raw" to actually build a PDF from the ground up - assuming you didn't mind all the tedium.

I have derived the PDF-Tools Report class from the PDFBuilderClass and provided the Generate method in that derived class, which does all the WMF-based pocessing including searching the WMF's for coded text fields to locate Bookmarks and Annotations.

YOUR class will be the PDFImageClass that takes a queue of Image data - which may be obtained by means other than scanning - and uses a Generate method to create a PDF with exactly the same kinds of functionality you have with the reports: Bookmarks, Annotations, Watermarks et al. It will be your responsiblity to set those kinds of info "by hand" as there are no text-based fields in scanned image data. If you do go the OCR route - then the sky's the limit!
Craig Ransom
Tracker Software - Clarion Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Tue Jun 06, 2006 2:09 pm

Hi Craig,

Can I download already the "beta" version, so I can do some testing?

Thanks.

Bram

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Tue Jun 06, 2006 2:26 pm

Hi Bram,

This has taken a little longer than anticpated as Craig has delved into he has found that it is possible to add some extended functionality useful to all users is possible and is incorporating this for the benefit of all - he is hoping all should be complete in the next 24 hours or so - thanks for your patience.
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Thu Jun 08, 2006 3:59 pm

Hi Craig,

Did you already made some progress?

Please inform me about the status.

Thanks.

Bram

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Sat Jun 10, 2006 9:36 pm

Hi Craig,

Can you inform me about the status?

Thanks

Bram

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Mon Jun 12, 2006 9:17 am

Hi Bram,

Craig lost his DSL connection on Thursday afternoon - he called me Saturday to tell me all was done - but he could not upload - he hopes to have a new modem today - so should be able to provide sometime today (monday) - one of the occassional problems of modern working I am afraid :(

Thanks for your patience.
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Fri Jun 16, 2006 12:41 pm

Hi Craig,

I saw you placed an new version of the templates on the docu-track website. I downloaded and installed it, but can't find anything about the feature I asked for.
Did you implement this in this version? If so, can you point me in the right direction so I can save the scanned documents in separate files.

Thank you.

Bram

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Sat Jun 17, 2006 10:25 am

Hi Bram,

the update was for the core library (DLL's etc) - I am afraid there were no updates to the Clarion specific items (Classes/Templates/Doc's etc)

Please see your email inbox.
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Wed Jun 28, 2006 8:15 pm

Hi Bram,

Finally this is now ready !

Please download from : http://www.tracker-software.com/downloads/dev/

It is also important you read page 9 of the PDFToolsCW35.pdf manual.

There is a demo showing the required functionality called :

scnpdf35.app

Sorry it all took a little longer than hoped.
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

John - Tracker Supp
Site Admin
Posts: 8201
Joined: Tue Jun 29, 2004 10:34 am
Location: Vancouver Island - Canada
Contact:

Post by John - Tracker Supp » Thu Jun 29, 2006 7:56 pm

Hi - addendum to yesterday's release - available today - build 3.60121 after several requests :
+++++++++++++

5) PDFBuilderClass has a new feature: the ability to maintain and access a
list of the generated file
names. Both derived classes (PDFXToolsReportClass and PDFXToolsImageClass)
support
creation of this list in their respective Generate methods.
a) To access the list the PDFBuilderClass class has two methods:
i) GenFileCount, which returns the number of filenames in the internal list.
ii) GetGenFile which returns the specified file name by an index number
which may be
from 1 to GenFileCount.
b) The Clarion Report and new Scanner templates support accessing this list
by having embed
points right after the Generate methods:
i) In the PDF-Tools Report template use "PDF-Tools After Good Generate".
ii) In the new Scanning templates use "PDF-Scanner After PDF Generation".
--
Best Regards
If posting files to this forum - you must archive the files to a ZIP, RAR or 7z file or they will not be uploaded - thank you.

Best regards
Tracker Support
http://www.tracker-software.com

bramkip
User
Posts: 19
Joined: Fri Nov 12, 2004 12:28 pm

Post by bramkip » Sat Jul 01, 2006 2:57 pm

Hi Craig,

The scanning works fine now! With GenFileCount() and GetGenFile(1) I can manage the files perfectly.

I have only one thing on my wish list and that's OCR. Now I have to use another Thirdparty for that, but I believe I read something about it in the Forum that you are working on that also?

Thank very much for the good work and support.

Bram

JaWaRo
User
Posts: 3
Joined: Thu Jul 26, 2007 1:11 pm

Document scanning

Post by JaWaRo » Fri Sep 14, 2007 5:42 pm

Hi,

I´ve found this conversation here and it ist very interesting also for me.
But we would like to go a little further, one file per invoice containing more then one page. We´d like to label the invoices with barcode or number labels, always at the same place, then scann the pile of invoices and at the and generate one pdf file per invoice.

Questions:
1. How can we detect the connecting pages per invoice?
2. Can we get the labeled number and use it for the file name?

Thanks a lot for Your information on this questions.

Tracker - Clarion Support
Site Admin
Posts: 1412
Joined: Wed Jun 30, 2004 4:45 pm
Location: Maryland, USA
Contact:

Post by Tracker - Clarion Support » Sat Sep 15, 2007 8:51 pm

Hi JaWaRo!

When a document page is scanned into a bitmap, it becomes a jumble of pixels. The only way to extract meaningful content is to employ Optical Character Recoginition to identify and interpret that content.

We don't offer such a capability at this time, although you could try to use another package to perform this task.

HTH!
Craig Ransom
Tracker Software - Clarion Support
http://www.tracker-software.com

JaWaRo
User
Posts: 3
Joined: Thu Jul 26, 2007 1:11 pm

OCR

Post by JaWaRo » Wed Sep 26, 2007 7:09 pm

Hi,

my boss pleased me to ask You if You might recommend us any OCR package which we could use to solve our Problem?

Thank You;)

P.S.
You wrote:
We don't offer such a capability at this time...
Do You plan to do so?

Tracker - Clarion Support
Site Admin
Posts: 1412
Joined: Wed Jun 30, 2004 4:45 pm
Location: Maryland, USA
Contact:

Post by Tracker - Clarion Support » Thu Sep 27, 2007 1:52 am

Hi JaWaRo!

OCR is not my area of expertise so I had to fly that by John V.

He suggests Abby's might do what you want.

http://www.abbyy.com/sdk/

Also they know us so that might ease the process of integrating.

Frankly, I have not seen an OCR package that really worked well in an totally automated environment. Be warned!
Craig Ransom
Tracker Software - Clarion Support
http://www.tracker-software.com

Post Reply

Return to “PDF-Tools SDK (DLL Libraries Only) - Clarion For Windows Developers Only Please”