With 3.5 when using PDF2TEXT, how can I identify the x,y position of the text pieces? I need to be able to extract/identify the contents of a text string from a given x,y position but can't seem to find a function that returns this info.
- Site Admin
Please see the example provided in the evaluation SDK Folder :
The example shows how to extract formatted text from one PDF and insert it into another PDF.
Although its purpose is different, the principle is the same as what you need.
You must extract text from the page element by element using a matrix to acquire each element.
In the matrix, the first four parameters define scaling, rotation and so on, and the last two give the offset from the lower-left corner of the page (as described in the PDF specification).
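To illustrate the matrix layout, here is a minimal Python sketch. It does not call the SDK: `TextElement`, `position`, `scale_and_rotation` and `find_text_at` are hypothetical names, and the sample elements are made up. It only assumes that each extracted element comes with a six-value matrix (a, b, c, d, e, f) in the form the PDF specification uses, where (e, f) is the offset from the lower-left corner of the page.

```python
import math
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class TextElement:
    """Hypothetical container for one extracted text element."""
    text: str
    # PDF-style matrix (a, b, c, d, e, f): a..d carry scaling/rotation,
    # e and f carry the offset from the lower-left corner of the page.
    matrix: Tuple[float, float, float, float, float, float]


def position(el: TextElement) -> Tuple[float, float]:
    """Return the (x, y) origin of the element: the last two matrix values."""
    _, _, _, _, e, f = el.matrix
    return (e, f)


def scale_and_rotation(el: TextElement) -> Tuple[float, float]:
    """Return (horizontal scale, rotation in degrees) from the first two values."""
    a, b, _, _, _, _ = el.matrix
    # The unit x vector maps to (a, b), so its length is the horizontal scale
    # and its direction is the rotation angle.
    return (math.hypot(a, b), math.degrees(math.atan2(b, a)))


def find_text_at(elements: List[TextElement], x: float, y: float,
                 tolerance: float = 2.0) -> Optional[TextElement]:
    """Return the element whose origin is nearest to (x, y) within tolerance."""
    best = None
    best_dist = tolerance
    for el in elements:
        ex, ey = position(el)
        dist = math.hypot(ex - x, ey - y)
        if dist <= best_dist:
            best, best_dist = el, dist
    return best


# Made-up sample data standing in for elements extracted from a page.
elements = [
    TextElement("Invoice", (12, 0, 0, 12, 72, 720)),
    TextElement("Total", (10, 0, 0, 10, 72, 100.5)),
    TextElement("Sideways", (0, 12, -12, 0, 500, 100)),  # rotated 90 degrees
]

hit = find_text_at(elements, 72.5, 719.5)
print(hit.text if hit else "nothing at that position")
```

The tolerance is there because the coordinates you look up will rarely match an element's origin exactly. Matching on the origin alone is a simplification; a fuller approach would also take the element's width and height into account.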
That should give you the methodology you need to proceed.