PDF Select Text

Example requests & Code samples for GdPicture Toolkits.
Post Reply
Sly
Posts: 15
Joined: Sun Dec 30, 2007 2:16 pm

PDF Select Text

Post by Sly » Fri Jul 02, 2010 11:33 am

Hi,
I was tring to write some code in order to select text from pdf file (word like), copy and paste it somewhere, but it seems realy hard. Once you convert a file to a pdf OCR it's not possible to get information about the recognized text. Any clue is really appreciated....

Thanks in advance

Sly

User avatar
Loïc
Site Admin
Posts: 5881
Joined: Tue Oct 17, 2006 10:48 pm
Location: France
Contact:

Re: PDF Select Text

Post by Loïc » Sun Jul 04, 2010 6:49 pm

Hi Sly,

GdPicture.NET 7 will come with a new method called PdfReaderGetPageTextWithCoords. This version will be released on September 2010.
Here the definition:
Returns the text of the current page of a PDF including coordinates of the 4 points of each word. The return string will be formatted like this:
TopLeftX + [FieldSeparator] + TopLeftY + [FieldSeparator] +
TopRightX + [FieldSeparator] + TopRightY + [FieldSeparator] +
BottomRightX + [FieldSeparator] + BottomRightY + [FieldSeparator] +
BottomLeftX + [FieldSeparator] + BottomLeftY + [FieldSeparator] +
Word + CRLF
It should allows you to do what you want. In current version, we have no solution for your need.

Kind regards,

Loïc

Post Reply

Who is online

Users browsing this forum: Bing [Bot] and 1 guest