Extract the highlighted text in the scanned pdf

Discussions about PDF management.
Post Reply
farrukh
Posts: 1
Joined: Thu Sep 15, 2022 3:15 pm

Extract the highlighted text in the scanned pdf

Post by farrukh » Thu Sep 15, 2022 3:20 pm

Hi,
I came across to this SDK . I need to extract out the highlighted text or maybe circled text in a scanned pdf file .
The idea then is to replace that highlighted region with a new text
Attachments
Screenshot 2022-09-15 181953.jpg

Hugo
Posts: 227
Joined: Tue Dec 18, 2018 10:09 am

Re: Extract the highlighted text in the scanned pdf

Post by Hugo » Fri Sep 16, 2022 2:31 pm

Hi Farrukh,

Thanks for your question.

You can absolutely do this and this only requires 2 steps for you to do so.
The first requires you to redact both words on your page. You can use this demo to preview how this is done (interactively):
https://www.gdpicture.com/pdf-sdk/pdf-redaction/
We have another solution to do this programmatically (get the position of the word with coordinates and automatically apply redaction area around it).

Once your area has been redacted you can simply draw new text in its place using the same font, font size etc
Such a method to do this can be found here:
(For PDF):
https://www.gdpicture.com/guides/gdpict ... rawtext%2C

These are the only 2 processes you need to achieve this in GdPicture. Let me know if you need anything else.

Regards,
Hugo Cudd
Technical Support

Post Reply

Who is online

Users browsing this forum: No registered users and 1 guest