
Extract the highlighted text in the scanned pdf

Posted: Thu Sep 15, 2022 3:20 pm
by farrukh
Hi,
I came across this SDK. I need to extract the highlighted (or possibly circled) text in a scanned PDF file.
The idea is then to replace that highlighted region with new text.

Re: Extract the highlighted text in the scanned pdf

Posted: Fri Sep 16, 2022 2:31 pm
by Hugo
Hi Farrukh,

Thanks for your question.

You can absolutely do this, and it only requires 2 steps.
The first step is to redact the words on your page. You can use this demo to preview how this is done interactively:
https://www.gdpicture.com/pdf-sdk/pdf-redaction/
We also have a way to do this programmatically: get the position of the word as coordinates and automatically apply a redaction area around it.
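
To illustrate the geometry of that programmatic approach, here is a minimal sketch in plain Python. It does not use the GdPicture API: the Rect type, the redaction_area function and the padding value are hypothetical names chosen for illustration, and it assumes coordinates with a top-left origin in the same unit as the page size. It only shows how a word's bounding box could be grown slightly and clamped to the page before being passed to whatever redaction call you use.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle; origin assumed at the top-left of the page."""
    left: float
    top: float
    width: float
    height: float


def redaction_area(word: Rect, page_width: float, page_height: float,
                   padding: float = 2.0) -> Rect:
    """Grow the word's bounding box by `padding` on every side and clamp
    the result to the page so the area never extends past the page edges."""
    left = max(0.0, word.left - padding)
    top = max(0.0, word.top - padding)
    right = min(page_width, word.left + word.width + padding)
    bottom = min(page_height, word.top + word.height + padding)
    return Rect(left, top, right - left, bottom - top)


# Hypothetical word box on an A4-sized page measured in points (595 x 842).
word_box = Rect(left=100, top=200, width=50, height=12)
area = redaction_area(word_box, page_width=595, page_height=842)
```

The small padding is a judgment call: on scanned pages the detected box can sit tight against the glyphs, so a margin helps cover the highlight or circle drawn around the word.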

Once your area has been redacted, you can simply draw new text in its place using the same font, font size, etc.
A method for doing this is documented here:
(For PDF):
https://www.gdpicture.com/guides/gdpict ... rawtext%2C

These are the only 2 steps you need to achieve this in GdPicture. Let me know if you need anything else.

Regards,
Hugo Cudd
Technical Support