Replace Image in PDF without changing text
Posted: Tue Jun 05, 2018 10:12 am
Hey,
I have millions of images which have been OCR'd to searchable PDF using Abbyy Recognition Server. I now want to replace the black&white image in the PDF with a colour copy without affecting the text of the page. Re-OCR is not an option because Abbyy Recognition Server has also output various other files formats for each document, like ALTO-XML, Accuracy statistics, etc.
Is this possible with gdpicture. Is there either a way to replace the black&white image in a PDF with a colour version without destroying the searchable text, or is there a way I can extract the text and positions and put that into a colour version PDF?
Cheers,
Matthew Jones
I have millions of images which have been OCR'd to searchable PDF using Abbyy Recognition Server. I now want to replace the black&white image in the PDF with a colour copy without affecting the text of the page. Re-OCR is not an option because Abbyy Recognition Server has also output various other files formats for each document, like ALTO-XML, Accuracy statistics, etc.
Is this possible with gdpicture. Is there either a way to replace the black&white image in a PDF with a colour version without destroying the searchable text, or is there a way I can extract the text and positions and put that into a colour version PDF?
Cheers,
Matthew Jones