Auto convert to bitonal before PDF OCR

Feature Requests for GdPicture.NET.
sbarlow
Posts: 33
Joined: Fri Jun 19, 2009 3:13 pm

Auto convert to bitonal before PDF OCR

Post by sbarlow » Tue Jul 27, 2010 6:47 pm

Hi Loïc,

In the methods SaveAsPDFOCR and PdfAddGdPictureImageToPdfOCR (and maybe even PdfOCRCreateFromMultipageTIFF, though I know that would be much harder), would it be possible to have an overload that converts color images to bitonal before performing the OCR?

The image within the PDF would still be the original (color) image, but the OCR would be done on a temporary bitonal extract. This would greatly enhance the PDF OCR output for color images.
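
A minimal sketch of the "temporary bitonal extract" idea, in plain Python rather than GdPicture.NET code. The helper `to_bitonal`, the threshold of 128, and the pixel layout (rows of `(r, g, b)` tuples) are all assumptions made for illustration; a real implementation would more likely use an adaptive threshold. The point is only that the extract is a separate copy with the same width and height, and the color original is left untouched.

```python
# Hypothetical helper, not a GdPicture.NET call.
# pixels: list of rows, each row a list of (r, g, b) tuples in 0..255.
def to_bitonal(pixels, threshold=128):
    """Return a new 1-bit copy (0 = black, 1 = white) with the same dimensions."""
    bitonal = []
    for row in pixels:
        out_row = []
        for r, g, b in row:
            # Integer approximation of the ITU-R BT.601 luma weights.
            luma = (299 * r + 587 * g + 114 * b) // 1000
            out_row.append(1 if luma >= threshold else 0)
        bitonal.append(out_row)
    return bitonal


# A tiny 2x2 "color scan": light background pixels and dark ink pixels.
color = [
    [(255, 255, 255), (20, 20, 60)],
    [(200, 30, 30), (250, 250, 210)],
]
bw = to_bitonal(color)  # OCR would run on bw; color goes into the PDF as-is
```

Because `to_bitonal` builds a new list instead of modifying its input, the color image can still be embedded in the PDF after recognition.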

Not sure if this is even possible.

Scott

sbarlow
Posts: 33
Joined: Fri Jun 19, 2009 3:13 pm

Re: Auto convert to bitonal before PDF OCR

Post by sbarlow » Tue Jul 27, 2010 11:02 pm

Adding another thought: for SaveAsPDFOCR and PdfAddGdPictureImageToPdfOCR, just rework the method so that it takes a B&W image as an additional argument:

(PDFID,IMAGEID,BWID,,,,)

That way we could supply the black-and-white image, extracted from either a single-page or multipage image, filter it as needed to get the optimum result, and then feed it to the method without affecting the original image that will become the image in the PDF. If done correctly, the pixel dimensions remain the same, so the overlay should remain the same.
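
To make the proposed call shape concrete, here is a rough sketch in plain Python. Nothing in it is GdPicture.NET API: `add_page_with_ocr`, the `ocr` callable, and the page dictionary are hypothetical stand-ins for the (PDFID,IMAGEID,BWID,...) idea. It shows the one check that matters for the overlay: the bitonal image must have the same pixel dimensions as the original, so that word boxes found on the bitonal copy line up with the color image stored in the PDF.

```python
def add_page_with_ocr(pdf_pages, image, bw_image, ocr):
    """Hypothetical shape of the requested overload.

    image    -- original (color) pixels, stored in the page unchanged
    bw_image -- caller-prepared bitonal pixels, used only for recognition
    ocr      -- any callable returning [(text, x, y, w, h), ...] for bw_image
    """
    same_height = len(image) == len(bw_image)
    same_size = same_height and all(
        len(a) == len(b) for a, b in zip(image, bw_image)
    )
    if not same_size:
        raise ValueError("bitonal image must match the original pixel dimensions")
    words = ocr(bw_image)  # recognition sees only the bitonal copy
    pdf_pages.append({"image": image, "text_layer": words})
    return len(pdf_pages) - 1  # index of the page just added


def fake_ocr(bw_image):
    # Stand-in for a real OCR engine: one word with its bounding box.
    return [("Scott", 0, 0, 2, 1)]


pages = []
original = [[(255, 255, 255), (20, 20, 60)]]
bitonal = [[1, 0]]
index = add_page_with_ocr(pages, original, bitonal, fake_ocr)
```

Passing the bitonal image in, rather than having the method create it, leaves the choice of filters and thresholds to the caller.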
