Page 1 of 1

Enhancements for GdPicturePDF.OcrPage()

Posted: Tue Apr 15, 2014 9:54 am
by jok
Hello,

GdPicturePDF.OcrPage() seems to make OCRing of a PDF page much easier. It yet does not support all the settings which are possible with GdPictureImaging's OCR functions.

We have been using OCRTesseractSetPassCount() with the traditional functions. There does not seem to be a way to set the pass count with OcrPage(), however, so we cannot yet use it, and would be grateful if such a possibility was added.

One way to do so would be an overload with an extended parameter list (as suggested by Sami, see support ticket #39610).

Since there are lots of other settings as well, however, e.g. the OCR context and the Tesseract variables, it does not seem to be good to provide overloads for all these possibilities.

So, in order to provide GdPicture.OcrPage() with all the functionality given with GdPictureImaging, I suggest to add setter functions to GdPicturePDF in the same way as they exist in GdPictureImaging, i.e. OCRTesseractSetPassCount(), OCRTesseractSetOCRContext(), OCRTesseractSetVariable() etc.

Best regards,

jok

Re: Enhancements for GdPicturePDF.OcrPage()

Posted: Tue Jan 29, 2019 2:29 pm
by Gabriela
Hello,

This is already solved by introducing the new GdPictureOCR class:
https://www.gdpicture.com/guides/gdpicture/we ... reOCR.html