In This Topic

OcrPages Method (GdPicturePDF)

In This Topic
Runs the optical character recognition (OCR) on the specified page range of the loaded PDF document using a defined number of threads. All these overloads are running asynchronously.
Overload List
OverloadDescription
Runs the optical character recognition (OCR) on the specified page range of the loaded PDF document using a defined number of threads. You can also set other parameters according to your preferences. The recognized text is added as invisible text on each processed page. The page orientation is automatically detected for each page as well.

This method involves a rasterization process so any existing visible text within the processed pages will become a part of the images of those pages before the OCR process starts. The same applies to the invisible text contained within pages. It is not kept because of the rasterization process, which simply means any invisible text is removed from processed pages before the OCR process starts.

This method is running asynchronously, in other words you have to wait for the OCR process ending before manipulating the document further. You can benefit from using several OCR related events like GdPicturePDF.BeforePageOcr, GdPicturePDF.OcrPagesProgress and GdPicturePDF.OcrPagesDone.

 
Runs the optical character recognition (OCR) on the specified page range of the loaded PDF document using a defined number of threads. You can also set other parameters according to your preferences. The recognized text is added as invisible text on each processed page. The page orientation is automatically detected for each page as well.

This method involves a rasterization process so any existing visible text within the processed pages will become a part of the images of those pages before the OCR process starts. The same applies to the invisible text contained within pages. It is not kept because of the rasterization process, which simply means any invisible text is removed from processed pages before the OCR process starts.

This method is running asynchronously, in other words you have to wait for the OCR process ending before manipulating the document further. You can benefit from using several OCR related events like GdPicturePDF.BeforePageOcr, GdPicturePDF.OcrPagesProgress and GdPicturePDF.OcrPagesDone.

 
See Also