OCR SDK

GdPicture includes an Optical Character Recognition engine to develop any kind of application requiring OCR technology.

With GdPicture OCR SDK, put the power of more than 15 years of continuously improved technologies into your own application.

60-day free trial: download GdPicture.NET now!

Royalty-Free OCR SDK & Searchable PDF Toolkit for GdPicture.NET SDK

Outstanding support

GdPicture OCR SDK

Based on a continuously improved technology, the GdPicture OCR engine provides features such as text recognition on a specific area of an image and the ability to create searchable PDF/A files (PDF-OCR) from scanned documents, images or existing PDF documents.

The GdPicture OCR engine offers built-in multi-threading support, handles more than 100 languages, and can process more than 100 document formats.

Main features

  • OCR SDK with full Unicode support.
  • Multi-thread support (demo application included in the GdPicture.NET SDK package).
  • Character recognition confidence.
  • Retrieves character locations.
  • Retrieves font information (style, family...).
  • Retrieves paragraph information (justification, alignment, bounding box...).
  • Output text.
  • Support for PDF/A OCR generation (PDF Image + hidden searchable text).
  • Can produce very small PDF and PDF/A files that contain Unicode characters.
  • Supports more than 100 languages such as English, French, Italian, German, Spanish, Brazilian, Portuguese, Vietnamese, Chinese, Russian, Polish, Dutch, etc.
  • Can restrict recognition to digits only, letters only, or a whitelist of characters. A blacklist of characters can also be specified.
  • OCR context support: specifies whether the engine is processing a document, single word, single character, text block, vertical text, etc.
  • Integration of external engines.
  • Fast area processing.
  • Automatic document orientation detection.
  • Can detect paragraphs of the same document with different orientations.
  • Automatic skew correction.
  • Intelligent automatic image correction to increase OCR accuracy and speed.
  • Segmentation features to detect blocks, paragraphs, lines, words and characters.
  • Built-in multi-threaded engine for PDF/OCR creation.
  • Recognize and convert more than 100 formats to DOCX, HTML, PDF, and text files.
  • Any-CPU: available in 32-bit & 64-bit versions.
  • Can be used in multi-threaded applications.
  • And more than 100 other features...

Other OCR technologies

ADR - Automatic Document Recognition

The GdPicture.NET ADR engine is designed for automatic document classification and categorization tasks in a document and information management system. It allows your applications to identify invoices, checks, forms, orders, delivery notes, page separators, or any kind of structured document.

MICR - Magnetic Ink Character Recognition

The GdPicture.NET MICR SDK decodes "CMC7" and "E-13B" characters from documents with outstanding speed and accuracy.
It can also detect and decode the MICR line from any structured document, such as a check, by analyzing the full page layout.

MRZ - Machine Readable Zone

ID documents like passports, visas, and other ID cards contain a Machine Readable Zone (MRZ) which makes them readable by machines. The GdPicture.NET MRZ recognition engine allows you to create applications to extract and decode MRZ characters on all types of documents.
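As a small illustration of what decoding an MRZ involves: fields such as the document number and dates are followed by a check digit defined in ICAO Doc 9303 (weights 7, 3, 1 repeated, digits at face value, A to Z mapped to 10 to 35, and the filler `<` counted as 0). The sketch below is generic Python, not GdPicture.NET code, and the function names are hypothetical.

```python
def mrz_char_value(ch: str) -> int:
    """Map one MRZ character to its numeric value per ICAO Doc 9303."""
    if "0" <= ch <= "9":
        return ord(ch) - ord("0")
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10
    if ch == "<":  # filler character
        return 0
    raise ValueError(f"invalid MRZ character: {ch!r}")


def mrz_check_digit(field: str) -> int:
    """Compute the check digit for an MRZ field (hypothetical helper name)."""
    weights = (7, 3, 1)  # repeated cyclically over the field
    total = sum(mrz_char_value(ch) * weights[i % 3] for i, ch in enumerate(field))
    return total % 10


def mrz_field_is_valid(field: str, check_char: str) -> bool:
    """Compare a recognized field against the check digit read from the MRZ."""
    return check_char.isdigit() and mrz_check_digit(field) == int(check_char)
```

A recognized field whose computed digit does not match the digit read from the document is a hint that at least one character was misread.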

OMR - Optical Mark Recognition

The GdPicture.NET OMR engine helps to detect the content of a checkbox, fill-in area, multiple-choice examination form, or any area where a mark is required to indicate a certain choice.
It also provides an anchoring mechanism (also known as template recognition) to specify the area that needs to be processed.
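One common way to decide whether a checkbox is marked is to measure the fraction of dark pixels inside its region. The sketch below illustrates that general idea in plain Python; it is not the GdPicture.NET OMR engine, and the function names, the grayscale list-of-rows image layout, and both thresholds are assumptions chosen for illustration.

```python
def fill_ratio(image, box, dark_threshold=128):
    """Return the fraction of dark pixels inside box.

    image: list of rows, each a list of grayscale values (0 = black, 255 = white).
    box:   (left, top, width, height) in pixels, assumed to lie inside the image.
    """
    left, top, width, height = box
    if width <= 0 or height <= 0:
        raise ValueError("box must have a positive width and height")
    dark = 0
    for y in range(top, top + height):
        for x in range(left, left + width):
            if image[y][x] < dark_threshold:
                dark += 1
    return dark / (width * height)


def is_marked(image, box, min_ratio=0.3, dark_threshold=128):
    """Treat the box as marked when enough of its area is dark (assumed threshold)."""
    return fill_ratio(image, box, dark_threshold) >= min_ratio
```

In practice the box coordinates would come from a template aligned to the scanned page, which is the role an anchoring mechanism plays.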

MRC - Mixed Raster Content

The GdPicture.NET MRC engine automatically adjusts the trade-off between quality and compression rate to produce high-quality PDF MRC documents at the lowest possible size.
It uses elaborate adaptive document-learning algorithms that identify and classify forms of any kind very quickly.

ICR - Intelligent Character Recognition

The GdPicture.NET ICR engine expands the machine vision capabilities of the OCR SDK. At the moment, it recognizes handwritten digits located in boxes. Future versions will support more contexts.

KVP - Key-Value Pair Extraction

Bring Intelligent Document Understanding and Processing features to your unstructured and semi-structured documents with the new key-value pair data extractor.
The engine can instantly identify valuable information in a document, extract it, and qualify it.

How to use the GdPicture.NET OCR SDK

Download and install the GdPicture.NET package.

Compiled demo applications are located in
[Install directory]\Samples\Bin\
C# and VB.NET demo applications, including source code, are located in
[Install directory]\Samples\WinForm\
Other code snippets are available in the online reference guide, GdPicture.NET Guides.
Discussions about the GdPicture Tesseract OCR Plugin can be found in the dedicated Tesseract OCR section of our community forums.

Download GdPicture.NET now!
