GdPicture.NET OCR SDK :: Tesseract Plugin

Royalty-Free OCR SDK & Searchable PDF Toolkit for GdPicture.NET SDK

Looking for a strong OCR SDK?

GdPicture OCR Tesseract is a 100% royalty-free Optical Character Recognition engine for developing applications that require OCR. Developers can add robust, fast, thread-safe OCR support to managed and unmanaged applications with a few lines of code.

Based on Google’s open source Tesseract OCR V3 engine, the GdPicture OCR Tesseract Plugin adds features to GdPicture.NET such as text recognition on a specific area of an image and the ability to create searchable PDF/A files (PDF-OCR) from scanned documents, images or existing PDF documents.

GdPicture OCR Tesseract Plugin supports many languages (see the list in Main Features below) and can process more than 90 document formats.

Note: OCR of PDF files requires the GdPicture Managed PDF plugin.

Main Features

  • OCR SDK with full Unicode support.
  • Multi-thread support (demo application included in the GdPicture.NET SDK package).
  • Character recognition confidence.
  • Retrieves the location of each recognized character.
  • Plain-text output.
  • Support for PDF/A OCR generation (PDF Image + hidden searchable text).
  • Can produce very small PDF and PDF/A files containing Unicode characters.
  • Supports more than 50 languages such as English, French, Italian, German, Spanish, Brazilian Portuguese, Vietnamese, Chinese, Russian, Polish, Dutch, etc.
  • Can restrict recognition to digits only, alphabetic characters only, or a whitelisted set of characters.
  • OCR context support: specifies whether the engine is processing a document, a single word, a single character, a text block, vertical text, etc.
  • Fast area processing.
  • Document orientation detection.
  • AnyCPU: available in 32-bit and 64-bit versions.

How to use the GdPicture Tesseract OCR SDK

Download and install the GdPicture.NET package. After installation:

  • Compiled demo applications are in [Install directory]\samples\Bin\
  • C# and VB.NET demo applications, with source code, are in [Install directory]\samples\AnyCPU\
  • More code snippets are available in the online reference guide at http://guides.gdpicture.com
  • Discussions about the GdPicture Tesseract OCR Plugin are in the dedicated section of our community forums at http://forums.gdpicture.com/ocr-tesseract/
© Copyright 2003-2014 ORPALIS. All rights reserved