GdPicture

GdPicture Tesseract Plugin - Royalty-Free OCR Engine

GdPicture Tesseract Plugin is a low-cost, fast, accurate and royalty-free OCR Engine for developing applications using GdPicture Imaging SDK Toolkits.

Based on Google's open source Tesseract OCR, the GdPicture Tesseract Plugin adds OCR features to GdPicture Toolkits, such as text recognition on a specific area of an image and the ability to create searchable PDF/A files (PDF-OCR) from scanned documents, images or existing PDF documents.

GdPicture Tesseract Plugin supports many languages (see below) and can process more than 90 document formats.

Main Features


Unicode Support.
Character recognition confidence.
Retrieve character location.
Output text.
Support for PDF/A OCR generation (PDF Image + hidden searchable text).
Multiple languages: English, French, Italian, German, Spanish, Brazilian Portuguese, Vietnamese, Polish and Dutch.
Can restrict recognition to digits only, letters only, or a "white list" of allowed characters.
Fast area processing.
Document orientation detection.
Easy to use.
Fast, accurate & bug free.
Royalty-free licensing: no distribution license required for server or desktop.
 

FAQ


 

Where can I use or evaluate the GdPicture Tesseract OCR Plugin?



The binaries of this plugin are included in the GdPicture SDKs.
Download one of these SDKs to use or evaluate the GdPicture Tesseract Plugin.
The plugin must be unlocked before use; see "How can I unlock the GdPicture Tesseract OCR Plugin?".


What is the minimum text size to get reasonable accuracy?



The minimum text height is about 15-20 pixels. Below 15 pixels, recognition accuracy drops sharply.
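
To check whether a scan is likely to meet this threshold, the pixel height of printed text can be estimated from its point size and the scan resolution. The sketch below is only an approximation: it uses the standard definition of a typographic point (1/72 inch) and returns the nominal font height, which is somewhat larger than the height of the actual glyphs. The function names are illustrative and are not part of the GdPicture API.

```python
def text_height_px(point_size: float, dpi: float) -> float:
    """Nominal font height in pixels (one point is 1/72 inch)."""
    return point_size * dpi / 72.0


def min_dpi_for(point_size: float, target_px: float = 20.0) -> float:
    """Scan resolution needed for text of the given point size to reach target_px pixels."""
    return target_px * 72.0 / point_size


if __name__ == "__main__":
    for dpi in (100, 150, 200, 300):
        print(f"8 pt text at {dpi} DPI: {text_height_px(8, dpi):.1f} px")
    print(f"Minimum DPI for 8 pt text to reach 20 px: {min_dpi_for(8):.0f}")
```

For example, 8 pt text scanned at 150 DPI comes out at roughly 16.7 pixels, which sits at the low end of the recommended range; scanning at a higher resolution gives more margin.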


How can I unlock the GdPicture Tesseract OCR Plugin?



Call the SetLicenseNumberOCRTesseract() method, passing your license key as the parameter:
    E.g., Object.SetLicenseNumberOCRTesseract("YourKey");

You can get a one-month trial key here. You can also purchase licenses here.


How do I perform OCR on a specific zone of an image?



1- Load the image (see your toolkit reference guide).
2- Define the zone (also called the region of interest) using the SetROI() method.
     E.g., Object.SetROI(100,100,250,50)
3- Perform the OCR using the OCRTesseractDoOCR() method.
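
When the zone is known as two corner points rather than as a position and size, it can be converted before being passed to SetROI(). The helper below is a minimal sketch that assumes the four SetROI() arguments are left, top, width and height, in that order; this page does not state the parameter order, so check it against your toolkit reference guide. The function name and the clamping to the image bounds are illustrative and are not part of the GdPicture API.

```python
def roi_from_corners(x1, y1, x2, y2, image_width, image_height):
    """Convert two corner points into (left, top, width, height).

    Assumption: the ROI is expressed as left, top, width, height in pixels.
    The corners may be given in any order; the result is clamped to the image.
    """
    left, right = sorted((x1, x2))
    top, bottom = sorted((y1, y2))
    left = max(0, min(left, image_width))
    right = max(0, min(right, image_width))
    top = max(0, min(top, image_height))
    bottom = max(0, min(bottom, image_height))
    return left, top, right - left, bottom - top


if __name__ == "__main__":
    # Corners (100, 100) and (350, 150) on a 1000 x 1000 image.
    print(roi_from_corners(100, 100, 350, 150, 1000, 1000))
```

Under that assumption, the corners (100, 100) and (350, 150) give the same four values as the SetROI() call shown in step 2.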


How do I build searchable PDF/A files (PDF-OCR) from multi-page TIFF images, PDFs or scanned documents?



Using GdPicture.NET editions: click here
Using GdPicture ActiveX editions: click here


How do I make a custom dictionary to increase the recognition of specific words?



For an English dictionary: edit eng.user-words, then add your own words in UTF-8 format, one word per line, sorted alphabetically.
For a French dictionary: edit fra.user-words, then add your own words in UTF-8 format, one word per line, sorted alphabetically.
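
Keeping such a file sorted and free of duplicates by hand is error-prone, so a small script can do the merge. The sketch below reads an existing user-words file if there is one, adds the new words, and writes the result back as UTF-8 with one word per line. The function name is hypothetical, and the sort uses Python's default code-point order (uppercase before lowercase); this page does not specify the exact collation expected, so treat that as an assumption.

```python
from pathlib import Path


def add_user_words(path, new_words):
    """Merge new_words into a user-words file: UTF-8, one word per line, sorted."""
    file = Path(path)
    existing = file.read_text(encoding="utf-8").splitlines() if file.exists() else []
    # Strip whitespace, drop empty entries, and remove duplicates.
    words = {w.strip() for w in existing + list(new_words) if w.strip()}
    ordered = sorted(words)  # assumption: default code-point order is acceptable
    file.write_text("".join(w + "\n" for w in ordered), encoding="utf-8")
    return ordered


if __name__ == "__main__":
    print(add_user_words("eng.user-words", ["GdPicture", "Tesseract", "searchable"]))
```

Running it again with further words keeps the file sorted and avoids duplicate entries.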
 

Pricing


License             Price/License (USD)

Single Developer    499.00
2-5 Developers      319.96 per developer
Site license        1999.00 per site

All licenses include royalty-free distribution with your application or system.
A software license key remains valid for all future 1.X versions of GdPicture Tesseract Plugin, with free upgrades.

Per developer licenses:
Each valid license key entitles a single developer in a single organization to write software with access to GdPicture Tesseract Plugin.

Site license:
This license entitles an unlimited number of developers in a single organization to write software with access to GdPicture Tesseract Plugin.

 

All elements present on this site are the exclusive property of ORPALIS and are protected by intellectual property law. Any total or partial reproduction of the elements of the website is prohibited without the explicit consent of the site's publication director.