GdPicture Tesseract Plugin - Royalty-Free OCR Engine. OCR SDK
GdPicture Tesseract Plugin is a low-cost, fast, accurate and royalty-free OCR Engine for developing applications using GdPicture Imaging SDK Toolkits.
Based on Google's open source Tesseract OCR, the GdPicture Tesseract Plugin adds OCR features to GdPicture Toolkits, such as text recognition on a specific area of an image and the ability to create searchable PDF/A files (PDF-OCR) from scanned documents, images or existing PDF documents.
GdPicture Tesseract Plugin supports many languages (see below) and can process more than 90 document formats.
Main Features
 Unicode Support.
 Character recognition confidence.
 Retrieve character location.
 Output text.
 Support for PDF/A OCR generation (PDF Image + hidden searchable text).
 Multiple languages: English, French, Italian, German, Spanish, Brazilian Portuguese, Vietnamese, Polish and Dutch.
 Can restrict recognition to digits only, letters only, or a whitelist of characters.
 Fast area processing.
 Document orientation detection.
 Easy to use.
 Fast, accurate & bug free.
Royalty-free licensing: no distribution license required for server or desktop.
FAQ
Where can I use or evaluate the GdPicture Tesseract OCR Plugin?
The binaries of this plugin are included within the following GdPicture SDKs:
By downloading one of these SDKs, you can use or evaluate the GdPicture Tesseract Plugin.
The plugin must be unlocked before use; see "How can I unlock the GdPicture Tesseract OCR Plugin?".
What is the minimum text size to get reasonable accuracy?
The minimum text height is about 15-20 pixels. Below 15 pixels, accuracy drops dramatically.
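To judge whether a scan is likely to clear that threshold, the pixel height of printed text can be estimated from its point size and the scan resolution, since one point is 1/72 inch. The sketch below is only a rough guide: it assumes the text height is close to the nominal font size, whereas actual glyphs are usually somewhat shorter, so treat the result as an upper bound. The function names are illustrative and are not part of any GdPicture API.

```python
def text_height_px(point_size: float, dpi: float) -> float:
    """Approximate height in pixels of text set at point_size and scanned at dpi."""
    # 1 point = 1/72 inch, so height in inches times dots per inch gives pixels.
    return point_size / 72.0 * dpi


def min_dpi_for_height(point_size: float, target_px: float = 20.0) -> float:
    """Scan resolution needed for text of point_size to reach target_px pixels."""
    return target_px * 72.0 / point_size


if __name__ == "__main__":
    for dpi in (72, 150, 300):
        height = text_height_px(10, dpi)
        print(f"10 pt text at {dpi} DPI is about {height:.1f} px tall")
    print(f"8 pt text needs about {min_dpi_for_height(8):.0f} DPI to reach 20 px")
```

Under these assumptions, 10 pt text scanned at 72 DPI comes out at about 10 pixels, which is below the range quoted above, while the same text at 300 DPI is comfortably above it.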
How can I unlock the GdPicture Tesseract OCR Plugin?
Call the SetLicenseNumberOCRTesseract() method, passing your license key as the parameter:
For example: Object.SetLicenseNumberOCRTesseract("YourKey");
You can get a one-month trial key here. You can also purchase licenses here.
How do I perform OCR on a specific zone of an image?
1- Load the image (see your toolkit reference guide).
2- Define the zone (also called the region of interest) using the SetROI() method.
For example: Object.SetROI(100,100,250,50)
3- Perform the OCR using the OCRTesseractDoOCR() method.
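Before passing a zone to SetROI(), it can help to check that it lies inside the image and is tall enough to hold readable text. The following is a minimal sketch of such a check in plain Python. It does not call GdPicture, and it assumes the four values are (left, top, width, height) in pixels, which the text above does not state; verify the parameter order in your toolkit reference guide. The Roi type and both helper functions are hypothetical names introduced for illustration.

```python
from typing import NamedTuple

# Lower bound on text height quoted in the FAQ (about 15-20 pixels).
MIN_TEXT_HEIGHT_PX = 15


class Roi(NamedTuple):
    # ASSUMPTION: the order (left, top, width, height) is a guess, not confirmed by the source.
    left: int
    top: int
    width: int
    height: int


def clamp_roi(roi: Roi, image_width: int, image_height: int) -> Roi:
    """Clip roi so that it lies entirely inside an image of the given size."""
    left = min(max(roi.left, 0), image_width)
    top = min(max(roi.top, 0), image_height)
    right = min(max(roi.left + roi.width, left), image_width)
    bottom = min(max(roi.top + roi.height, top), image_height)
    return Roi(left, top, right - left, bottom - top)


def is_tall_enough(roi: Roi) -> bool:
    """Return True if the zone could contain text of the minimum height."""
    return roi.height >= MIN_TEXT_HEIGHT_PX


if __name__ == "__main__":
    zone = clamp_roi(Roi(100, 100, 250, 50), image_width=300, image_height=120)
    print(zone, "tall enough:", is_tall_enough(zone))
```

A zone that collapses to zero width or height after clamping lies outside the image, so it is worth rejecting such zones before running the OCR.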
How do I build searchable PDF/A files (PDF-OCR) from multi-page TIFF images, PDFs or scanned documents?
Using GdPicture.NET editions: click here
Using GdPicture ActiveX editions: click here
How do I make a custom dictionary to increase the recognition of specific words?
For an English dictionary, edit eng.user-words, then add your own words in UTF-8 format, one word per line, sorted alphabetically.
For a French dictionary, edit fra.user-words, then add your own words in UTF-8 format, one word per line, sorted alphabetically.
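Keeping the word list sorted by hand becomes tedious as it grows. The sketch below merges new words into an existing user-words file and rewrites it as UTF-8 with one word per line. It is an illustration only: the helper name is hypothetical, and it sorts by Python's default string order (uppercase letters before lowercase), which is an assumption, since the exact alphabetical ordering the plugin expects is not specified here.

```python
from pathlib import Path


def add_user_words(path: Path, new_words: list[str]) -> list[str]:
    """Merge new_words into the word list at path and rewrite it sorted."""
    # Read the current list if the file exists; otherwise start from nothing.
    existing = path.read_text(encoding="utf-8").splitlines() if path.exists() else []
    # Strip whitespace, drop blank entries, and remove duplicates.
    words = {w.strip() for w in [*existing, *new_words] if w.strip()}
    ordered = sorted(words)  # ASSUMPTION: default code-point order is acceptable
    # One word per line, UTF-8 encoded, with a trailing newline.
    path.write_text("\n".join(ordered) + "\n", encoding="utf-8")
    return ordered
```

For example, `add_user_words(Path("eng.user-words"), ["GdPicture", "Tesseract"])` would add both words to the English list. Back up the original file first, because the helper overwrites it.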
Pricing
License | Price/License (USD)
Single Developer | 499.00
2-5 Developers | 319.96 per developer
Site license | 1999.00 per site
All licenses include royalty-free distribution with your application or system.
Each software license key remains valid for all future 1.X versions of GdPicture Tesseract Plugin, with free upgrades.
Per developer licenses:
Each valid license key entitles a single developer in a single organization to write software with access to GdPicture Tesseract Plugin.
Site license:
This license entitles an unlimited number of developers in a single organization to write software with access to GdPicture Tesseract Plugin.
Examples of usage