GdPicture.NET is a Nutrient product. Learn more

.NET OCR SDK with AI-powered text recognition

Build intelligent document processing applications with GdPicture .NET OCR SDK. Extract text from images and PDFs with more than 99 percent accuracy across 100+ languages using AI and ML technology. Built-in multithreading, automatic preprocessing, and enterprise-grade reliability — without the complexity and support gaps of open source solutions.



Advanced OCR capabilities

Extract text with more than 99 percent accuracy using AI-powered recognition across 100+ languages. Built-in preprocessing automatically corrects image quality issues — deskew, denoise, and enhance characters. Flexible OCR modes, character filtering, and zonal extraction enable precise control. Create searchable PDFs and process 100+ document formats with multithreaded performance.

Advanced OCR capabilities illustration
Unicode support

Full Unicode support for accurate recognition and output of multilingual text. Process and generate PDFs with Unicode characters in any size. Built-in support for 30+ languages, including English, French, Italian, German, Spanish, Portuguese, Vietnamese, Chinese, Russian, Polish, and Dutch. Extend to 100+ additional languages with Tesseract packs.


Character detection

Advanced character recognition with confidence scoring and precise location data. Configure character allowlists (digits only, alpha only) or denylists to improve accuracy. Retrieve character bounding boxes for exact positioning. Define OCR context — document, page, paragraph, block, line, or word level — to optimize recognition for your specific use case.


Structure extraction

Extract rich text metadata — including font information (style, family), formatting details (bold, italic), and layout properties (justification, alignment, bounding boxes). Intelligent segmentation detects text blocks, paragraphs, lines, words, and individual characters. Output structured text with accurate positioning for downstream document analysis and data extraction workflows.


Image correction

Automatic preprocessing improves OCR accuracy without manual intervention. Built-in capabilities include deskew (orientation correction), paragraph detection, noise removal, character enhancement, and line/punch hole removal. Fast area processing accelerates operations on selected regions. Intelligent corrections deliver high-quality results from poor-quality scans.


Format conversion

Generate searchable PDFs with embedded text layers and PDF/A-4f archival compliance. Our multithreaded engine converts 100+ formats — images, Office documents, CAD files — to searchable PDFs. Recognize and convert documents to DOCX, HTML, PDF, and text formats. Flexible output options ensure broad compatibility and document reuse across your workflows.


System integration

Seamlessly integrates with the .NET SDK’s 100+ document processing features. Multithreaded support for high-volume batch processing with configurable CPU limits. 32-bit and 64-bit compatibility across .NET Framework, .NET Core, and .NET 6+. Works with external Tesseract engines for extended language support. Enterprise-grade architecture scales from single documents to high-volume automated workflows.



Highlights

Searchable output

Convert scanned documents, images, and existing PDFs into searchable PDF/A files with embedded text layers. Our AI-powered OCR engine extracts text and preserves it invisibly behind the original image, enabling full-text search while maintaining visual fidelity. Create PDF/A-4f-compliant archives for long-term document preservation with perfect searchability.

Multithreading performance

Built-in multithreading processes multiple pages simultaneously for faster OCR operations. Configurable CPU limits optimize performance across diverse workloads — from single documents to high-volume batch processing. Scale seamlessly from desktop applications to enterprise document automation workflows with intelligent thread allocation.

Language coverage

Process documents in 100+ languages with 30+ built-in language packs and support for 120+ additional Tesseract languages. Full Unicode support ensures accurate recognition of multilingual content, including English, Chinese, Arabic, Russian, Japanese, and European languages. Recognize multiple languages in a single document for international workflows.

Demo

Test OCR accuracy with your documents

Upload scanned images, PDFs, or photos to evaluate our OCR engine’s performance.

Other OCR technologies

ICR — Intelligent character recognition

Recognize handwritten text with AI-powered character recognition. Currently supports handwritten numerics in boxes, with expansion planned for additional contexts. Ideal for processing handwritten forms, applications, and documents where printed text isn’t available. Machine learning algorithms continuously improve accuracy across diverse handwriting styles.


Get started

How to use

Download and install the GdPicture.NET package to access compiled demo applications and multi-language sample projects with full source code.

Explore demo apps
Find compiled demo applications in \Samples\Bin\.
Explore multi-language source code
Find C# and VB.NET demo apps and source code in \Samples\WinForm\.
Visit reference guide
Explore other code snippets within the online reference guide.

Examples of usage

using GdPictureImaging gdpictureImaging = new GdPictureImaging();
// Select the image to process.
int imageID = gdpictureImaging.CreateGdPictureImageFromFile(@"C:\temp\source.png");
// Scan the barcodes.
gdpictureImaging.Barcode1DReaderDoScan(imageID);
// Determine the number of scanned barcodes.
int barcodeCount = gdpictureImaging.Barcode1DReaderGetBarcodeCount();
string content = "";
if (barcodeCount > 0)
{
content = "Number of barcodes scanned: " + barcodeCount.ToString();
// Save the value of each barcode.
for (int i = 1; i <= barcodeCount; i++)
{
content += $"\nBarcode Number: {i} Value: {gdpictureImaging.Barcode1DReaderGetBarcodeValue(i)}";
}
}
// Write the values to the console.
Console.WriteLine(content);
// Release unnecessary resources.
gdpictureImaging.Barcode1DReaderClear();
gdpictureImaging.ReleaseGdPictureImage(imageID);

Trusted by 3,000+ customers and Fortune 500 companies

15Y+
More than 15 years of experience developing our SDK
10K+
Trusted by more than 10,000 developers

Frequently asked questions

What is the GdPicture.NET OCR SDK?

The GdPicture.NET OCR SDK is a comprehensive document imaging toolkit that provides powerful optical character recognition (OCR) capabilities for .NET applications. It enables developers to extract text from scanned documents, images, and PDFs with high accuracy using AI and machine learning technology.

Which languages does the GdPicture.NET OCR SDK support?

The SDK supports recognition of more than 130 languages, including complex scripts and right-to-left languages like Arabic and Hebrew. It comes with 30+ built-in language packs and supports 100+ additional Tesseract language packs for extended coverage.

How does the GdPicture.NET OCR SDK ensure high accuracy in text recognition?

The SDK employs advanced preprocessing and segmentation techniques to enhance OCR accuracy. Built-in image correction features like deskewing, denoising, and character enhancement automatically improve source quality before recognition. The AI-powered engine delivers more than 99 percent accuracy across diverse document types.

Can the GdPicture.NET OCR SDK convert documents into searchable PDFs?

Yes. The SDK enables the creation of searchable PDFs by embedding recognized text within the PDF as an invisible layer. This makes the content fully searchable and selectable while preserving the original document appearance. It also supports PDF/A archival compliance for long-term document preservation.

Is the GdPicture.NET OCR SDK compatible with multithreaded applications?

Yes. The SDK includes full multithreading support for high-performance document processing. You can configure CPU limits to optimize performance across diverse workloads — from single documents to high-volume batch processing in enterprise environments.

60-day free trial

Try GdPicture.NET now!