OCR Languages

Discussions about machine vision support in GdPicture.
Vangeti
Posts: 1
Joined: Wed Jun 20, 2012 12:13 pm

OCR Languages

Post by Vangeti » Wed Jun 20, 2012 12:33 pm

Hi,

Could anyone please list the languages that the OCR engine supports?

Thank You,
Harry

SamiKharma
Posts: 352
Joined: Tue Sep 27, 2011 11:47 am

Re: OCR Languages

Post by SamiKharma » Mon Jun 25, 2012 12:11 pm

Hi,
For the languages supported in V8 and older, you need to download the language pack:
http://www.gdpicture.com/download/ocr_language_pack.zip

Here is the list of languages supported in GdPicture V9, with the data files for each:
Arabic language data: ara.traineddata, ara.cube.bigrams, ara.cube.fold, ara.cube.lm, ara.cube.nn, ara.cube.params, ara.cube.size, ara.cube.word-freq

Bulgarian language data: bul.traineddata

Catalan language data: cat.traineddata

Czech language data: ces.traineddata

Chinese (Simplified) language data: chi_sim.traineddata

Chinese (Traditional) language data: chi_tra.traineddata

Cherokee language data: chr.traineddata

Danish language data: dan.traineddata

Danish (Fraktur) language data: dan-frak.traineddata

German language data: deu.traineddata

German (Fraktur, Old German) language data: deu-frak.traineddata

Greek language data: ell.traineddata

English language data: eng.traineddata

Finnish language data: fin.traineddata

French language data: fra.traineddata

Hebrew language data: heb.traineddata

Hindi language data: hin.traineddata, hin.cube, hin.cube.fold, hin.cube.lm, hin.cube.nn, hin.cube.params, hin.cube.word-freq, hin.tesseract_cube.nn

Hungarian language data: hun.traineddata

Indonesian language data: ind.traineddata

Italian language data: ita.traineddata

Japanese language data: jpn.traineddata

Korean language data: kor.traineddata

Latvian language data: lav.traineddata

Lithuanian language data: lit.traineddata

Dutch language data: nld.traineddata

Norwegian language data: nor.traineddata

Polish language data: pol.traineddata

Portuguese language data: por.traineddata

Romanian language data: ron.traineddata

Russian language data: rus.traineddata

Slovakian language data: slk.traineddata

Slovakian (Fraktur) language data: slk-frak.traineddata

Slovenian language data: slv.traineddata

Spanish language data: spa.traineddata

Serbian (Latin) language data: srp.traineddata

Swedish language data: swe.traineddata

Swedish (Fraktur) language data: swe-frak.traineddata

Tagalog language data: tgl.traineddata

Thai language data: tha.traineddata

Turkish language data: tur.traineddata

Ukrainian language data: ukr.traineddata

Vietnamese language data: vie.traineddata
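
To check which of these languages are actually present in a given resource folder, something like the following could be used. This is only a minimal sketch in plain Python and does not use the GdPicture API: the function name installed_languages, the LANGUAGE_NAMES table (a subset of the list above), and the folder path are all assumptions made for illustration. It relies only on the naming pattern shown above, where each language has a <code>.traineddata file.

```python
from pathlib import Path

# Hypothetical lookup table: a subset of the codes listed above.
LANGUAGE_NAMES = {
    "ara": "Arabic",
    "chi_sim": "Chinese (Simplified)",
    "deu": "German",
    "deu-frak": "German (Fraktur)",
    "eng": "English",
    "fra": "French",
    "hin": "Hindi",
}


def installed_languages(resource_dir):
    """Return {code: name} for every <code>.traineddata file in resource_dir.

    Auxiliary files such as ara.cube.lm are ignored because they do not
    end in .traineddata. Codes missing from LANGUAGE_NAMES map to "unknown".
    """
    found = {}
    for path in sorted(Path(resource_dir).glob("*.traineddata")):
        code = path.stem  # "deu-frak.traineddata" -> "deu-frak"
        found[code] = LANGUAGE_NAMES.get(code, "unknown")
    return found


if __name__ == "__main__":
    # Assumed folder name; point this at wherever the language files were unpacked.
    for code, name in installed_languages("ocr_resources").items():
        print(f"{code}: {name}")
```

The folder you scan should be the same one you give to the OCR engine as its dictionary/resource path.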

Best Regards,
Sami

Loïc
Site Admin
Posts: 5881
Joined: Tue Oct 17, 2006 10:48 pm
Location: France

Re: OCR Languages

Post by Loïc » Tue Jun 26, 2012 12:45 pm

Additional information: the most up-to-date list can be found in the reference guide (starting with GdPicture.NET 9), under Appendix / Tesseract OCR Language Dictionaries.

see: https://www.gdpicture.com/guides/gdpicture
