List of the predefined languages

Here is the list of internal names of the predefined languages that are supported in ABBYY FineReader Engine Linux (= CLI is based on the version 11 of the SDK).

There are 2 types of languages

  • languages that have full built-in dictionary support
  • “simple” languages with a specific language definition, for example allowed/used character sets.

Standard Languages

Internal name Recognition language Can be used for OCR Full dictionary support available

Abkhaz

Abkhaz

+

Adyghe

Adyghe

+  

Afrikaans

Afrikaans

+  

Agul

Agul

+  

Albanian

Albanian

+  

Altaic

Altaic

+  

Arabic

Arabic (Saudi Arabia)

+ +

ArmenianEastern

Armenian (Eastern)

+ +

ArmenianGrabar

Armenian (Grabar)

+ +

ArmenianWestern

Armenian (Western)

+ +

Awar

Avar

+  

Aymara

Aymara

+  

AzeriCyrillic

Azerbaijani (Cyrillic)

+  

AzeriLatin

Azerbaijani (Latin)

+ +

Bashkir

Bashkir

+ +

Basic

Basic programming language

+  

Basque

Basque

+  

Belarusian

Belarussian

+  

Bemba

Bemba

+  

Blackfoot

Blackfoot

+  

Breton

Breton

+  

Bugotu

Bugotu

+  

Bulgarian

Bulgarian

+ +

Buryat

Buryat

+  

C++

C/C++ programming language

+  

Catalan

Catalan

+ +

Chamorro

Chamorro

+  

Chechen

Chechen

+  

Chemistry

Simple chemical formulas

+  

ChinesePRC

Chinese Simplified

+  

ChinesePRC+English*

Chinese Simplified and English

+  

ChineseTaiwan

Chinese Traditional

+  

ChineseTaiwan+English*

Chinese Traditional and English

+  

Chukcha

Chukcha

+  

Chuvash

Chuvash

+  

"CMC7">CMC7

For MICR CMC-7 text type

+  

Cobol

Cobol programming language

+  

Corsican

Corsican

+  

CrimeanTatar

Crimean Tatar

+  

Croatian

Croatian

+ +

Crow

Crow

+  

Czech

Czech

+ +

Danish

Danish

+ +

Dargwa

Dargwa

+  

Digits

Numbers

+  

Dungan

Dungan

+  

Dutch

Dutch (Netherlands) + +

DutchBelgian

Dutch (Belgium) +  

E13B

For MICR (E-13B) text type

+  

English

English

+ +

EskimoCyrillic

Eskimo (Cyrillic)

+  

EskimoLatin

Eskimo (Latin)

+  

Esperanto

Esperanto

+  

Estonian

Estonian

+ +

Even

Even

+  

Evenki

Evenki

+  

Faeroese

Faeroese

+  

Fijian

Fijian

+  

Finnish

Finnish

+ +

Fortran

Fortran programming language

+  

French

French

+ +

Frisian

Frisian

+  

Friulian

Friulian

+  

GaelicScottish

Scottish Gaelic

+  

Gagauz

Gagauz

+  

Galician

Galician

+  

Ganda

Ganda

+  

German

German

+ +

GermanNewSpelling

German (new spelling)

+ +

GermanLuxembourg

German (Luxembourg)

+  

Greek

Greek

+ +

Guarani

Guarani

+  

Hani

Hani

+  

Hausa

Hausa

+  

Hawaiian

Hawaiian

+  

Hebrew

Hebrew

+ +

Hungarian

Hungarian

+ +

Icelandic

Icelandic

+  

Ido

Ido

+  

Indonesian

Indonesian

+ +

Ingush

Ingush

+  

Interlingua

Interlingua

+  

Irish

Irish

+  

Italian

Italian

+ +

Japanese

Japanese

+ +

Japanese+English*

Japanese and English

+ +

Java

Java programming language

+  

Kabardian

Kabardian

+  

Kalmyk

Kalmyk

+  

KarachayBalkar

Karachay-Balkar

+  

Karakalpak

Karakalpak

+  

Kasub

Kasub

+  

Kawa

Kawa

+  

Kazakh

Kazakh

+  

Khakas

Khakas

+  

Khanty

Khanty

+  

Kikuyu

Kikuyu

+  

Kirgiz

Kirghiz

+  

Kongo

Kongo

+  

Korean

Korean

+ +

Korean+English*

Korean and English

+ +

KoreanHangul

Korean (Hangul)

+ +

Koryak

Koryak

+  

Kpelle

Kpelle

+  

Kumyk

Kumyk

+  

Kurdish

Kurdish

+  

Lak

Lak

+  

Lappish

Sami (Lappish)

+  

Latin

Latin

+ +

Latvian

Latvian

+ +

LatvianGothic

Latvian language written in Gothic script

+  

Lezgin

Lezgin

+  

Lithuanian

Lithuanian

+ +

Luba

Luba

+  

Macedonian

Macedonian

+  

Malagasy

Malagasy

+  

Malay

Malay

+  

Malinke

Malinke

+  

Maltese

Maltese

+  

Mansi

Mansi

+  

Maori

Maori

+  

Mari

Mari

+  

Maya

Maya

+  

Miao

Miao

+  

Minankabaw

Minangkabau

+  

Mixed*

Russian and English

+ +

Mohawk

Mohawk

+  

Mongol

Mongol

+  

Mordvin

Mordvin

+  

Nahuatl

Nahuatl

+  

Nenets

Nenets

+  

Nivkh

Nivkh

+  

Nogay

Nogay

+  

Norwegian

NorwegianNynorsk and NorwegianBokmal

+ +

NorwegianBokmal

Norwegian (Bokmal)

+ +

NorwegianNynorsk

Norwegian (Nynorsk)

+ +

Nyanja

Nyanja

+  

Occidental

Occidental

+  

OcrA

For OCR-A text type

+  

OcrB

For OCR-B text type

+  

Ojibway

Ojibway

+  

Papiamento

Papiamento

+  

Pascal

Pascal programming language

+  

PidginEnglish

Tok Pisin

+  

Polish

Polish

+ +

PortugueseBrazilian

Portuguese (Brazil)

+ +

PortugueseStandard

Portuguese (Portugal)

+ +

Provencal

Provencal

+  

Quechua

Quechua

+  

RhaetoRomanic

Rhaeto-Romanic

+  

Romanian

Romanian

+ +

RomanianMoldavia

Romanian (Moldavia)

+  

Romany

Romany

+  

Ruanda

Ruanda

+  

Rundi

Rundi

+  

RussianOldSpelling

Russian (old spelling)

+ +

Russian

Russian

+ +

RussianWithAccent

Russian (with accents marking stress position)

+ +

Samoan

Samoan

+  

Selkup

Selkup

+  

SerbianCyrillic

Serbian (Cyrillic)

+  

SerbianLatin

Serbian (Latin)

+  

Shona

Shona

+  

Sioux

Sioux (Dakota)

+  

Slovak

Slovak

+ +

Slovenian

Slovenian

+ +

Somali

Somali

+  

Sorbian

Sorbian

+  

Sotho

Sotho

+  

Spanish

Spanish

+ +

Sunda

Sunda

+  

Swahili

Swahili

+  

Swazi

Swazi

+  

Swedish

Swedish

+ +

Tabassaran

Tabassaran

+  

Tagalog

Tagalog

+  

Tahitian

Tahitian

+

Tajik

Tajik

+  

Tatar

Tatar

+ +

Thai

Thai

+ +

Tinpo

Jingpo

+  

Tongan

Tongan

+  

Tswana

Tswana

+  

Tun

Tun

+  

Turkish

Turkish

+ +

Turkmen

Turkmen

+  

TurkmenLatin

Turkmen (Latin)

+  

Tuvin

Tuvan

+  

Udmurt

Udmurt

+  

UighurCyrillic

Uighur (Cyrillic)

+  

UighurLatin

Uighur (Latin)

+  

Ukrainian

Ukrainian

+ +

UzbekCyrillic

Uzbek (Cyrillic)

+  

UzbekLatin

Uzbek (Latin)

+  

Vietnamese

Vietnamese

+ +

Visayan

Cebuano

+  

Welsh

Welsh

+  

Wolof

Wolof

+  

Xhosa

Xhosa

+  

Yakut

Yakut

+  

Yiddish

Yiddish

+  

Zapotec

Zapotec

+  

Zulu

Zulu

+  

* These are compound recognition languages. The compound predefined languages are to be removed in future versions.

Note: Support for historic fonts and the related "old" languages are not included per default in ABBYYs CLI OCR.

 

Language Keys when the CJK module is licensed

  • ChinesePRC
  • ChineseTaiwan
  • Japanese
  • Korean
  • KoreanHangul
This website uses cookies which enable you to see pages or use other functions of our websites. You can turn off such cookies in your browser’s settings. If you continue to use these pages, you consent to the use of cookies.