Supported languages: 100+ OCR language dictionaries

Nutrient .NET SDK (formerly GdPicture.NET) includes the following language dictionaries for recognizing text with optical character recognition (OCR):

LanguageCode
Arabicara
Germandeu
Englisheng
Frenchfra
Hebrewheb
Italianita
Dutch, Flemishnld
Portuguesepor
Spanish, Castilianspa
Vietnamesevie

To recognize languages not listed above, follow the steps below:

  1. Download the language files(opens in a new tab) provided by the Tesseract team, which include more than 120 languages. To use previous language data files without long short-term memory (LSTM) engine use, download a previous release(opens in a new tab) provided by the Tesseract team.
  2. Add the language files to the folder where your OCR dictionaries are already installed. The default language resources are located in GdPicture.NET 14\Redist\OCR.
  3. Determine language names based on the language codes and the Tesseract documentation(opens in a new tab).