Getting Started
The following section will walk you through how to best make use of all the functionality the OCR API provides.
The Basics of OCR
Letโs start with the basics. The OCR API allows you to take files in any supported format and make any text in them selectable and searchable. This is useful for images and scanned documents. To learn more about OCR itself, see here.
For our first example, weโll run OCR on a single image file. To do this, add a page1.jpg file to
the same folder as your code. You can use any image containing text, or use our provided
sample page.
Run the code, and youโll get a result.pdf with your page OCRed. Our example sets the OCR language
to English. If your content is in a different language, make sure to update the language property
accordingly. You can find a list of all supported languages at the
bottom of this page.
Code to perform OCR:
curl -X POST https://api.nutrient.io/processor/ocr \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
-F file=@page1.jpg \
-F data='{
"language": "english"
}'
curl -X POST https://api.nutrient.io/processor/ocr ^
-H "Authorization: Bearer your_api_key_here" ^
-o result.pdf ^
--fail ^
-F file=@page1.jpg ^
-F data="{\"language\": \"english\"}"
package com.example.pspdfkit;
import java.io.File;
import java.io.IOException;
import java.nio.file.FileSystems;
import java.nio.file.Files;
import java.nio.file.StandardCopyOption;
import org.json.JSONArray;
import org.json.JSONObject;
import okhttp3.MediaType;
import okhttp3.MultipartBody;
import okhttp3.OkHttpClient;
import okhttp3.Request;
import okhttp3.RequestBody;
import okhttp3.Response;
public final class PspdfkitApiExample {
public static void main(final String[] args) throws IOException {
final RequestBody body = new MultipartBody.Builder()
.setType(MultipartBody.FORM)
.addFormDataPart(
"file",
"page1.jpg",
RequestBody.create(
MediaType.parse("image/jpeg"),
new File("page1.jpg")
)
)
.addFormDataPart(
"data",
new JSONObject()
.put("language", "english").toString()
)
.build();
final Request request = new Request.Builder()
.url("https://api.nutrient.io/processor/ocr")
.method("POST", body)
.addHeader("Authorization", "Bearer your_api_key_here")
.build();
final OkHttpClient client = new OkHttpClient()
.newBuilder()
.build();
final Response response = client.newCall(request).execute();
if (response.isSuccessful()) {
Files.copy(
response.body().byteStream(),
FileSystems.getDefault().getPath("result.pdf"),
StandardCopyOption.REPLACE_EXISTING
);
} else {
// Handle the error
throw new IOException(response.body().string());
}
}
}
using System;
using System.IO;
using System.Net;
using RestSharp;
namespace PspdfkitApiDemo
{
class Program
{
static void Main(string[] args)
{
var client = new RestClient("https://api.nutrient.io/processor/ocr");
var request = new RestRequest(Method.POST)
.AddHeader("Authorization", "Bearer your_api_key_here")
.AddFile("file", "page1.jpg")
.AddParameter("data", new JsonObject
{
["language"] = "english"
}.ToString());
request.AdvancedResponseWriter = (responseStream, response) =>
{
if (response.StatusCode == HttpStatusCode.OK)
{
using (responseStream)
{
using var outputFileWriter = File.OpenWrite("result.pdf");
responseStream.CopyTo(outputFileWriter);
}
}
else
{
var responseStreamReader = new StreamReader(responseStream);
Console.Write(responseStreamReader.ReadToEnd());
}
};
client.Execute(request);
}
}
}
// This code requires Node.js. Do not run this code directly in a web browser.
const axios = require('axios')
const FormData = require('form-data')
const fs = require('fs')
const formData = new FormData()
formData.append('data', JSON.stringify({
language: "english"
}))
formData.append('file', fs.createReadStream('page1.jpg'))
;(async () => {
try {
const response = await axios.post('https://api.nutrient.io/processor/ocr', formData, {
headers: formData.getHeaders({
'Authorization': 'Bearer your_api_key_here'
}),
responseType: "stream"
})
response.data.pipe(fs.createWriteStream("result.pdf"))
} catch (e) {
const errorString = await streamToString(e.response.data)
console.log(errorString)
}
})()
function streamToString(stream) {
const chunks = []
return new Promise((resolve, reject) => {
stream.on("data", (chunk) => chunks.push(Buffer.from(chunk)))
stream.on("error", (err) => reject(err))
stream.on("end", () => resolve(Buffer.concat(chunks).toString("utf8")))
})
}
import requests
import json
response = requests.request(
'POST',
'https://api.nutrient.io/processor/ocr',
headers = {
'Authorization': 'Bearer your_api_key_here'
},
files = {
'file': open('page1.jpg', 'rb')
},
data = {
'data': json.dumps({
'language': 'english'
})
},
stream = True
)
if response.ok:
with open('result.pdf', 'wb') as fd:
for chunk in response.iter_content(chunk_size=8096):
fd.write(chunk)
else:
print(response.text)
exit()
<?php
$FileHandle = fopen('result.pdf', 'w+');
$curl = curl_init();
curl_setopt_array($curl, array(
CURLOPT_URL => 'https://api.nutrient.io/processor/ocr',
CURLOPT_CUSTOMREQUEST => 'POST',
CURLOPT_RETURNTRANSFER => true,
CURLOPT_ENCODING => '',
CURLOPT_POSTFIELDS => array(
'data' => '{
"language": "english"
}',
'file' => new CURLFILE('page1.jpg')
),
CURLOPT_HTTPHEADER => array(
'Authorization: Bearer your_api_key_here'
),
CURLOPT_FILE => $FileHandle,
));
$response = curl_exec($curl);
curl_close($curl);
fclose($FileHandle);
POST https://api.nutrient.io/processor/ocr HTTP/1.1
Content-Type: multipart/form-data; boundary=--customboundary
Authorization: Bearer your_api_key_here
--customboundary
Content-Disposition: form-data; name="data"
Content-Type: application/json
{
"language": "english"
}
--customboundary
Content-Disposition: form-data; name="file"; filename="page1.jpg"
Content-Type: image/jpeg
(file data)
--customboundary--
Advanced OCR
While running OCR on a single page is useful, often youโll have a folder full of scanned pages that you want to both run OCR on and merge into a single searchable PDF.
Luckily, this is easy; pass in multiple images โ one for each page in your request โ and Nutrient DWS Processor API will merge all of them into a PDF before running OCR on it.
Add more files in the same folder as your code and run the updated code. You can duplicate and rename the existing file you have, or you can add some other images containing text.
Code to perform OCR on multiple pages:
curl -X POST https://api.nutrient.io/build \
-H "Authorization: Bearer your_api_key_here" \
-o result.pdf \
--fail \
-F page1.jpg=@/path/to/page1.jpg \
-F page2.jpg=@/path/to/page2.jpg \
-F page3.jpg=@/path/to/page3.jpg \
-F page4.jpg=@/path/to/page4.jpg \
-F instructions='{
"parts": [
{
"file": "page1.jpg"
},
{
"file": "page2.jpg"
},
{
"file": "page3.jpg"
},
{
"file": "page4.jpg"
}
],
"actions": [
{
"type": "ocr",
"language": "english"
}
]
}'
curl -X POST https://api.nutrient.io/build ^
-H "Authorization: Bearer your_api_key_here" ^
-o result.pdf ^
--fail ^
-F page1.jpg=@/path/to/page1.jpg ^
-F page2.jpg=@/path/to/page2.jpg ^
-F page3.jpg=@/path/to/page3.jpg ^
-F page4.jpg=@/path/to/page4.jpg ^
-F instructions="{\"parts\": [{\"file\": \"page1.jpg\"}, {\"file\": \"page2.jpg\"}, {\"file\": \"page3.jpg\"}, {\"file\": \"page4.jpg\"}], \"actions\": [{\"type\": \"ocr\", \"language\": \"english\"}]}"
package com.example.pspdfkit;
import java.io.File;
import java.io.IOException;
import java.nio.file.FileSystems;
import java.nio.file.Files;
import java.nio.file.StandardCopyOption;
import org.json.JSONArray;
import org.json.JSONObject;
import okhttp3.MediaType;
import okhttp3.MultipartBody;
import okhttp3.OkHttpClient;
import okhttp3.Request;
import okhttp3.RequestBody;
import okhttp3.Response;
public final class PspdfkitApiExample {
public static void main(final String[] args) throws IOException {
final RequestBody body = new MultipartBody.Builder()
.setType(MultipartBody.FORM)
.addFormDataPart(
"page1.jpg",
"/path/to/page1.jpg",
RequestBody.create(
MediaType.parse("image/jpeg"),
new File("/path/to/page1.jpg")
)
)
.addFormDataPart(
"page2.jpg",
"/path/to/page2.jpg",
RequestBody.create(
MediaType.parse("image/jpeg"),
new File("/path/to/page2.jpg")
)
)
.addFormDataPart(
"page3.jpg",
"/path/to/page3.jpg",
RequestBody.create(
MediaType.parse("image/jpeg"),
new File("/path/to/page3.jpg")
)
)
.addFormDataPart(
"page4.jpg",
"/path/to/page4.jpg",
RequestBody.create(
MediaType.parse("image/jpeg"),
new File("/path/to/page4.jpg")
)
)
.addFormDataPart(
"instructions",
new JSONObject()
.put("parts", new JSONArray()
.put(new JSONObject()
.put("file", "page1.jpg")
)
.put(new JSONObject()
.put("file", "page2.jpg")
)
.put(new JSONObject()
.put("file", "page3.jpg")
)
.put(new JSONObject()
.put("file", "page4.jpg")
)
)
.put("actions", new JSONArray()
.put(new JSONObject()
.put("type", "ocr")
.put("language", "english")
)
).toString()
)
.build();
final Request request = new Request.Builder()
.url("https://api.nutrient.io/build")
.method("POST", body)
.addHeader("Authorization", "Bearer your_api_key_here")
.build();
final OkHttpClient client = new OkHttpClient()
.newBuilder()
.build();
final Response response = client.newCall(request).execute();
if (response.isSuccessful()) {
Files.copy(
response.body().byteStream(),
FileSystems.getDefault().getPath("result.pdf"),
StandardCopyOption.REPLACE_EXISTING
);
} else {
// Handle the error
throw new IOException(response.body().string());
}
}
}
using System;
using System.IO;
using System.Net;
using RestSharp;
namespace PspdfkitApiDemo
{
class Program
{
static void Main(string[] args)
{
var client = new RestClient("https://api.nutrient.io/build");
var request = new RestRequest(Method.POST)
.AddHeader("Authorization", "Bearer your_api_key_here")
.AddFile("page1.jpg", "/path/to/page1.jpg")
.AddFile("page2.jpg", "/path/to/page2.jpg")
.AddFile("page3.jpg", "/path/to/page3.jpg")
.AddFile("page4.jpg", "/path/to/page4.jpg")
.AddParameter("instructions", new JsonObject
{
["parts"] = new JsonArray
{
new JsonObject
{
["file"] = "page1.jpg"
},
new JsonObject
{
["file"] = "page2.jpg"
},
new JsonObject
{
["file"] = "page3.jpg"
},
new JsonObject
{
["file"] = "page4.jpg"
}
},
["actions"] = new JsonArray
{
new JsonObject
{
["type"] = "ocr",
["language"] = "english"
}
}
}.ToString());
request.AdvancedResponseWriter = (responseStream, response) =>
{
if (response.StatusCode == HttpStatusCode.OK)
{
using (responseStream)
{
using var outputFileWriter = File.OpenWrite("result.pdf");
responseStream.CopyTo(outputFileWriter);
}
}
else
{
var responseStreamReader = new StreamReader(responseStream);
Console.Write(responseStreamReader.ReadToEnd());
}
};
client.Execute(request);
}
}
}
// This code requires Node.js. Do not run this code directly in a web browser.
const axios = require('axios')
const FormData = require('form-data')
const fs = require('fs')
const formData = new FormData()
formData.append('instructions', JSON.stringify({
parts: [
{
file: "page1.jpg"
},
{
file: "page2.jpg"
},
{
file: "page3.jpg"
},
{
file: "page4.jpg"
}
],
actions: [
{
type: "ocr",
language: "english"
}
]
}))
formData.append('page1.jpg', fs.createReadStream('/path/to/page1.jpg'))
formData.append('page2.jpg', fs.createReadStream('/path/to/page2.jpg'))
formData.append('page3.jpg', fs.createReadStream('/path/to/page3.jpg'))
formData.append('page4.jpg', fs.createReadStream('/path/to/page4.jpg'))
;(async () => {
try {
const response = await axios.post('https://api.nutrient.io/build', formData, {
headers: formData.getHeaders({
'Authorization': 'Bearer your_api_key_here'
}),
responseType: "stream"
})
response.data.pipe(fs.createWriteStream("result.pdf"))
} catch (e) {
const errorString = await streamToString(e.response.data)
console.log(errorString)
}
})()
function streamToString(stream) {
const chunks = []
return new Promise((resolve, reject) => {
stream.on("data", (chunk) => chunks.push(Buffer.from(chunk)))
stream.on("error", (err) => reject(err))
stream.on("end", () => resolve(Buffer.concat(chunks).toString("utf8")))
})
}
import requests
import json
response = requests.request(
'POST',
'https://api.nutrient.io/build',
headers = {
'Authorization': 'Bearer your_api_key_here'
},
files = {
'page1.jpg': open('/path/to/page1.jpg', 'rb'),
'page2.jpg': open('/path/to/page2.jpg', 'rb'),
'page3.jpg': open('/path/to/page3.jpg', 'rb'),
'page4.jpg': open('/path/to/page4.jpg', 'rb')
},
data = {
'instructions': json.dumps({
'parts': [
{
'file': 'page1.jpg'
},
{
'file': 'page2.jpg'
},
{
'file': 'page3.jpg'
},
{
'file': 'page4.jpg'
}
],
'actions': [
{
'type': 'ocr',
'language': 'english'
}
]
})
},
stream = True
)
if response.ok:
with open('result.pdf', 'wb') as fd:
for chunk in response.iter_content(chunk_size=8096):
fd.write(chunk)
else:
print(response.text)
exit()
<?php
$FileHandle = fopen('result.pdf', 'w+');
$curl = curl_init();
curl_setopt_array($curl, array(
CURLOPT_URL => 'https://api.nutrient.io/build',
CURLOPT_CUSTOMREQUEST => 'POST',
CURLOPT_RETURNTRANSFER => true,
CURLOPT_ENCODING => '',
CURLOPT_POSTFIELDS => array(
'instructions' => '{
"parts": [
{
"file": "page1.jpg"
},
{
"file": "page2.jpg"
},
{
"file": "page3.jpg"
},
{
"file": "page4.jpg"
}
],
"actions": [
{
"type": "ocr",
"language": "english"
}
]
}',
'page1.jpg' => new CURLFILE('/path/to/page1.jpg'),
'page2.jpg' => new CURLFILE('/path/to/page2.jpg'),
'page3.jpg' => new CURLFILE('/path/to/page3.jpg'),
'page4.jpg' => new CURLFILE('/path/to/page4.jpg')
),
CURLOPT_HTTPHEADER => array(
'Authorization: Bearer your_api_key_here'
),
CURLOPT_FILE => $FileHandle,
));
$response = curl_exec($curl);
curl_close($curl);
fclose($FileHandle);
POST https://api.nutrient.io/build HTTP/1.1
Content-Type: multipart/form-data; boundary=--customboundary
Authorization: Bearer your_api_key_here
--customboundary
Content-Disposition: form-data; name="instructions"
Content-Type: application/json
{
"parts": [
{
"file": "page1.jpg"
},
{
"file": "page2.jpg"
},
{
"file": "page3.jpg"
},
{
"file": "page4.jpg"
}
],
"actions": [
{
"type": "ocr",
"language": "english"
}
]
}
--customboundary
Content-Disposition: form-data; name="page1.jpg"; filename="/path/to/page1.jpg"
Content-Type: image/jpeg
(page1.jpg data)
--customboundary
Content-Disposition: form-data; name="page2.jpg"; filename="/path/to/page2.jpg"
Content-Type: image/jpeg
(page2.jpg data)
--customboundary
Content-Disposition: form-data; name="page3.jpg"; filename="/path/to/page3.jpg"
Content-Type: image/jpeg
(page3.jpg data)
--customboundary
Content-Disposition: form-data; name="page4.jpg"; filename="/path/to/page4.jpg"
Content-Type: image/jpeg
(page4.jpg data)
--customboundary--
Supported Languages
The OCR action supports a wide range of languages for text extraction. You can specify languages using either:
- Full language name (lowercase, e.g.,
english,german) - available for commonly used languages - ISO 639-2 language code (e.g.,
eng,deu) - available for all languages - ISO 639-2 language code with variant (e.g.,
chi_sim_vertordeu_frak)
| Description | Language code | Full language name |
|---|---|---|
| ๐ฟ๐ฆ Afrikaans | afr | |
| ๐ฆ๐ฑ Albanian | sqi | |
| ๐ช๐น Amharic | amh | |
| ๐ธ๐ฆ Arabic | ara | |
| ๐ฆ๐ฒ Armenian | hye | |
| ๐ฎ๐ณ Assamese | asm | |
| ๐ฆ๐ฟ Azerbaijani | aze | |
| ๐ฆ๐ฟ Azerbaijani - Cyrillic | aze_cyrl | |
| ๐ช๐ธ Basque | eus | |
| ๐ง๐พ Belarusian | bel | |
| ๐ง๐ฉ Bengali | ben | |
| ๐ง๐ฆ Bosnian | bos | |
| ๐ซ๐ท Breton | bre | |
| ๐ง๐ฌ Bulgarian | bul | |
| ๐ฒ๐ฒ Burmese | mya | |
| ๐ช๐ธ Catalan; Valencian | cat | |
| ๐ต๐ญ Cebuano | ceb | |
| ๐ฐ๐ญ Central Khmer | khm | |
| ๐บ๐ธ Cherokee | chr | |
| ๐จ๐ณ Chinese - Simplified | chi_sim | |
| ๐จ๐ณ Chinese - Simplified (Vertical) | chi_sim_vert | |
| ๐น๐ผ Chinese - Traditional | chi_tra | |
| ๐น๐ผ Chinese - Traditional (Vertical) | chi_tra_vert | |
| ๐ซ๐ท Corsican | cos | |
| ๐ญ๐ท Croatian | hrv | croatian |
| ๐จ๐ฟ Czech | ces | czech |
| ๐ฉ๐ฐ Danish | dan | danish |
| ๐ฉ๐ฐ Danish - Fraktur | dan_frak | |
| ๐ฒ๐ป Dhivehi; Maldivian | div | |
| ๐ณ๐ฑ Dutch; Flemish | nld | dutch |
| ๐ง๐น Dzongkha | dzo | |
| ๐ฌ๐ง English | eng | english |
| ๐ฌ๐ง English, Middle (1100-1500) | enm | |
| Esperanto | epo | |
| ๐ช๐ช Estonian | est | |
| ๐ซ๐ด Faroese | fao | |
| ๐ต๐ญ Filipino | fil | |
| ๐ซ๐ฎ Finnish | fin | finnish |
| ๐ซ๐ท French | fra | french |
| ๐ซ๐ท French, Middle (ca. 1400-1600) | frm | |
| ๐ช๐ธ Galician | glg | |
| ๐ฌ๐ช Georgian | kat | |
| ๐ฌ๐ช Georgian - Old | kat_old | |
| ๐ฉ๐ช German | deu | german |
| ๐ฉ๐ช German - Fraktur | deu_frak | |
| ๐ฉ๐ช German Fraktur | frk | |
| ๐ฌ๐ท Greek, Ancient | grc | |
| ๐ฌ๐ท Greek, Modern | ell | |
| ๐ฎ๐ณ Gujarati | guj | |
| ๐ญ๐น Haitian; Haitian Creole | hat | |
| ๐ฎ๐ฑ Hebrew | heb | |
| ๐ฎ๐ณ Hindi | hin | |
| ๐ญ๐บ Hungarian | hun | |
| ๐ฎ๐ธ Icelandic | isl | |
| ๐ฎ๐ฉ Indonesian | ind | indonesian |
| ๐จ๐ฆ Inuktitut | iku | |
| ๐ฎ๐ช Irish | gle | |
| ๐ฎ๐น Italian | ita | italian |
| ๐ฎ๐น Italian - Old | ita_old | |
| ๐ฏ๐ต Japanese | jpn | |
| ๐ฏ๐ต Japanese (Vertical) | jpn_vert | |
| ๐ฎ๐ฉ Javanese | jav | |
| ๐ฎ๐ณ Kannada | kan | |
| ๐ฐ๐ฟ Kazakh | kaz | |
| ๐ฐ๐ฌ Kirghiz; Kyrgyz | kir | |
| ๐ฐ๐ท Korean | kor | |
| ๐ฐ๐ท Korean (Vertical) | kor_vert | |
| ๐ฎ๐ถ Kurdish | kur | |
| ๐น๐ท Kurmanji (Kurdish) | kmr | |
| ๐ฑ๐ฆ Lao | lao | |
| ๐ป๐ฆ Latin | lat | |
| ๐ฑ๐ป Latvian | lav | |
| ๐ฑ๐น Lithuanian | lit | |
| ๐ฑ๐บ Luxembourgish | ltz | |
| ๐ฒ๐ฐ Macedonian | mkd | |
| ๐ฒ๐พ Malay | msa | malay |
| ๐ฎ๐ณ Malayalam | mal | |
| ๐ฒ๐น Maltese | mlt | |
| ๐ณ๐ฟ Maori | mri | |
| ๐ฎ๐ณ Marathi | mar | |
| Math/Equation detection | equ | |
| ๐ฒ๐ณ Mongolian | mon | |
| ๐ณ๐ต Nepali | nep | |
| ๐ณ๐ด Norwegian | nor | norwegian |
| ๐ซ๐ท Occitan | oci | |
| ๐ฎ๐ณ Oriya | ori | |
| ๐ฎ๐ณ Panjabi; Punjabi | pan | |
| ๐ฎ๐ท Persian | fas | |
| ๐ต๐ฑ Polish | pol | polish |
| ๐ต๐น Portuguese | por | portuguese |
| ๐ฆ๐ซ Pushto; Pashto | pus | |
| ๐ต๐ช Quechua | que | |
| ๐ท๐ด Romanian; Moldavian | ron | |
| ๐ท๐บ Russian | rus | |
| ๐ฎ๐ณ Sanskrit | san | |
| ๐ฌ๐ง Scottish Gaelic | gla | |
| ๐ท๐ธ Serbian | srp | serbian |
| ๐ท๐ธ Serbian - Latin | srp_latn | |
| ๐ต๐ฐ Sindhi | snd | |
| ๐ฑ๐ฐ Sinhala; Sinhalese | sin | |
| ๐ธ๐ฐ Slovak | slk | slovak |
| ๐ธ๐ฐ Slovak - Fraktur | slk_frak | |
| ๐ธ๐ฎ Slovenian | slv | slovenian |
| ๐ช๐ธ Spanish; Castilian | spa | spanish |
| ๐ช๐ธ Spanish - Old | spa_old | |
| ๐ฎ๐ฉ Sundanese | sun | |
| ๐ฐ๐ช Swahili | swa | |
| ๐ธ๐ช Swedish | swe | swedish |
| ๐ธ๐พ Syriac | syr | |
| ๐ต๐ญ Tagalog | tgl | |
| ๐น๐ฏ Tajik | tgk | |
| ๐ฎ๐ณ Tamil | tam | |
| ๐ท๐บ Tatar | tat | |
| ๐ฎ๐ณ Telugu | tel | |
| ๐น๐ญ Thai | tha | |
| ๐จ๐ณ Tibetan | bod | |
| ๐ช๐ท Tigrinya | tir | |
| ๐น๐ด Tonga | ton | |
| ๐น๐ท Turkish | tur | turkish |
| ๐จ๐ณ Uighur; Uyghur | uig | |
| ๐บ๐ฆ Ukrainian | ukr | |
| ๐ต๐ฐ Urdu | urd | |
| ๐บ๐ฟ Uzbek | uzb | |
| ๐บ๐ฟ Uzbek - Cyrillic | uzb_cyrl | |
| ๐ป๐ณ Vietnamese | vie | |
| ๐ฌ๐ง Welsh | cym | |
| ๐ณ๐ฑ Western Frisian | fry | |
| ๐ฎ๐ฑ Yiddish | yid | |
| ๐ณ๐ฌ Yoruba | yor |