Make scanned PDFs searchable with OCR

Make your documents searchable, selectable, and ready for extraction using powerful OCR tools — available through the Web SDK with Document Engine or our C# .NET OCR library.

LAUNCH DEMO VIEW GUIDES

Convert scans to searchable text

Turn scanned PDFs and image files into fully searchable, selectable documents with embedded text layers — ideal for indexing, compliance, and downstream processing.

Extract data for automation

Pair OCR with our data extraction API to identify key values, detect tables, and power intelligent workflows.

Work with 30+ languages

Recognize text across dozens of built-in languages, including accented scripts, symbols, and multilingual files.

Enable compliance and archiving

Generate PDF/A outputs with full text layer for eDiscovery, records management, and accessibility.

HOW IT WORKS

Run OCR on your PDF documents

Transform image-based PDFs into searchable, selectable content — ready for annotation, editing, or data extraction — with just three intuitive steps.

KEY FEATURES

Fast, multilingual OCR with flexible output options

Unlock the text inside scanned PDFs and images — with export options for downstream workflows.

EXPLORE GUIDES

Image of a scanned document with the ability to highlight and select text

Searchable PDF generation

Make scanned documents searchable and highlightable.

Multi-language support

OCR more than 30 built-in languages, including those with non-Latin character sets.

Text layer embedding

Embed a selectable text layer beneath the visual content — while preserving the original layout.

Data extraction integration

Pair OCR with our DWS-based data extraction API to identify tables and key-value pairs and trigger automated workflows.

One-call API execution

Trigger OCR in production with a single operation using Web SDK and Document Engine.

Mobile OCR support

Process documents offline using SDKs for iOS, Android, Mac Catalyst, React Native, and Flutter.

Language fallback and multi-language detection

Process multilingual documents with ease.

.NET SDK

Enterprise-grade OCR with AI preprocessing and zonal control

If you’re building in C# or need more advanced OCR capabilities, Nutrient .NET SDK offers a powerful alternative with expanded functionality:

EXPLORE GUIDES TRY IT

OCR more than 100 languages and 100+ file types
Zonal OCR and custom region scanning
Image preprocessing (deskew, noise removal, auto-enhance)
Orientation detection and layout retention
Confidence scoring and ML-powered structure recognition

Supported on your platform

Related components

SHOW ALL

Run it your way

Whether you’re building for web, mobile, or the backend, Nutrient gives you full flexibility — all with a consistent developer experience.

EXPLORE DEPLOYMENT

Client-side SDKs

Use the web, mobile, and cross-platform SDKs to embed document capabilities directly into your app — fully client-side, no backend required

Server-backed control

Use Document Engine for server-backed processing, data control, and deep infrastructure integration

Cloud-hosted scale

Use Document Web Services (DWS) for instant scale, with cloud-hosted APIs for viewing, annotating, converting, and more

WEB PLAYGROUND

Looking for the code? Come play in our sandbox

LAUNCH PLAYGROUND

Discover why developers love Nutrient SDKs

EXPLORE CUSTOMER FEEDBACK

"Great tool that covers one of our product’s core functionalities."

Dmitro H.

PRINCIPAL ENGINEER

"A true partner in delivering value for our customers."

KIM T.

Small Business (50 or fewer employees)

"The software is reliable and easy to integrate with."

VERIFIED USER IN COMPUTER SOFTWARE

MID-MARKET (51–1,000 EMPLOYEES)

REAL-WORLD IMPACT

Built for scale. Trusted in production.

Nutrient powers mission-critical document workflows for global enterprises — from compliance-heavy industries to modern SaaS teams.

Streamlined construction project collaboration with in-app markup and PDF tools — helping teams coordinate more effectively from jobsite to back office.

FULL STORY

Used Nutrient to enable fast, focused board pack review with in-document annotations and threaded feedback — enhancing governance workflows across schools and nonprofits.

FULL STORY

Empowers 34,000 pilots to view, annotate, and sign 90‑page flight releases on iPad using Nutrient iOS SDK, saving minutes — and money — on every flight.

FULL STORY

FREE TRIAL

Add OCR to your app — no credit card required

Try for free

Frequently asked questions

What is a PDF OCR SDK and how does it work?

A PDF OCR SDK (optical character recognition software development kit) enables developers to integrate text recognition into apps, making scanned PDFs and image-based documents searchable and selectable. It works by detecting and converting text from rasterized content into a machine-readable layer that supports search, annotation, and editing.

What platforms and devices does Nutrient’s OCR SDK support?

Nutrient’s OCR functionality is available across Web (with Document Engine); .NET (C#); and mobile platforms, including iOS, Android, Mac Catalyst, React Native, and Flutter. This allows developers to deploy OCR across desktop, server, and mobile environments with a consistent API and architecture.

Can the SDK recognize text in multiple languages or complex layouts?

Yes. Nutrient OCR supports more than 30 languages (and 100+ in .NET), Unicode, and smart features like multi-language fallback and orientation detection — helping teams handle global content, mixed-language files, and skewed scans.

How does OCR improve accessibility and search?

OCR transforms image-based documents into machine-readable text, making them accessible to screen readers, and enabling users to search, copy, and extract content. This is critical for digital accessibility, legal archiving, and productivity tools.

What’s the best way to extract data after OCR?

Pair OCR with Nutrient’s Data Extraction API to identify tables, key-value pairs, or form fields. Extracted data can then feed into downstream workflows like automation, analytics, or business logic.

Is OCR available offline?

Yes. On-device OCR is available in our mobile SDKs, allowing apps to recognize text without an internet connection. This is ideal for field work, remote teams, or environments with strict privacy requirements.

How do I start using OCR in my app?

Use the Web SDK with Document Engine or the appropriate native SDK. Import your file, define the language and page range, and trigger the OCR operation. Nutrient provides robust guides, samples, and a free trial to get started.

SOLUTION OVERVIEW

For structured data extraction, try our OCR solution

See solution

OCR and text recognition for real-world workflows

Modern teams rely on OCR to unlock the value hidden in scanned PDFs and images — whether on the web, mobile, or desktop. From legal archives to field service forms, Nutrient’s OCR SDKs help teams move faster, work smarter, and stay compliant.

Common use cases:

Convert scanned PDFs into fully searchable, selectable documents
Extract data from contracts, invoices, forms, and ID cards
Digitize multilingual records with Unicode support and fallback
Enable search and screen reader access for compliance and accessibility
Capture documents in the field using on-device OCR for mobile apps

Key features developers look for:

Cross-platform support (Web, .NET, iOS, Android, Mac Catalyst, Flutter, React Native)
More than 30 supported languages (100+ in .NET)
Fast, accurate text recognition — even on skewed or noisy scans
Export as searchable PDF, structured data, or editable content
Easy integration with guides, samples, and a unified OCR API

Looking for an OCR SDK that works where your users do? Nutrient makes it easy to add enterprise-grade OCR to your app — no complex setup or external tools required.

Make scanned PDFs searchable with OCR

Convert scans to searchable text

Extract data for automation

Work with 30+ languages

Enable compliance and archiving

Run OCR on your PDF documents

Import a document

Set recognition preferences

Interact with the results

Fast, multilingual OCR with flexible output options

Searchable PDF generation

Multi-language support

Text layer embedding

Data extraction integration

One-call API execution

Mobile OCR support

Language fallback and multi-language detection

Enterprise-grade OCR with AI preprocessing and zonal control

Supported on your platform

Related components

Annotations

Content Editor

Comments and Replies

PDF-to-Office Conversion

Run it your way

Client-side SDKs

Server-backed control

Cloud-hosted scale

Looking for the code? Come play in our sandbox

Discover why developers love Nutrient SDKs

Built for scale. Trusted in production.

Add OCR to your app — no credit card required

Frequently asked questions

For structured data extraction, try our OCR solution

OCR and text recognition for real-world workflows

Common use cases:

Key features developers look for: