Recognize text in scanned PDFs and images

Make your documents searchable, selectable, and ready for extraction using powerful OCR tools — available through the Web SDK with Document Engine or our C# .NET OCR library.


Convert scans to searchable text

Turn scanned PDFs and image files into fully searchable, selectable documents with embedded text layers — ideal for indexing, compliance, and downstream processing.

Extract data for automation

Pair OCR with our data extraction API to identify key values, detect tables, and power intelligent workflows.

Work with 30+ languages

Recognize text across dozens of built-in languages, including accented scripts, symbols, and multilingual files.

Enable compliance and archiving

Generate PDF/A outputs with full text layer for eDiscovery, records management, and accessibility.


HOW IT WORKS

Run OCR on your PDF documents

Transform image-based PDFs into searchable, selectable content — ready for annotation, editing, or data extraction — with just three intuitive steps.

KEY FEATURES

Fast, multilingual OCR with flexible output options

Unlock the text inside scanned PDFs and images — with export options for downstream workflows.

Image of a scanned document with the ability to highlight and select text
Searchable PDF generation

Make scanned documents searchable and highlightable.


Multi-language support

OCR more than 30 built-in languages, including those with non-Latin character sets.


Text layer embedding

Embed a selectable text layer beneath the visual content — while preserving the original layout.


Data extraction integration

Pair OCR with our DWS-based data extraction API to identify tables and key-value pairs and trigger automated workflows.


One-call API execution

Trigger OCR in production with a single operation using Web SDK and Document Engine.


Mobile OCR support

Process documents offline using SDKs for iOS, Android, Mac Catalyst, React Native, and Flutter.


Language fallback and multi-language detection

Process multilingual documents with ease.


.NET SDK

Enterprise-grade OCR with AI preprocessing and zonal control

If you’re building in C# or need more advanced OCR capabilities, Nutrient .NET SDK offers a powerful alternative with expanded functionality:


  • OCR more than 100 languages and 100+ file types
  • Zonal OCR and custom region scanning
  • Image preprocessing (deskew, noise removal, auto-enhance)
  • Orientation detection and layout retention
  • Confidence scoring and ML-powered structure recognition
Use OCR to search images and PDFs

Supported on your platform



Run it your way

Whether you’re building for web, mobile, or the backend, Nutrient gives you full flexibility — all with a consistent developer experience.

Client-side SDKs

Use the web, mobile, and cross-platform SDKs to embed document capabilities directly into your app — fully client-side, no backend required


Server-backed control

Use Document Engine for server-backed processing, data control, and deep infrastructure integration


Cloud-hosted scale

Use Document Web Services (DWS) for instant scale, with cloud-hosted APIs for viewing, annotating, converting, and more


WEB PLAYGROUND

Looking for the code? Come play in our sandbox



REAL-WORLD IMPACT

Built for scale. Trusted in production.

Nutrient powers mission-critical document workflows for global enterprises — from compliance-heavy industries to modern SaaS teams.


Streamlined construction project collaboration with in-app markup and PDF tools — helping teams coordinate more effectively from jobsite to back office.


Used Nutrient to enable fast, focused board pack review with in-document annotations and threaded feedback — enhancing governance workflows across schools and nonprofits.


Empowers 34,000 pilots to view, annotate, and sign 90‑page flight releases on iPad using Nutrient iOS SDK, saving minutes — and money — on every flight.

FREE TRIAL

Add OCR to your app — no credit card required

Frequently asked questions

What is a PDF OCR SDK and how does it work?

A PDF OCR SDK (optical character recognition software development kit) enables developers to integrate text recognition into apps, making scanned PDFs and image-based documents searchable and selectable. It works by detecting and converting text from rasterized content into a machine-readable layer that supports search, annotation, and editing.

What platforms and devices does Nutrient’s OCR SDK support?

Nutrient’s OCR functionality is available across Web (with Document Engine); .NET (C#); and mobile platforms, including iOS, Android, Mac Catalyst, React Native, and Flutter. This allows developers to deploy OCR across desktop, server, and mobile environments with a consistent API and architecture.

Can the SDK recognize text in multiple languages or complex layouts?

Yes. Nutrient OCR supports more than 30 languages (and 100+ in .NET), Unicode, and smart features like multi-language fallback and orientation detection — helping teams handle global content, mixed-language files, and skewed scans.

How does OCR improve accessibility and search?

OCR transforms image-based documents into machine-readable text, making them accessible to screen readers, and enabling users to search, copy, and extract content. This is critical for digital accessibility, legal archiving, and productivity tools.

What’s the best way to extract data after OCR?

Pair OCR with Nutrient’s Data Extraction API to identify tables, key-value pairs, or form fields. Extracted data can then feed into downstream workflows like automation, analytics, or business logic.

Is OCR available offline?

Yes. On-device OCR is available in our mobile SDKs, allowing apps to recognize text without an internet connection. This is ideal for field work, remote teams, or environments with strict privacy requirements.

How do I start using OCR in my app?

Use the Web SDK with Document Engine or the appropriate native SDK. Import your file, define the language and page range, and trigger the OCR operation. Nutrient provides robust guides, samples, and a free trial to get started.


OCR and text recognition for real-world workflows

Modern teams rely on OCR to unlock the value hidden in scanned PDFs and images — whether on the web, mobile, or desktop. From legal archives to field service forms, Nutrient’s OCR SDKs help teams move faster, work smarter, and stay compliant.

Common use cases:
  • Convert scanned PDFs into fully searchable, selectable documents
  • Extract data from contracts, invoices, forms, and ID cards
  • Digitize multilingual records with Unicode support and fallback
  • Enable search and screen reader access for compliance and accessibility
  • Capture documents in the field using on-device OCR for mobile apps
Key features developers look for:
  • Cross-platform support (Web, .NET, iOS, Android, Mac Catalyst, Flutter, React Native)
  • More than 30 supported languages (100+ in .NET)
  • Fast, accurate text recognition — even on skewed or noisy scans
  • Export as searchable PDF, structured data, or editable content
  • Easy integration with guides, samples, and a unified OCR API

Looking for an OCR SDK that works where your users do? Nutrient makes it easy to add enterprise-grade OCR to your app — no complex setup or external tools required.