Question 1

What can the Nutrient Java SDK do beyond data extraction?

Accepted Answer

The SDK covers the full document lifecycle. Beyond AI-powered extraction (OCR, ICR, VLM-enhanced), it includes document conversion between PDF, Word, Excel, PowerPoint, HTML, and Markdown. It provides PDF editing with annotations, form fields, digital signatures, and content redaction, and it supports template-based document generation from Word templates with accessible PDF/UA output. All capabilities work on-premises with a single dependency.

Question 2

What document formats can I convert between?

Accepted Answer

The SDK converts Word (DOCX) to PDF and PDF to Word, Excel (XLSX) to PDF and PDF to Excel, PowerPoint (PPTX) to PDF and PDF to PowerPoint, Markdown to PDF, and PDF to HTML. It also supports Word-to-PDF/UA conversion for accessible document output. All conversions run locally without external services.

Question 3

How does the AI extraction work? What are OCR, ICR, and VLM-enhanced ICR?

Accepted Answer

OCR is the fastest engine, extracting text with word-level bounding boxes. ICR (intelligent content recognition) is an AI-powered engine that runs on-premises — it detects tables, equations, key-value regions, and document structure without external API calls. VLM-enhanced ICR adds a vision language model (Claude, OpenAI, or local) on top for the highest accuracy on complex documents. All three return structured JSON output.

Question 4

Can I process documents entirely on-premises?

Accepted Answer

Yes. All SDK capabilities — extraction, conversion, editing, and generation — run on your infrastructure. The OCR and ICR engines require zero external API calls. VLM-enhanced mode can also stay on-premises when connected to a local model server (Ollama, LM Studio, or vLLM). Document conversion, PDF editing, and template generation are entirely local with no cloud dependencies.

Question 5

How does the SDK compare to Google Document AI and AWS Textract?

Accepted Answer

Google Document AI and AWS Textract are cloud-only extraction services — your documents must be uploaded to their servers. Nutrient Java SDK runs on your infrastructure by default and offers much more than extraction alone. It combines AI-powered data extraction with document conversion, PDF editing, digital signatures, redaction, and template generation in a single dependency. You get data sovereignty, predictable costs, and a complete document processing platform — not just an extraction API.

Question 6

What PDF editing capabilities are included?

Accepted Answer

The SDK supports eight annotation types (text, free text, shapes, stamps, sticky notes, links, text markup, and redaction), form field creation and filling, visible and invisible digital signatures with advanced workflows, page management (reordering, merging, adding custom pages), and metadata editing. Redaction annotations permanently remove content from the document, supporting GDPR and HIPAA compliance.

Question 7

Can I generate documents from templates?

Accepted Answer

Yes. The SDK processes Word templates with dynamic content injection, enabling you to generate reports, contracts, and other documents programmatically from data. Template output can be converted directly to accessible PDF/UA format for compliance workflows. This is useful for automated document generation pipelines where you need consistent formatting with variable data.

Question 8

Which vision language model providers are supported for extraction?

Accepted Answer

VLM-enhanced ICR supports Anthropic Claude, OpenAI, and any OpenAI-compatible custom endpoint. The custom endpoint option works with Ollama, LM Studio, vLLM, and other local inference servers. You can switch providers with a single configuration change. Image description also supports all three provider types, giving you full control over accuracy, cost, and data privacy.

Question 9

Is the SDK suitable for compliance-sensitive industries?

Accepted Answer

Yes. On-premises processing means documents never leave your servers. Content redaction permanently removes sensitive data. Digital signatures provide document integrity and authentication. PDF/UA output meets accessibility standards. The combination of on-premises extraction, redaction, and signatures makes the SDK suitable for healthcare (HIPAA), finance (SOC 2), legal, and government (air-gapped) environments.

Question 10

How do I get started with Nutrient Java SDK?

Accepted Answer

Add Nutrient Java SDK as a dependency to your project and follow the getting started guide. All capabilities — extraction, conversion, editing, and generation — are available immediately. The documentation includes step-by-step guides for each feature area with working examples you can adapt for your use case.

Extraction, conversion, and editing in one Java SDK

One SDK for the entire document lifecycle

AI-powered extraction

Document conversion

PDF editing and redaction

Template-based generation

What you can build

AI data extraction

Document conversion

PDF editing and annotations

Forms and digital signatures

Content redaction

Document generation

Full capability map

AI extraction

Conversion

PDF editing

Generation

AI extraction that runs on your terms

On-premises by default

VLM-enhanced accuracy

Structured JSON output

AI image descriptions

Frequently asked questions