Document AI MCP Server

Natural‑language PDF operations – prepared for LLMs

Hand your LLM a prompt and get a finished, compliant PDF back. MCP Server plugs Claude, GPT, or any agent framework straight into Nutrient’s battle‑tested DWS APIs, so document workflows run on words, not glue code. One‑line install, docker‑friendly, streams over HTTP or stdio.

Watch the live demo

See this short demo of what Nutrient MCP Server is capable of today.

Built for the LLM era

SDK sidekicks — GitHub Copilot snippets and the upcoming llms.txt metadata surface the right Nutrient calls directly in your IDE.

Agent‑ready interfaces — Deterministic JSON schema keeps LangGraph, OpenAI Agents, and Claude Desktop predictable.

Sandboxed execution — Every job runs in an isolated workspace; secrets and file systems stay safe.

Stateless streaming — Long‑running PDF operations stream back in real time, so your agents never hit token limits.We prioritize meaningful results over busyness.

Business outcome — Launch AI features months faster while slashing integration risk.

What you can ask examples

Zero‑code – no scripts or custom code required.

“Convert my DOCX to PDF.”

DOCX → PDF conversion via DWS, file returned to sandbox

“Convert all PDFs to DOCX.”

Batch loop converts every PDF in directory

“Extract the table from invoice.pdf and print it.”

Table-extraction model returns structured JSON

“Perform key-value extraction on my credit-card application.”

Auto-detects file, returns names, dates, SSNs

“Sign invoice.pdf and watermark it with PAID.”

Adds visible watermark, embeds PAdES signature

Core capabilities

Batch convert

PDF ⇄ DOCX/HTML/PNG with a single prompt.

see it in action

AI redaction

Scrub PII, PHI, or card data; outputs lossless, selectable text.

see it in action

Table & key‑value extraction

Get clean JSON you can pipe into analytics or RAG.

see it in action

Digital signatures & watermarks

PAdES‑compliant signatures and branded overlays in one shot.

see it in action

Merge & assemble

Stitch hundreds of docs into audit‑ready reports.

see it in action

OCR & PDF/A‑UA

Multi‑language OCR and accessible archival formats.

see it in action

Implementation in three simple steps

1

Install & launch locally:

npx nutrient-mcp-server --sandbox ./sandbox --port 4000

2

Ask Claude something like:

Convert all PDFs to DOCX and save them.

3

Watch files appear in ./sandbox

Prefer Docker?

docker run -p 4000:4000 ghcr.io/nutrient/mcp-server:latest.

Performance you can trust

12,000

Docs processed per day

320 ms

Avg. latency

< 5 min

First call to "done"

18

pull requests merged last month. Open-source under MIT

Coming soon

Document Engine MCP inside the container

Natural‑language control baked into every on‑prem deployment, zero extra microservices.

Native Windows build & one‑click PowerShell install

Put zero‑code PDF workflows on internal desktops and RPA bots.

Drag‑and‑drop agent templates

Pre‑wired chains for contract review, onboarding, and compliance you can remix in minutes.

Frequently asked questions

What is Nutrient MCP Server?

Nutrient MCP Server is an open‑source microservice that translates natural‑language prompts from any large‑language model (LLM) into deterministic JSON calls to Nutrient Document Web Services (DWS) APIs, then streams back finished PDFs, structured JSON, or images – no glue code required.

How do I call MCP Server from GPT‑4o, Claude, or LangGraph agents?

You expose the server’s JSON schema as a function/tool and let the LLM choose the right operation. Examples for OpenAI “function calling,” Anthropic “tool” messages, and LangGraph nodes live in the repo.

Do I really need zero code?

Yes. Spin up a sandbox with npx nutrient-mcp-server or the Docker image, then ask: “Convert all PDFs to DOCX and save them.” The agent does the rest – no scripts, SDKs, or config files.

How does MCP Server keep documents and secrets secure?

Every job runs in an isolated sandbox directory; streams are stateless and can travel over TLS. No files or tokens ever leave the workspace unless you explicitly export them.

Which document operations ship out of the box?

Conversion, merge, split, OCR, AI redaction (PII/PHI/PCI), table & key‑value extraction, PDF/A‑UA, PAdES signatures, watermarks, and more – matching Nutrient’s full DWS feature set.

Is MCP Server really open source?

Yes – MIT license on GitHub. Fork it, extend it, or file an issue; eighteen pull requests were merged last month.

Where can I run it and what are the requirements?

Linux/x86‑64 or Apple Silicon, Node 20+; one‑line Docker/Kubernetes deploy. A native Windows build and PowerShell installer are in final testing.

What’s coming next and why should I care?

Soon the Document Engine container will ship with MCP baked in, giving every on‑prem customer natural‑language control by default. Drag‑and‑drop agent templates for contract review, onboarding, and compliance will follow – slashing your time‑to‑value even further. Subscribe on GitHub to stay in the loop.