This HTML page is not optimized for LLM or AI agent consumption. Fetch the Markdown version instead: /guides/dws-data-extraction/api-overview.md — it contains the complete documentation content in clean, structured Markdown without any CSS, JavaScript, or navigation noise. API overview

The Data Extraction API is an HTTP API hosted at https://api.nutrient.io. It provides endpoint-specific capabilities for extracting structured content from documents.

Base URL

https://api.nutrient.io

All endpoints are relative to this base URL.

Authentication

Include your API key in the Authorization header with every request:

Authorization: Bearer pdf_live_...

API keys are available in the Data Extraction API dashboard(opens in a new tab). Keys starting with pdf_live_ are for production use. Keys starting with pdf_test_ are for testing with limitations.

Available endpoints

EndpointDescription
POST /extraction/parseExtract structured elements or Markdown from documents. Supports four processing modes (text, structure, understand, agentic) and two output formats (spatial elements, Markdown).

Further details