---
title: "PDF to JSON conversion automation"
canonical_url: "https://www.nutrient.io/guides/document-automation-server/document-conversion/pdf-recognition-to-json-job-step/"
md_url: "https://www.nutrient.io/guides/document-automation-server/document-conversion/pdf-recognition-to-json-job-step.md"
last_updated: "2026-05-25T12:14:42.836Z"
description: "Easily convert searchable PDFs to JSON by extracting key-value pairs with versatile key recognition and synonyms for varying formats."
---

# Extract key-value pairs from PDFs to JSON

The PDF recognition to JSON step automatically extracts important data from searchable PDF files in the form of key-value pairs. The output is given as a JSON file that contains each expected key, along with its value pair.

A UI program exists to test PDF files and show what data pairs will be extracted from the file. This program can be found at:\
`“<Autobahn DX Installation directory>\distribution\recognition\AquaforestDataExtractorUI.exe”`.

You must use an **Expected Key** file to tell Nutrient Document Automation Server (DAS) — previously known as Autobahn DX — which keys to extract from the input files. You can also specify synonyms for your keys, so that values paired with any synonym will also be extracted with the key. This is useful when processing files with varying formats and different ways of framing the same data. The example **Expected Key** file below highlights how this file can be used to cover multiple ways of naming for the same key.

Read about [Job Designer](https://www.nutrient.io/guides/document-automation-server/document-conversion/the-autobahn-dx-administration-tool/job-designer.md#step-type-properties) for more information about each step property.

Example of an **Expected Key** file:

```json

{

"expectedKeys": \[

{

"expectedKey": "Invoice No",

"synonyms": \[

"Invoice Number",

"Invoice No.",

"Invoice Num"

\]

},

{

"expectedKey": "Inv Date",

"synonyms": \[

"Invoice Date",

"Inv. Date",

"Inv date"

\]

},

{

"expectedKey": "Reference",

"synonyms": \[ \]

},

{

"expectedKey": "City/State/Zip",

"synonyms": \[ "Postcode" \]

}

\]

}

```
---

## Related pages

- [Explore document automation server directories](/guides/document-automation-server/document-conversion/autobahn-dx-directories.md)
- [Comprehensive guide to document automation servers](/guides/document-automation-server/document-conversion/acknowledgments.md)
- [Efficiently execute jobs with the DAS .NET API](/guides/document-automation-server/document-conversion/autobahn-dx-net-api.md)
- [Streamline your document processing jobs](/guides/document-automation-server/document-conversion/autobahn-dx-quick-start.md)
- [Essential configuration files for Document Automation Server](/guides/document-automation-server/document-conversion/autobahn-dx-configuration-files.md)
- [Split and rename PDF files using barcode detection](/guides/document-automation-server/document-conversion/barcode-support.md)
- [Streamline PDF management with advanced automation](/guides/document-automation-server/document-conversion/dascontentextraction-kingfisher-job-step.md)
- [Managing Windows file name length in DAS](/guides/document-automation-server/document-conversion/file-name-length.md)
- [Optimizing load balancing with distributed polling](/guides/document-automation-server/document-conversion/distributed-polling.md)
- [Enhance document processing with the .NET SDK](/guides/document-automation-server/document-conversion/gdpicture-additional-steps.md)
- [Enhanced cloud OCR capabilities for text recognition](/guides/document-automation-server/document-conversion/cloud-ocr.md)
- [Essential document automation server installation guide](/guides/document-automation-server/document-conversion/installation-and-licensing.md)
- [Streamline document automation with XML job definitions](/guides/document-automation-server/document-conversion/job-definition-xml-files.md)
- [Streamline your document conversion processes](/guides/document-automation-server/document-conversion.md)
- [Enhance document processing with multicore support](/guides/document-automation-server/document-conversion/multicore-support.md)
- [Automate document processing with custom scripts](/guides/document-automation-server/document-conversion/scripting-custom-steps.md)
- [Optimize OCR with advanced preprocessing options](/guides/document-automation-server/document-conversion/ocr-properties-file-and-the-advanced-pre-processing-option.md)
- [Streamline document processing with automation](/guides/document-automation-server/document-conversion/product-overview-and-concepts.md)
- [Effortlessly convert files to PDF with ToPDF](/guides/document-automation-server/document-conversion/topdf-bcl-easypdf.md)
- [Upgrade Document Automation Server](/guides/document-automation-server/document-conversion/release-notes.md)

