---
title: "Efficient PDF content extraction operations"
canonical_url: "https://www.nutrient.io/guides/document-automation-server/content-extraction/introduction/"
md_url: "https://www.nutrient.io/guides/document-automation-server/content-extraction/introduction.md"
last_updated: "2026-06-15T16:16:06.394Z"
description: "Discover powerful PDF content extraction tools for smart splitting, renaming, and OCR text extraction tailored for SharePoint environments."
---

# Unlocking PDF processing with content extraction

Nutrient Document Automation Server (DAS) Content Extraction (formerly known as Kingfisher) provides a set of operations for processing PDF files. It includes splitting, extracting pages, extracting text, and renaming files based on the content of the files. DAS Content Extraction also provides the ability to set the security, metadata, and other attributes of a PDF document.

What makes DAS Content Extraction stand out is the ability to perform split, extract, and rename operations based on text or barcodes found in zones or coordinates in a PDF page. DAS Content Extraction can use optical character recognition (OCR) to extract text data from image PDF files or pages.

## System requirements

| Requirement                       | Value                                                                                                                                                          |
| --------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| **Supported operating systems**   | Windows 10 (64bit)
Windows 11
Windows Server 2012 (64bit)
Windows Server 2016
Windows Server 2019
Windows Server 2022 |
| **Supported SharePoint versions** | SharePoint Online (Office 365)
SharePoint Server 2019
SharePoint Server 2016
SharePoint Server 2013
SharePoint Server 2010     |
| **Disk space**                    | 450 Mb                                                                                                                                                         |
| **.NET framework**                | 4.7.2                                                                                                                                                          |

## Installation

DAS Content Extraction requires installation by a user with administrator privileges.

The installer will guide you through the process of setting up DAS Content Extraction.

### User permissions

The logged-in user for the user interface (UI) and the service user require full control of the following folders under the Kingfisher <!-- > Keeping the same kingfisher here for now because folder names are yet to be updated in the Integrations Solutions as confirmed by the team on Slack <--> installation folder:

- Conditions

- License

- Logfiles

- Status

- Temp

In addition, the user(s) will require full control of the following files in the bin folder:

- `Kingfisher.Config`

- `Kingfisher.Service.exe.config`

### Service user

When DAS Content Extraction is installed, its service logs on as the local system account. This is generally suitable for installations where the files being processed are present on the installation machine or are stored in SharePoint (online or on-premises).

Windows Server 2019 machines have slightly more restrictive permissions; ensure the service user can access the locations listed under the **User permissions** section above.

If the files are to be referenced via Universal Naming Convention (UNC) paths, an account with the required permissions on those paths will need to be created.

Log on to the computer as a user with administrator permissions.

Start the **Services** app from the **Control Panel**, or search from the Windows task bar for **Services** and launch the **Services** app.![Services App Started](@/assets/guides/document-automation-server/content-extraction/image6.png)

Find the DAS Content Extraction service, and right-click and select the **Properties** option.

Select the **Log On** tab.![Log on Properties](@/assets/guides/document-automation-server/content-extraction/image7.png)

Enter the username and password and click **OK**.

For the service to use the new credentials, it must be restarted. Right-click the DAS Content Extraction service and select **Restart**.

## Licensing and purchasing

From 31 March 2022, DAS Content Extraction (Kingfisher) is only available as a subscription product.

A subscription license will need to be renewed annually and comes with support and maintenance coverage (SMC) for the duration of the subscription.

When a license is purchased, the relevant subscription license key will be issued for use with the product. This can be entered using the **Options** tab of the graphical user interface (GUI). There’s no need to download another version of the software.

The SMC includes support by email, telephone, or remote session, and it includes free upgrades for its duration.

Existing permanent DAS Content Extraction 2.5 licenses will remain valid and continue to function as permanent licenses.

Additional SMC for existing permanent licenses will continue to be available for version 2.5.

Trial licenses are fully functional, but all generated PDF files will have the Integrations Solutions trial stamps on them, and text extraction is limited to the first three pages.

## Version breakdown

| Available features                                              | Standard                         | Professional                                        | Server                                                        |
| --------------------------------------------------------------- | -------------------------------- | --------------------------------------------------- | ------------------------------------------------------------- |
| **Use cases**                                                   | Smart PDF splitting and renaming | Smart PDF page extraction, splitting, and renaming. | Automated smart PDF page extraction, splitting, and renaming. |
| **Split PDF by content, barcode, or page ranges**               | ✔                                | ✔                                                   | ✔                                                             |
| **Rename PDF by content or barcode**                            | ✔                                | ✔                                                   | ✔                                                             |
| **Extract PDF pages by content or barcode**                     | ✔                                | ✔                                                   | ✔                                                             |
| **Includes optical recognition for processing image PDF files** | ✔                                | ✔                                                   | ✔                                                             |
| **Graphical user interface**                                    | ✔                                | ✔                                                   | ✔                                                             |
| **Process files in SharePoint**                                 | ✔                                | ✔                                                   | ✔                                                             |
| **Extract PDF text and data to CSV, Excel, or text files**      |                                  | ✔                                                   | ✔                                                             |
| **Command-line interface**                                      |                                  |                                                     | ✔                                                             |
| **Automated scheduling**                                        |                                  |                                                     | ✔                                                             |
| **Number of cores supported**                                   | 1                                | 1                                                   | 4                                                             |

---

## Related pages

- [Streamline document automation with content extraction](/guides/document-automation-server/content-extraction.md)
- [Streamline document processing with DAS](/guides/document-automation-server/content-extraction/autobahn-dx-integration.md)
- [Explore document automation and content extraction](/guides/document-automation-server/content-extraction/acknowledgements.md)
- [Organize your Document Automation Server files](/guides/document-automation-server/content-extraction/kingfisher-files-and-directories.md)
- [Streamline document automation with DAS content extraction](/guides/document-automation-server/content-extraction/kingfisher-configuration-file-kingfisherconfig.md)
- [Document Automation Server Content Extraction release notes](/guides/document-automation-server/content-extraction/release-notes.md)
- [Efficient content extraction for diverse PDF files](/guides/document-automation-server/content-extraction/support.md)
- [How to use trigger files for job automation](/guides/document-automation-server/content-extraction/trigger-files.md)
- [Efficient content extraction with DAS command line](/guides/document-automation-server/content-extraction/using-kingfisher-from-the-command-line.md)
- [Streamline document automation with XML job files](/guides/document-automation-server/content-extraction/xml-job-condition-files.md)

