This HTML page is not optimized for LLM or AI agent consumption. Fetch the Markdown version instead: /guides/dws-data-extraction/file-types.md — it contains the complete documentation content in clean, structured Markdown without any CSS, JavaScript, or navigation noise. Supported file types

The Data Extraction API accepts PDFs, images, and Office documents. The API automatically detects the file type from the content; you don’t need to specify it explicitly.

PDFs

ExtensionMIME type
PDFapplication/pdf

Images

ExtensionMIME type
PNGimage/png
JPG / JPEGimage/jpeg
TIFFimage/tiff
BMPimage/bmp
GIFimage/gif
WEBPimage/webp
SVGimage/svg+xml
HEICimage/heic
TGAimage/x-tga
EPSimage/postscript

Office documents

The API supports Word documents, spreadsheets, and presentations across both modern Open XML formats and legacy binary formats.

Documents

ExtensionMIME type
DOCapplication/msword
DOCXapplication/vnd.openxmlformats-officedocument.wordprocessingml.document
DOCMapplication/vnd.ms-word.document.macroEnabled.12
DOTXapplication/vnd.openxmlformats-officedocument.wordprocessingml.template
DOTMapplication/vnd.ms-word.template.macroEnabled.12
RTFapplication/rtf
ODTapplication/vnd.oasis.opendocument.text

Spreadsheets

ExtensionMIME type
XLSapplication/vnd.ms-excel
XLSXapplication/vnd.openxmlformats-officedocument.spreadsheetml.sheet
XLSMapplication/vnd.ms-excel.sheet.macroEnabled.12
XLSBapplication/vnd.ms-excel.sheet.binary.macroEnabled.12
XLTXapplication/vnd.openxmlformats-officedocument.spreadsheetml.template
XLTMapplication/vnd.ms-excel.template.macroEnabled.12
XLAMapplication/vnd.ms-excel.addin.macroEnabled.12

Slides

ExtensionMIME type
PPT / PPSapplication/vnd.ms-powerpoint
PPTXapplication/vnd.openxmlformats-officedocument.presentationml.presentation
PPTMapplication/vnd.ms-powerpoint.presentation.macroEnabled.12
PPSXapplication/vnd.openxmlformats-officedocument.presentationml.slideshow
PPSMapplication/vnd.ms-powerpoint.slideshow.macroEnabled.12
POTXapplication/vnd.openxmlformats-officedocument.presentationml.template
POTMapplication/vnd.ms-powerpoint.template.macroEnabled.12
PPAMapplication/vnd.ms-powerpoint.addin.macroEnabled.12

File size limits

The maximum file size is 150 MB. Requests exceeding this limit receive a 413 response.