---
title: "ContentExtractionSettings"
canonical_url: "https://www.nutrient.io/api/python/settings/vision/advanced/content-extraction-settings/"
md_url: "https://www.nutrient.io/api/python/settings/vision/advanced/content-extraction-settings.md"
last_updated: "2026-06-12T15:43:20.748Z"
description: "Settings for ContentExtraction. Values fall back through three levels: document → SDK → built-in default."
---

Settings for ContentExtraction. Values fall back through three levels: document → SDK → built-in default. A write made through a document's settings applies to that document only; a write made through `SdkSettings` applies SDK-wide.

**Tags:** `Vision`, `Advanced`

```python

from nutrient_sdk import ContentExtractionSettings

```
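The three-level fallback can be pictured with a small pure-Python model. This is only a sketch of the lookup order, not the SDK's implementation: `resolve_setting` is a hypothetical helper, and using `None` to mean "not set at this level" is an assumption made for illustration.

```python
from typing import Optional, TypeVar

T = TypeVar("T")


def resolve_setting(document_value: Optional[T], sdk_value: Optional[T], builtin_default: T) -> T:
    """Return the first value that is set: document, then SDK, then built-in default."""
    # Assumption: None means "no value was written at this level".
    if document_value is not None:
        return document_value
    if sdk_value is not None:
        return sdk_value
    return builtin_default


# Nothing written anywhere: the built-in default is used.
no_override = resolve_setting(None, None, 0.5)

# Only an SDK-wide value was written: it replaces the built-in default.
sdk_only = resolve_setting(None, 0.7, 0.5)

# A document-level value takes precedence, even when it is falsy.
doc_override = resolve_setting(False, True, True)
```

Comparing against `None` rather than testing truthiness matters here, because `False` and `0.0` are legitimate values for these settings.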

## Construction

`ContentExtractionSettings` is accessed through a [`Document`](/api/python/document/) instance for per-document overrides, or via [`SdkSettings`](/api/python/settings/document/sdk-settings/) for SDK-wide defaults.

```python

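# Assumed import path, mirroring the ContentExtractionSettings import above
from nutrient_sdk import Document, SdkSettings

# Per-document override

with Document.open("input.pdf") as doc:
    settings = doc.settings.content_extraction_settings
    settings.enable_table_extraction = False   # mutate fields directly

# SDK-wide default (applies to documents without their own override)

SdkSettings.content_extraction_settings.minimum_zone_confidence = 0.7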

```

Settings are configured by writing to fields on the returned object. The settings property itself cannot be reassigned — `doc.settings.content_extraction_settings = other_settings` is rejected.
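The "mutate fields, do not reassign" rule resembles a read-only Python property. The sketch below models that pattern with hypothetical stand-in classes (`ExtractionSettingsModel`, `DocumentSettingsModel`); it does not use the SDK, and the `AttributeError` it catches is what a setter-less property raises in plain Python, which may differ from the error the SDK reports.

```python
from dataclasses import dataclass


@dataclass
class ExtractionSettingsModel:
    """Hypothetical stand-in for the settings object; not the SDK class."""
    enable_table_extraction: bool = True


class DocumentSettingsModel:
    """Hypothetical container that exposes the settings object as read-only."""

    def __init__(self) -> None:
        self._content_extraction_settings = ExtractionSettingsModel()

    @property
    def content_extraction_settings(self) -> ExtractionSettingsModel:
        # No setter is defined, so assigning to this property fails.
        return self._content_extraction_settings


settings_owner = DocumentSettingsModel()

# Writing to a field on the returned object works.
settings_owner.content_extraction_settings.enable_table_extraction = False

# Replacing the settings object itself does not.
try:
    settings_owner.content_extraction_settings = ExtractionSettingsModel()
    reassignment_rejected = False
except AttributeError:
    reassignment_rejected = True
```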

## Properties

### enable_full_page_ocr_fallback

```python

@property
def enable_full_page_ocr_fallback(self) -> bool

@enable_full_page_ocr_fallback.setter
def enable_full_page_ocr_fallback(self, value: bool) -> None

```

Indicates whether to run full-page OCR when no zones are detected. The fallback provides a simple OCR-only pipeline without segmentation.

**Type:** `bool`

**Default:** `True`
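As a rough illustration of when the fallback applies, the following sketch models the decision in plain Python. `plan_page_processing` and its return labels are hypothetical, and the `"skip_page"` outcome for a page with no zones and the fallback disabled is an assumption; this page does not describe that case.

```python
from typing import List


def plan_page_processing(detected_zones: List[str], enable_full_page_ocr_fallback: bool = True) -> str:
    """Pick a processing path for one page (illustrative model only)."""
    if detected_zones:
        # Zones were found, so the zone-based pipeline handles the page.
        return "process_zones"
    if enable_full_page_ocr_fallback:
        # No zones found: run OCR over the whole page instead.
        return "full_page_ocr"
    # Assumption: with the fallback off, a page without zones yields nothing.
    return "skip_page"


with_zones = plan_page_processing(["text", "table"])
fallback_on = plan_page_processing([])
fallback_off = plan_page_processing([], enable_full_page_ocr_fallback=False)
```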

---

### enable_image_extraction

```python

@property
def enable_image_extraction(self) -> bool

@enable_image_extraction.setter
def enable_image_extraction(self, value: bool) -> None

```

Indicates whether image metadata extraction is enabled for image zones.

**Type:** `bool`

**Default:** `True`

---

### enable_ocr_extraction

```python

@property
def enable_ocr_extraction(self) -> bool

@enable_ocr_extraction.setter
def enable_ocr_extraction(self, value: bool) -> None

```

Indicates whether OCR extraction is enabled for text zones.

**Type:** `bool`

**Default:** `True`

---

### enable_table_extraction

```python

@property
def enable_table_extraction(self) -> bool

@enable_table_extraction.setter
def enable_table_extraction(self, value: bool) -> None

```

Indicates whether table structure extraction is enabled for table zones.

**Type:** `bool`

**Default:** `True`
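The three per-zone flags (`enable_image_extraction`, `enable_ocr_extraction`, `enable_table_extraction`) each gate one kind of zone. A minimal model of that mapping is sketched below. The zone type strings (`"text"`, `"table"`, `"image"`), the `extractor_enabled` helper, and the handling of unknown zone types are all assumptions for illustration, not SDK behavior.

```python
from typing import Dict

# Hypothetical zone type labels mapped to the documented flag names.
FLAG_FOR_ZONE: Dict[str, str] = {
    "text": "enable_ocr_extraction",
    "table": "enable_table_extraction",
    "image": "enable_image_extraction",
}


def extractor_enabled(zone_type: str, flags: Dict[str, bool]) -> bool:
    """Report whether extraction would run for a zone of the given type."""
    flag_name = FLAG_FOR_ZONE.get(zone_type)
    if flag_name is None:
        # Assumption: zone types without a matching flag are not extracted.
        return False
    # Flags that were never written keep their documented default (enabled).
    return flags.get(flag_name, True)


flags = {"enable_table_extraction": False}
table_runs = extractor_enabled("table", flags)
text_runs = extractor_enabled("text", flags)
unknown_runs = extractor_enabled("figure", flags)
```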

---

### minimum_zone_confidence

```python

@property
def minimum_zone_confidence(self) -> float

@minimum_zone_confidence.setter
def minimum_zone_confidence(self, value: float) -> None

```

Minimum confidence, from 0.0 to 1.0, that a zone must reach to be processed. Zones below this threshold are skipped.

**Type:** `float`

**Default:** `0.5`
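The effect of the threshold can be sketched as a simple filter. The `(zone_type, confidence)` tuples and the `zones_to_process` helper are hypothetical; the range check is also an assumption, since this page does not say how the SDK treats out-of-range values.

```python
from typing import List, Tuple

Zone = Tuple[str, float]  # hypothetical (zone_type, confidence) pair


def zones_to_process(zones: List[Zone], minimum_zone_confidence: float = 0.5) -> List[Zone]:
    """Keep zones whose confidence is not below the threshold."""
    if not 0.0 <= minimum_zone_confidence <= 1.0:
        # Assumption: values outside the documented range are treated as errors.
        raise ValueError("minimum_zone_confidence must be between 0.0 and 1.0")
    return [zone for zone in zones if zone[1] >= minimum_zone_confidence]


zones = [("text", 0.9), ("table", 0.5), ("image", 0.3)]
kept_default = zones_to_process(zones)
kept_strict = zones_to_process(zones, minimum_zone_confidence=0.95)
```

A zone whose confidence equals the threshold is kept in this model, because only zones below the threshold are described as skipped.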

---

## Related pages

- [AiAugmenterSettings](/api/python/settings/vision/advanced/ai-augmenter-settings.md)
- [ClaudeApiSettings](/api/python/settings/vision/advanced/claude-api-settings.md)
- [DocumentLayoutJsonExportSettings](/api/python/settings/vision/advanced/document-layout-json-export-settings.md)
- [DeskewSettings](/api/python/settings/vision/advanced/deskew-settings.md)
- [HandwritingSettings](/api/python/settings/vision/advanced/handwriting-settings.md)
- [Advanced](/api/python/settings/vision/advanced.md)
- [OpenAiApiEndpointSettings](/api/python/settings/vision/advanced/open-ai-api-endpoint-settings.md)
- [VisionDescriptorSettings](/api/python/settings/vision/advanced/vision-descriptor-settings.md)
- [TableRecognitionSettings](/api/python/settings/vision/advanced/table-recognition-settings.md)

