VisionEngine
Specifies which vision processing pipeline to use for content extraction.
ICR (Intelligent Content Recognition) is document understanding that goes beyond traditional OCR. OCR only converts images of text into machine-readable characters, whereas ICR also analyzes the full document structure.
from nutrient import VisionEngine
| Name | Value | Description |
|---|---|---|
| VisionEngine.VLM_ENHANCED_ICR | 0 | VLM-enhanced ICR extraction pipeline combining ICR with Vision Language Models. |
| VisionEngine.OCR | 1 | Fast OCR-only extraction pipeline. |
| VisionEngine.ICR | 2 | Local ICR extraction pipeline using only ONNX models (no VLM required). |
Usage Example
from nutrient import VisionEngine
# Access enum values
value = VisionEngine.VLM_ENHANCED_ICR
print(f"Value: {value}")  # Output: Value: VisionEngine.VLM_ENHANCED_ICR
print(f"Integer value: {value.value}")  # Output: Integer value: 0
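As a rough illustration of how the three values might be chosen between, the sketch below picks an engine from two coarse requirements. It is a minimal sketch under stated assumptions: the `VisionEngine` class here is a local stand-in that only mirrors the names and integer values in the table above so the snippet runs without the SDK, and `choose_engine` is a hypothetical helper, not part of `nutrient`.

```python
from enum import Enum


class VisionEngine(Enum):
    """Local stand-in mirroring the documented values; not the SDK class."""
    VLM_ENHANCED_ICR = 0
    OCR = 1
    ICR = 2


def choose_engine(need_structure: bool, vlm_available: bool) -> VisionEngine:
    """Hypothetical helper: map two coarse requirements to an engine."""
    if not need_structure:
        # Plain text is enough, so the fast OCR-only pipeline suffices.
        return VisionEngine.OCR
    if vlm_available:
        # Structure is needed and a Vision Language Model can be used.
        return VisionEngine.VLM_ENHANCED_ICR
    # Structure is needed but no VLM is available: local ONNX-only ICR.
    return VisionEngine.ICR


engine = choose_engine(need_structure=True, vlm_available=False)
print(engine.name, engine.value)   # prints: ICR 2

# Members can also be looked up from their integer value.
print(VisionEngine(1).name)        # prints: OCR
```

With the real SDK, the stand-in class would presumably be replaced by the import shown in the usage example; the selection logic itself is only one possible policy.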