Nutrient Java SDK/io.nutrient.sdk.enums/VisionEngine

VisionEngine

Specifies which vision processing pipeline to use for content extraction. ICR (Intelligent Content Recognition) is document understanding that goes beyond traditional OCR: where OCR only converts images of text into machine-readable characters, ICR also analyzes the full document structure.

Entries

VlmEnhancedIcr

VLM-enhanced ICR extraction pipeline combining ICR with Vision Language Models. Requires a VLM provider to be configured, with appropriate API credentials (if required). The VLM improves extraction quality by adding contextual understanding.

AdaptiveOcr

Adaptive OCR pipeline: heuristic-first with OCR fallback per page. Born-digital pages are extracted directly from the PDF content stream — no rasterization, no segmentation, no OCR — yielding sub-2 s/page throughput on typical documents. Image-based or non-PDF pages transparently fall through to OCR so callers don't need to know the document type up front. Does not require a VLM provider.

Icr

Local ICR extraction pipeline using only small models (no VLM required). Provides full document layout analysis and content extraction using local inference only. This mode runs entirely offline without requiring external API calls, making it suitable for air-gapped environments or when VLM setup/costs/latency are a concern.
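The trade-offs between the three entries can be sketched as a small selection helper. This is only an illustration: the nested enum is a stand-in that mirrors the constants listed above so the example compiles without the SDK, and `chooseEngine` with its two flags is a hypothetical helper, not part of the Nutrient API. The policy it encodes (prefer the VLM pipeline whenever a provider is configured) is an assumption; cost or latency concerns may justify a different choice.

```java
public class VisionEngineSelection {

    // Stand-in for the SDK enum so this sketch is self-contained.
    // Real code would import io.nutrient.sdk.enums.VisionEngine instead.
    enum VisionEngine { VlmEnhancedIcr, AdaptiveOcr, Icr }

    // Hypothetical helper: picks an engine from two caller-supplied facts.
    static VisionEngine chooseEngine(boolean vlmProviderConfigured,
                                     boolean needsLayoutAnalysis) {
        if (vlmProviderConfigured) {
            // Only valid when a VLM provider (and credentials, if required) is set up.
            return VisionEngine.VlmEnhancedIcr;
        }
        if (needsLayoutAnalysis) {
            // Full layout analysis with local inference only; no external API calls.
            return VisionEngine.Icr;
        }
        // Direct extraction for born-digital pages, OCR fallback for the rest.
        return VisionEngine.AdaptiveOcr;
    }

    public static void main(String[] args) {
        // An air-gapped deployment that still needs document structure.
        System.out.println(chooseEngine(false, true)); // prints "Icr"
    }
}
```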

Functions

valueOf

public static VisionEngine valueOf(String name)

Returns the enum constant of this type with the specified name. The string must match exactly an identifier used to declare an enum constant in this type. (Extraneous whitespace characters are not permitted.)
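Because the match is exact and case-sensitive, `valueOf` throws `IllegalArgumentException` for an unknown name and `NullPointerException` for `null`, as with any Java enum. A minimal sketch of reading the engine from untrusted input such as a configuration value follows. The nested enum is a stand-in for the SDK type, and `parseEngine` is a hypothetical helper introduced only for this example.

```java
import java.util.Optional;

public class VisionEngineParsing {

    // Stand-in for the SDK enum so this sketch is self-contained.
    // Real code would import io.nutrient.sdk.enums.VisionEngine instead.
    enum VisionEngine { VlmEnhancedIcr, AdaptiveOcr, Icr }

    // Hypothetical helper: returns an empty Optional instead of throwing.
    static Optional<VisionEngine> parseEngine(String raw) {
        if (raw == null) {
            return Optional.empty(); // valueOf(null) would throw NullPointerException
        }
        try {
            // valueOf does not tolerate surrounding whitespace, so trim first.
            return Optional.of(VisionEngine.valueOf(raw.trim()));
        } catch (IllegalArgumentException e) {
            // Unknown or differently cased name, for example "icr".
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        System.out.println(parseEngine(" AdaptiveOcr ").isPresent()); // prints "true"
        System.out.println(parseEngine("adaptiveocr").isPresent());   // prints "false"
    }
}
```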

values

public static VisionEngine[] values()

Returns an array containing the constants of this enum type, in the order they're declared. This method may be used to iterate over the constants.
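As a brief illustration of iterating over the constants, the sketch below collects their names, which could populate a settings dropdown or a help message. The nested enum is a stand-in for the SDK type, and its declaration order is assumed to match the order of the Entries section above; `engineNames` is a hypothetical helper.

```java
import java.util.ArrayList;
import java.util.List;

public class VisionEngineListing {

    // Stand-in for the SDK enum; declaration order here is an assumption.
    // Real code would import io.nutrient.sdk.enums.VisionEngine instead.
    enum VisionEngine { VlmEnhancedIcr, AdaptiveOcr, Icr }

    // Hypothetical helper: names of all constants, in declaration order.
    static List<String> engineNames() {
        List<String> names = new ArrayList<>();
        for (VisionEngine engine : VisionEngine.values()) {
            names.add(engine.name());
        }
        return names;
    }

    public static void main(String[] args) {
        System.out.println(String.join(", ", engineNames()));
    }
}
```

Each call to `values()` returns a fresh array, so code that iterates frequently may prefer to cache the result.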