VisionEngine

public enum VisionEngine

Specifies which vision processing pipeline to use for content extraction. ICR (Intelligent Content Recognition) refers to intelligent document understanding that goes beyond traditional OCR. While OCR simply converts images of text into machine-readable characters, ICR analyzes the full document structure.

Entries

VLM-enhanced ICR extraction pipeline that combines ICR with Vision Language Models (VLMs). Requires a configured VLM provider with appropriate API credentials (if required). The VLM improves extraction quality by adding contextual understanding of the document content.

Fast OCR-only extraction pipeline. Skips segmentation and AI augmentation for faster processing. Best for simple documents with straightforward layouts where speed is the priority. Does not require a VLM provider.

Local ICR extraction pipeline using only ONNX models (no VLM required). Provides full document layout analysis and content extraction using local inference only. This mode runs entirely offline without requiring external API calls, making it suitable for air-gapped environments or when VLM setup/costs/latency are a concern.
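
The trade-offs between the three entries can be sketched as a small selection helper. This is only an illustration: the entry names are not shown on this page, so `Engine` and its constants `VLM_ENHANCED`, `FAST_OCR`, and `LOCAL_ICR` are hypothetical stand-ins for `VisionEngine`, and the `choose` helper is not part of the library.

```java
public class EngineChoice {
    // Hypothetical stand-in for VisionEngine; the real constant names may differ.
    public enum Engine { VLM_ENHANCED, FAST_OCR, LOCAL_ICR }

    // Picks an engine from two coarse constraints (an assumed policy, not the library's).
    public static Engine choose(boolean vlmProviderConfigured, boolean speedIsPriority) {
        if (speedIsPriority) {
            // OCR-only: skips segmentation and AI augmentation.
            return Engine.FAST_OCR;
        }
        if (vlmProviderConfigured) {
            // Full ICR plus VLM-based contextual understanding.
            return Engine.VLM_ENHANCED;
        }
        // Full layout analysis with local ONNX inference, no external API calls.
        return Engine.LOCAL_ICR;
    }

    public static void main(String[] args) {
        System.out.println(choose(false, false)); // prints LOCAL_ICR
    }
}
```

A policy like this keeps the offline pipeline as the fallback whenever no VLM provider is available.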

Functions

public static VisionEngine valueOf(String name)

Returns the enum constant of this type with the specified name. The string must match exactly an identifier used to declare an enum constant in this type. (Extraneous whitespace characters are not permitted.)
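
Because `valueOf` throws `IllegalArgumentException` for an unknown name (and `NullPointerException` for `null`), input from a config file or command line is usually parsed defensively. A minimal sketch, assuming a hypothetical stand-in enum `Engine` with placeholder constant names in place of `VisionEngine`; the `parse` helper is illustrative and not part of the library:

```java
import java.util.Optional;

public class EngineParsing {
    // Hypothetical stand-in for VisionEngine; the real constant names may differ.
    public enum Engine { VLM_ENHANCED, FAST_OCR, LOCAL_ICR }

    // Returns the matching constant, or an empty Optional if the name is unknown.
    public static Optional<Engine> parse(String raw) {
        if (raw == null) {
            return Optional.empty();
        }
        try {
            // valueOf rejects surrounding whitespace, so trim first.
            // The match stays case-sensitive.
            return Optional.of(Engine.valueOf(raw.trim()));
        } catch (IllegalArgumentException e) {
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        System.out.println(parse(" FAST_OCR ").isPresent()); // prints true
        System.out.println(parse("fast_ocr").isPresent());   // prints false
    }
}
```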

public static VisionEngine[] values()

Returns an array containing the constants of this enum type, in the order they're declared. This method may be used to iterate over the constants.
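
Iterating over `values()` is a common way to list the available options, for example in a help message. The sketch below again uses a hypothetical stand-in enum `Engine` with placeholder constant names rather than the real `VisionEngine` entries:

```java
import java.util.Arrays;
import java.util.stream.Collectors;

public class EngineListing {
    // Hypothetical stand-in for VisionEngine; the real constant names may differ.
    public enum Engine { VLM_ENHANCED, FAST_OCR, LOCAL_ICR }

    // Joins the constant names in declaration order.
    public static String describe() {
        return Arrays.stream(Engine.values())
                .map(Enum::name)
                .collect(Collectors.joining(", "));
    }

    public static void main(String[] args) {
        System.out.println(describe()); // prints VLM_ENHANCED, FAST_OCR, LOCAL_ICR
    }
}
```

Each call to `values()` returns a fresh array, so callers may modify the result without affecting the enum itself.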