OCR-Free

architecture

A model that understands text in images without needing a separate optical character recognition (OCR) tool to extract the text first.

Related Capabilities

Quality of vision, audio, and image understanding (distinct from modality support)