The ability of a model to identify and interpret handwritten characters and words from images, accounting for variations in writing style and quality.
Quality of vision, audio, and image understanding (distinct from modality support)