The specialized component of a model that processes and interprets image data to extract visual information.
Quality of vision, audio, and image understanding (distinct from modality support)