Multimodal

Quality of vision, audio, and image understanding (distinct from which modalities a model supports)

Scores (1–5) indicate capability fit. Models with the same score perform comparably, so the order within a score level is not a ranking.