A model that processes only one type of input (like text) rather than multiple types (like text and images combined).