A model that predicts the next word in a sequence by only looking at previous words, not future ones, making it suitable for text generation.