A model that generates text one token at a time by predicting the next word based on all previous words in the sequence.