Learning approach where an agent builds a model of how the environment works, then uses it to plan actions.