The process of an agent analyzing its past actions and environment feedback to extract lessons for improving future behavior.