Testing an AI system's ability to maintain context and preferences across many sequential interactions over time.