You can often remove an LLM's previous responses from conversation history without losing quality, saving memory while sometimes improving accuracy.
This paper tests whether LLMs actually need to see their own previous responses in multi-turn conversations. Surprisingly, removing past assistant responses often doesn't hurt quality and can shrink the context by 10x. The researchers also found that models sometimes get worse when they over-rely on their own prior outputs, introducing errors that compound across turns.
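The core idea can be sketched in a few lines: before each new turn, filter prior assistant messages out of the history while keeping the system prompt and all user turns. The message format (OpenAI-style `role`/`content` dicts) and the `keep_last` knob are illustrative assumptions, not the paper's implementation.

```python
def prune_assistant_turns(messages, keep_last=0):
    """Drop past assistant responses from a chat history.

    Keeps every system/user message; optionally retains the most
    recent `keep_last` assistant turns for local continuity.
    (Hypothetical helper for illustration, not the paper's code.)
    """
    assistant_idxs = [i for i, m in enumerate(messages) if m["role"] == "assistant"]
    kept = set(assistant_idxs[len(assistant_idxs) - keep_last:]) if keep_last else set()
    return [m for i, m in enumerate(messages)
            if m["role"] != "assistant" or i in kept]


history = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize chapter 1."},
    {"role": "assistant", "content": "Chapter 1 introduces the setting..."},
    {"role": "user", "content": "Now chapter 2."},
    {"role": "assistant", "content": "Chapter 2 covers the conflict..."},
    {"role": "user", "content": "Compare the two chapters."},
]

# Send only system + user turns to the model on the next call.
pruned = prune_assistant_turns(history)
```

On long conversations where assistant replies dominate the token count, this kind of filter is where the large context savings come from; `keep_last=1` is a middle ground if the very latest response is still needed.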