The process of breaking large documents into smaller pieces so a model with a limited context window can process them separately.
Performance retention over long documents and conversations