Learn/Core Concept How does context compaction work? Context compaction intelligently summarises long conversation histories to fit within model token limits whilst preserving essential information. The technique identifies which messages contain key decisions, facts, or context and compresses redundant exchanges into concise summaries. Tools like fable-mode use staged planning to manage this process systematically. It's crucial for long-running agents that would otherwise lose important context or hit expensive token limits. MemoryCompression |