Learn/Core Concept How does token compression actually work? Token compression reduces LLM API costs by encoding common patterns more efficiently before sending requests. Instead of sending raw text, systems like RTK analyse command outputs and replace repetitive structures with compact representations that models can still understand. This cuts token usage by 70-92% on typical dev commands without losing meaning. TokenisationInference |