How does context budgeting work?

Context budgeting manages how much of an LLM's context window is allocated to each part of an application. Instead of cramming everything into the prompt, we reserve space for system instructions, conversation history, and tool outputs. Projects like smallcode use budget-managed context to compensate for smaller models' limitations. Smart budgeting prevents context overflow and keeps the model's responses coherent.
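
The reservation idea can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not how smallcode or any particular project does it: `ContextBudget`, `build_context`, `count_tokens`, and `truncate_to_budget` are hypothetical names, and token counting is approximated by splitting on whitespace, where a real application would use the model's own tokenizer.

```python
from dataclasses import dataclass


def count_tokens(text: str) -> int:
    """Rough stand-in for a real tokenizer: one token per whitespace-separated word."""
    return len(text.split())


def truncate_to_budget(text: str, budget: int) -> str:
    """Keep at most `budget` words from the start of the text."""
    return " ".join(text.split()[:budget])


@dataclass
class ContextBudget:
    window: int  # total context window, in (approximate) tokens
    system: int  # reserved for system instructions
    tools: int   # reserved for tool outputs
    reply: int   # headroom left for the model's reply

    @property
    def history(self) -> int:
        # Whatever remains after the fixed reservations goes to conversation history.
        return self.window - self.system - self.tools - self.reply


def build_context(budget: ContextBudget, system_prompt: str,
                  history: list[str], tool_output: str) -> list[str]:
    if budget.history < 0:
        raise ValueError("reservations exceed the context window")

    parts = [truncate_to_budget(system_prompt, budget.system)]

    # Walk history newest-first, keeping whole turns until the history budget is spent.
    kept, used = [], 0
    for turn in reversed(history):
        cost = count_tokens(turn)
        if used + cost > budget.history:
            break
        kept.append(turn)
        used += cost
    parts.extend(reversed(kept))  # restore chronological order

    parts.append(truncate_to_budget(tool_output, budget.tools))
    return parts


budget = ContextBudget(window=20, system=4, tools=5, reply=5)
context = build_context(
    budget,
    system_prompt="You are a helpful coding assistant today",
    history=["first turn is here now", "second turn", "third turn here"],
    tool_output="a b c d e f g",
)
```

With these toy numbers the history budget is 6 tokens, so the oldest turn is dropped while the two most recent turns survive, and the system prompt and tool output are each cut to their reserved size. Dropping oldest turns first is only one possible policy; summarizing old turns is another common choice.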