Learn/Core Concept How does on-demand computation work? On-demand computation calculates values when needed rather than storing pre-computed results. LEANN's vector database exemplifies this by computing embeddings in real-time instead of storing them, achieving 97% storage savings. This trade-off exchanges memory for computation cycles, making it ideal for resource-constrained environments where storage costs exceed processing overhead. EmbeddingsCaching |