How does dual-view diffusion work?

Dual-view diffusion decoding generates text by processing multiple views of the sequence simultaneously rather than sequentially. Orthrus demonstrates this, achieving 5x speedups over autoregressive methods whilst maintaining lossless output. Instead of predicting one token at a time, the model considers parallel generation paths, which is where the reduction in inference time comes from.
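
The text above does not spell out Orthrus's mechanism, so the following is only a toy sketch of one generic way parallel generation can stay lossless: draft-and-verify block decoding, in the style of speculative decoding. It is not Orthrus's actual algorithm. Every name here (`target_next`, `draft_block`, `block_decode`) is hypothetical, the "models" are deterministic stand-in functions, and the drafter is deliberately made wrong at some positions so that the verification step has something to reject.

```python
from typing import List, Tuple

Token = int


def target_next(prefix: List[Token]) -> Token:
    """Toy stand-in for an autoregressive model's greedy next token (assumption)."""
    return (sum(prefix) * 31 + len(prefix)) % 50


def autoregressive_decode(prompt: List[Token], n_new: int) -> List[Token]:
    """Baseline: one model step per generated token."""
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out[len(prompt):]


def draft_block(prefix: List[Token], k: int) -> List[Token]:
    """Hypothetical parallel drafter: proposes k tokens at once.

    It is deliberately wrong whenever the context length is 2 mod 3,
    simulating an imperfect draft.
    """
    ctx = list(prefix)
    draft = []
    for _ in range(k):
        tok = target_next(ctx)
        if len(ctx) % 3 == 2:
            tok = (tok + 1) % 50  # inject a drafting error
        draft.append(tok)
        ctx.append(tok)
    return draft


def block_decode(prompt: List[Token], n_new: int, k: int = 4) -> Tuple[List[Token], int]:
    """Draft k tokens, keep the longest prefix the target agrees with.

    Returns the generated tokens and the number of verification passes.
    In a real system one pass would score all k positions in a single
    forward call; here that is simulated with a loop.
    """
    out = list(prompt)
    passes = 0
    while len(out) - len(prompt) < n_new:
        draft = draft_block(out, k)
        passes += 1
        accepted: List[Token] = []
        for tok in draft:
            expected = target_next(out + accepted)
            if tok == expected:
                accepted.append(tok)
            else:
                accepted.append(expected)  # replace the first wrong token
                break
        else:
            # Whole draft accepted: the verifier also yields one extra token.
            accepted.append(target_next(out + accepted))
        out.extend(accepted)
    return out[len(prompt):len(prompt) + n_new], passes


if __name__ == "__main__":
    prompt = [3, 1, 4]
    baseline = autoregressive_decode(prompt, 12)
    tokens, passes = block_decode(prompt, 12, k=4)
    print(tokens == baseline, passes)
```

Every token that is kept equals what the baseline would have produced for the same prefix, so the output is identical by construction, while the number of verification passes is smaller than the number of baseline steps. How much smaller depends entirely on how often the drafter is right, which in this sketch is an artificial choice.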