9:39 | Faster LLMs: Accelerate Inference with Speculative Decoding | IBM Technology | 26.5K views
8:26 | [QA] Accelerating Diffusion LLMs via Adaptive Parallel Decoding | Arxiv Papers | 28 views
9:29 | Large Language Diffusion Models - The Era Of Diffusion LLMs? | AI Papers Academy | 24.0K views
5:42 | Diffusion Language Models Explained: The Shift to Parallel Generation | Clyep | 48 views
6:00 | The Probability Bottleneck in Diffusion LLMs: Why Parallel Decoding Is Not Free | Xiaol.x | 52 views
7:17 | DFlash Deep Dive: Block Diffusion Makes LLM Inference 6x Faster | Enchanted Storytime | 431 views
4:54 | Locally Coherent Parallel Decoding in Diffusion Language Models - ICML2026 | Michael Hersche | 2 views
12:11 | LLM generates the ENTIRE output at once (world's first diffusion LLM) | Matthew Berman | 196.6K views
13:20 | Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation, [ICLR 2026, Oral] | MIT HAN Lab | 405 views