40:19Speculation is all you need: Intro to Speculative Decoding for High Performance InferenceModal884 viewsView & Download
9:39Faster LLMs: Accelerate Inference with Speculative DecodingIBM Technology26.4K viewsView & Download
7:52Accelerating LLM Inference on TPUs via Diffusion Speculative DecodingKnut Jägersberg11 viewsView & Download
7:40Speculative Decoding: 3× Faster LLM Inference with Zero Quality LossTales Of Tensors1.6K viewsView & Download
4:53What is Speculative Decoding? making LLMs fasterData Science in your pocket60 viewsView & Download
9:15Accelerating Gemma 4 via Speculative Decoding and MTP DraftersKnut Jägersberg154 viewsView & Download
5:17CVPR 26 - Multi-Scale Local Speculative Decoding for Image GenerationElia Peruzzo1 viewsView & Download
1:51MTP Speculative Decoding Explained: How AI Models Generate FasterTyrel Barstow9 viewsView & Download
1:50Speculative Speculative Decoding: Parallelizing Sequential Bottlenecks in LLM InferenceEmergent Mind25 viewsView & Download