9:39Faster LLMs: Accelerate Inference with Speculative DecodingIBM Technology26.2K viewsView & Download
48:26EAGLE and EAGLE-2: Lossless Inference Acceleration for LLMs - Hongyang ZhangNadav Timor4.0K viewsView & Download
12:30Speeding Up LLMs: Speculative Decoding for Multi-Sample InferenceTalkTensors: AI Podcast Covering ML Papers18 viewsView & Download
13:54[IDSL Seminar'26] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference AccelerationIDSL10 viewsView & Download
7:40Speculative Decoding: 3× Faster LLM Inference with Zero Quality LossTales Of Tensors1.6K viewsView & Download
40:19Speculation is all you need: Intro to Speculative Decoding for High Performance InferenceModal863 viewsView & Download
4:53What is Speculative Decoding? making LLMs fasterData Science in your pocket7 viewsView & Download
22:36MASSIVELY speed up local AI models with Speculative Decoding in LM StudioGosuCoder21.2K viewsView & Download
1:51MTP Speculative Decoding Explained: How AI Models Generate FasterTyrel Barstow4 viewsView & Download
7:48Why using a dumb language model can speed up a smarter one: Speculative Decoding [Lecture]Jordan Boyd-Graber233 viewsView & Download