4:48 · GitHub - z-lab/dflash: DFlash: Block Diffusion for Flash Speculative Decoding · GitHub Daily Trend AI Podcast · 2 views
8:43 · DFlash Drafter for Gemma 4 26B - Official Speculative Decoding is Here: Run Locally · Fahd Mirza · 5.6K views
40:32 · ML Performance Reading Group 23: DFlash: Block Diffusion for Flash Speculative Decoding · EleutherAI · 538 views
1:50 · Unleashing DFlash A Game Changer in Speculative Decoding! Full Review · Simple Tech Lab · 41 views
10:14 · MLX India Community Meetup 1 | Boosting local model performance - Speculative decoding with DFlash · Conscious Engines · 95 views
10:06 · DFlash Leaves Qwen Territory - Gemma 4 31B Now Runs 5x Faster with Speculative Decoding · Fahd Mirza · 5.3K views
9:01 · Running a 27B model at 130 tokens sec on a single GPU Locally with Luce DFlash · Fahd Mirza · 9.8K views
8:27 · 600 Toks/Second Gemma4-26B —The Setting That Actually Wins (vLLM + Dflash Speculative Decoding) · Tech-Practice · 3.9K views