40:32 · ML Performance Reading Group 23: DFlash: Block Diffusion for Flash Speculative Decoding · EleutherAI · 538 views
4:48 · GitHub - z-lab/dflash: DFlash: Block Diffusion for Flash Speculative Decoding · GitHub Daily Trend AI Podcast · 2 views
1:36:03 · ML Performance Reading Group Session 19: Speculative Decoding · EleutherAI · 1.0K views
8:43 · DFlash Drafter for Gemma 4 26B - Official Speculative Decoding is Here: Run Locally · Fahd Mirza · 5.6K views
7:17 · DFlash Deep Dive: Block Diffusion Makes LLM Inference 6x Faster · Enchanted Storytime · 398 views
10:14 · MLX India Community Meetup 1 | Boosting local model performance - Speculative decoding with DFlash · Conscious Engines · 95 views
9:39 · Faster LLMs: Accelerate Inference with Speculative Decoding · IBM Technology · 26.1K views
8:27 · 600 Toks/Second Gemma4-26B —The Setting That Actually Wins (vLLM + Dflash Speculative Decoding) · Tech-Practice · 3.8K views
3:22 · Magic-VLA K02: Revolutionizing Embodied AI with Instant Inference & Panoramic Generalization · MagicLab · 156 views