23:52Blockwise Parallel Decoding for Deep Autoregressive ModelsYannic Kilcher1.3K viewsView & Download
13:20Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation, [ICLR 2026, Oral]MIT HAN Lab398 viewsView & Download
8:22Non-Autoregressive and Shallow Decoding: Speeding up TranslationEfficient NLP2.0K viewsView & Download
4:54Locally Coherent Parallel Decoding in Diffusion Language Models - ICML2026Michael Hersche1 viewsView & Download
11:43Blockwise Parallel Transformer for Long Context Large ModelsBerkeley 2023mardin mardin431 viewsView & Download
0:17Speeding up Vision-Language Models: LocateAnything Decoding ComparisonShihao Wang67 viewsView & Download
2:25Interspeech2021-Streaming End-to-End ASR based on Block-wise Non-Autoregressive ModelsWAVLab226 viewsView & Download
1:06Video on Mobile CPU: UHD Video Parallel Decoding for Asymmetric Multicores @ MMSys'17Yeongil Ryu222 viewsView & Download
9:39Faster LLMs: Accelerate Inference with Speculative DecodingIBM Technology26.3K viewsView & Download
17:11AI Agents for Medical Diagnostics: Parallel Reasoning with Octochains FrameworkAhmad Varasteh4 viewsView & Download
41:08Patterns & Practices for building Multi-Agent Systems by Nikhil BarthwalDevoxx927 viewsView & Download
7:38Which transformer architecture is best? Encoder-only vs Encoder-decoder vs Decoder-only modelsEfficient NLP50.8K viewsView & Download
6:47LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box DecodingDeep Learning With Mayank21 viewsView & Download