28:19 | Ultra-scale playbook, ch.3.2 - "Sequence Parallelism" | Little ML book club | 219 views
1:24:42 | Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1 | Stanford Online | 43.6K views
1:06:39 | Lazy and Fast: Ranges Meet Parallelism in C++ - Daniel Anderson - CppCon 2025 | CppCon | 8.1K views
13:20 | Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation, [ICLR 2026, Oral] | MIT HAN Lab | 398 views
9:46 | TFLA & xLSTM: The Future of Efficient Real-Time AI and Robotics | Foundation Models For Robotics | 3 views
15:26 | Understanding Parallel Transport & Connections in Differential Geometry | Dialect | 39.6K views
20:18 | LLM Inference Optimization #2: Tensor, Data & Expert Parallelism (TP, DP, EP, MoE) | Faradawn Yang | 4.4K views
7:27 | Two Dimensional Parallelism Using Distributed Tensors at PyTorch Conference 2022 | PyTorch | 3.4K views
48:06 | Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention (Paper Explained) | Yannic Kilcher | 29.0K views
40:54 | Deep dive - Better Attention layers for Transformer models | Julien Simon | 15.8K views
18:52 | Parallax: Parameterized Local Linear Attention for Language Modeling (May 2026) | AI Paper Slop | 26 views
42:39 | Object-Centric Learning with Slot Attention (Paper Explained) | Yannic Kilcher | 19.4K views