58:29Genloop Research Jam #2 - Exploring Meta's Transformers without NormalizationGenloop50 viewsView & Download
3:15Dynamic Tanh (DyT) Explained in 3 Minutes! | Transformers Without NormalizationKavishka Abeywardana79 viewsView & Download
8:04Transformers Without Normalization? He Kaiming & Yann LeCun's Game-Changing AI Breakthrough!DeepWing407 viewsView & Download
18:26Transformers Without Normalization: Dynamic Tanh ApproachAI Papers Decoded Podcast130 viewsView & Download
10:40E08 Normalization (Batch, Layer, RMS) | Transformer Series (with Google Engineer)Martin Is A Dad1.2K viewsView & Download
9:39Faster LLMs: Accelerate Inference with Speculative DecodingIBM Technology26.2K viewsView & Download
8:13LT2 Explained: Linear-Time Looped Transformers, GDN, DSA, and the RWKV-7 ExtensionXiaol.x41 viewsView & Download
18:21Automated Global Analysis of Experimental Dynamics through Low-Dimensional Linear EmbeddingsGeneral Robotics Lab16.2K viewsView & Download