11:23LLM Compression Explained: Build Faster, Efficient AI ModelsIBM Technology26.2K viewsView & Download
19:46Quantization vs Pruning vs Distillation: Optimizing NNs for InferenceEfficient NLP65.4K viewsView & Download
18:23these compression algorithms could halve our image file sizes (but we don't use them) #SoMEpiJentGent362.1K viewsView & Download
20:34How LLMs survive in low precision | Quantization FundamentalsJulia Turc56.0K viewsView & Download
1:09:30vLLM Office Hours #23 - Deep Dive Into the LLM Compressor - April 10, 2025Neural Magic2.0K viewsView & Download
1:00:00State of LLM Compression from Research to Production | Random SamplesRed Hat1.1K viewsView & Download
5:10Meta Just Changed Data Compression FOREVER (OpenZL Explained)Better Stack81.6K viewsView & Download