20:34 · How LLMs survive in low precision | Quantization Fundamentals · Julia Turc · 56.0K views
3:51 · Quantizing Models from Hugging Face Using BitsnBytes | Quantization | TensorTeach · TensorTeach · 732 views
15:35 · Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python) · codebasics · 73.5K views
50:55 · Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware Training · Umar Jamil · 54.6K views
19:46 · Quantization vs Pruning vs Distillation: Optimizing NNs for Inference · Efficient NLP · 65.4K views
11:17 · 9.2 Quantization aware Training - Concepts · xLAB for Safe Autonomous Systems · 5.8K views
40:59 · Fast, Cheap, and Accurate: Optimizing LLM Inference with vLLM and Quantization by Legare Kerrison · Devoxx UK · 116 views
1:37:46 · Leaner, Greener and Faster Pytorch Inference with Quantization · Toronto Machine Learning Society (TMLS) · 570 views
27:47 · Leaner and Greener AI with Quantization in PyTorch - SURAJ SUBRAMANIAN · Open Data Science and AI Conference · 644 views
28:34 · [2023 Best AI Paper] SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compressio · Paper With Video · 105 views
1:01:20 · tinyML Talks: A Practical Guide to Neural Network Quantization · EDGE AI FOUNDATION · 30.0K views