20:34 · How LLMs survive in low precision | Quantization Fundamentals · Julia Turc · 56.6K views
1:15:24 · EfficientML.ai Lecture 5 - Quantization (Part I) (MIT 6.5940, Fall 2023) · MIT HAN Lab · 18.0K views
50:55 · Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware Training · Umar Jamil · 54.8K views
0:55 · Fujitsu's 1-bit Quantumization—Tech to Lighten Generative AI While Maintaining Accuracy · Fujitsu Research · 53 views
15:35 · Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python) · codebasics · 73.6K views
19:46 · Quantization vs Pruning vs Distillation: Optimizing NNs for Inference · Efficient NLP · 65.6K views
10:12 · Quantization in LLMs Overview (Version2) | Embedded Systems AI LLC · esai-llc · 71 views
40:50 · Scaling Inference Time Scaling: KV Cache Quantization | Hao Wang, Ligong Han | Random Samples · AI Innovation 18 views
7:29 · Model Quantization Explained 8 bit, 4 bit & Inference Optimization #genai #aigenerated · SmartSkale · 35 views