3:54SmoothQuant: Efficient & Accurate Quantization for Massive Language ModelsArxflix220 viewsView & Download
21:16SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language ModelsKUO PENG HUANG38 viewsView & Download
4:50SmoothQuant: Migrate Activation Difficulty to WeightsChallengerSpaceShuttle543 viewsView & Download
26:41How Do We Get MASSIVE Model To Run On Device? Quantization Explained.Tim Carambat12.1K viewsView & Download
35:3005.09.2023 SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language ModelsDS Talks Siberia178 viewsView & Download
19:46Quantization vs Pruning vs Distillation: Optimizing NNs for InferenceEfficient NLP65.5K viewsView & Download
20:34How LLMs survive in low precision | Quantization FundamentalsJulia Turc56.3K viewsView & Download
3:48How Quantization Makes AI Models Faster and More EfficientThe Personal AI Architecture2.9K viewsView & Download
5:08[CVPR 2026] MASQuant: Modality-Aware Smoothing Quantization for Multimodal LargeLanguage Modelsxin chen6 viewsView & Download
8:26ONNXCommunityMeetup2023: INT8 Quantization for Large Language Models with Intel Neural CompressorONNX607 viewsView & Download
31:19SmoothQuant : Accurate and Efficient Post Training Quantization for Large Langu딥러닝논문읽기모임675 viewsView & Download
6:49TurboQuant Explained: Online Vector Quantization with Near-Optimal Distortion for LLMsmathtartic462 viewsView & Download
50:55Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware TrainingUmar Jamil54.7K viewsView & Download