4:50SmoothQuant: Migrate Activation Difficulty to WeightsChallengerSpaceShuttle544 viewsView & Download
3:54SmoothQuant: Efficient & Accurate Quantization for Massive Language ModelsArxflix222 viewsView & Download
35:3005.09.2023 SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language ModelsDS Talks Siberia178 viewsView & Download
21:16SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language ModelsKUO PENG HUANG39 viewsView & Download
20:34How LLMs survive in low precision | Quantization FundamentalsJulia Turc56.8K viewsView & Download
10:295. Comparing Quantizations of the Same Model - Ollama CourseMatt Williams32.2K viewsView & Download
13:29How Quantization Shrinks Near-Frontier AI to Run on Hardware You OwnSquintist1.6K viewsView & Download
8:26ONNXCommunityMeetup2023: INT8 Quantization for Large Language Models with Intel Neural CompressorONNX607 viewsView & Download
26:41How Do We Get MASSIVE Model To Run On Device? Quantization Explained.Tim Carambat12.3K viewsView & Download
50:55Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware TrainingUmar Jamil54.9K viewsView & Download
11:44Dynamic Quantization with Unsloth: Shrinking a 20GB Model to 5GB Without Accuracy Loss!Prompt Engineer1.9K viewsView & Download
30:14LLM Quantization Explained: GPTQ, AWQ, QLoRA, GGUF and MoreTales Of Tensors1.9K viewsView & Download
54:26Anticipating new weights in the CLSA: Unpacking sampling weights and their useCanadian Longitudinal Study on Aging (CLSA/ÉLCV)445 viewsView & Download