3:54SmoothQuant: Efficient & Accurate Quantization for Massive Language ModelsArxflix220 viewsView & Download
21:16SmoothQuant - Accurate and Efficient Post-Training Quantization for Large Language ModelsKUO PENG HUANG38 viewsView & Download
4:50SmoothQuant: Migrate Activation Difficulty to WeightsChallengerSpaceShuttle543 viewsView & Download
31:19SmoothQuant : Accurate and Efficient Post Training Quantization for Large Langu딥러닝논문읽기모임675 viewsView & Download
35:3005.09.2023 SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language ModelsDS Talks Siberia178 viewsView & Download
30:14LLM Quantization Explained: GPTQ, AWQ, QLoRA, GGUF and MoreTales Of Tensors1.9K viewsView & Download
35:07Large Language Models Post Training Quantization(smoothQuant, RPTQ)MIPAL-SNU553 viewsView & Download
8:26ONNXCommunityMeetup2023: INT8 Quantization for Large Language Models with Intel Neural CompressorONNX607 viewsView & Download