16:49Boost Your AI Models with INT8 Quantization 🚀 ONNX Static vs Dynamic + Python & C++ Speed TestDeep knowledge342 viewsView & Download
19:46Quantization vs Pruning vs Distillation: Optimizing NNs for InferenceEfficient NLP65.5K viewsView & Download
9:45INT8 Inference of Quantization-Aware trained models using ONNX-TensorRTONNX4.5K viewsView & Download
13:05DeepSeek R1 0528 at 1-Bit? (Unsloth Dynamic Quant LOCAL Test)Bijan Bowen7.9K viewsView & Download
3:59Start Post-Training Static Quantization | AI Model Optimization with Intel® Neural CompressorIntel Devs220.7K viewsView & Download
8:26ONNXCommunityMeetup2023: INT8 Quantization for Large Language Models with Intel Neural CompressorONNX607 viewsView & Download
15:35Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python)codebasics73.6K viewsView & Download