8:32  [QA] Optimizing Large Language Model Training Using FP4 Quantization (Arxiv Papers, 64 views)
20:34  How LLMs survive in low precision | Quantization Fundamentals (Julia Turc, 56.0K views)
21:33  Optimizing Large Language Model Training Using FP4 Quantization (Arxiv Papers, 160 views)
19:46  Quantization vs Pruning vs Distillation: Optimizing NNs for Inference (Efficient NLP, 65.4K views)
15:35  Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python) (codebasics, 73.5K views)
2:12:21  LLM Fine-Tuning 12: LLM Quantization Explained (Part 1) | PTQ, QAT, GPTQ, AWQ, GGUF, GGML, llama.cpp (Sunny Savita, 8.0K views)
50:55  Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware Training (Umar Jamil, 54.6K views)
52:19  Zhiyu Cheng (NVIDIA) FP4 quantization and its real-world applications on LLMs and diffusion models (IDEAL, 27 views)
33:55  Be Top 0.1% - 4x LLM Training Speed - FP4 of LLMs (Pretraining, Inference) (Vuk Rosić, 354 views)