12:15Optimizing Large Language Model Training Using FP4 Quantization | PodcastLuis Camilo Jimenez Alvarez (Luisca)116 viewsView & Download
8:32[QA] Optimizing Large Language Model Training Using FP4 QuantizationArxiv Papers64 viewsView & Download
21:33Optimizing Large Language Model Training Using FP4 QuantizationArxiv Papers160 viewsView & Download
20:34How LLMs survive in low precision | Quantization FundamentalsJulia Turc56.2K viewsView & Download
19:46Quantization vs Pruning vs Distillation: Optimizing NNs for InferenceEfficient NLP65.5K viewsView & Download
15:35Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python)codebasics73.6K viewsView & Download
33:55Be Top 0.1% - 4x LLM Training Speed - FP4 of LLMs (Pretraining, Inference)Vuk Rosić354 viewsView & Download
34:21Deephonk Stemcast -- Modern AI 17 INFERENCE OPTIMIZATION: KV CACHE & QUANTIZATIONDeephonk Stem18 viewsView & Download