20:34How LLMs survive in low precision | Quantization FundamentalsJulia Turc56.0K viewsView & Download
7:21SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion ModelsAI Illuminated195 viewsView & Download
13:01ICQuant: Index Coding enables Low-bit LLM QuantizationConference on Language Modeling176 viewsView & Download
11:03LLaMa GPTQ 4-Bit Quantization. Billions of Parameters Made Smaller and Smarter. How Does it Work?AemonAlgiz29.7K viewsView & Download
10:06Fine-tune LLMs with Unsloth: QLoRA, 4-bit train LLMs 2x faster with 70% less VRAM!Audio Obsession8 viewsView & Download
11:23LLM Compression Explained: Build Faster, Efficient AI ModelsIBM Technology26.2K viewsView & Download
15:35Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python)codebasics73.5K viewsView & Download
19:01LLM Quantization (Ollama, LM Studio): Any Performance Drop? TESTDiscover AI4.3K viewsView & Download