7:14The Geometry of Compression How TurboQuant Solves the KV CacheKevin Varley3.5K viewsView & Download
20:34How LLMs survive in low precision | Quantization FundamentalsJulia Turc56.0K viewsView & Download
54:14Explained: TurboQuant in Qdrant | Improving Vector Compression without the Recall TaxQdrant Vector Search118 viewsView & Download
19:46Quantization vs Pruning vs Distillation: Optimizing NNs for InferenceEfficient NLP65.4K viewsView & Download
13:48TurboQuant The algorithm that crashed RAM prices 30% OvernightAI Depth School1.6K viewsView & Download
6:49TurboQuant Explained: Online Vector Quantization with Near-Optimal Distortion for LLMsmathtartic453 viewsView & Download
5:24TurboQuant: Google's 1-Bit Compression That Makes LLMs 6x SmallerPrism Labs4.3K viewsView & Download
8:02Google's TurboQuant Explained: Breaking the AI Memory Wall (6x Compression!) | KYC AI LabsKYC AI LABS1.5K viewsView & Download
5:53How TurboQuant Works: Google's KV Cache Compression Coming to ICLR 2026Alex To Go Eng38 viewsView & Download