9:39 · Faster LLMs: Accelerate Inference with Speculative Decoding · IBM Technology · 26.3K views
17:20 · Structured Output from LLMs: Grammars, Regex, and State Machines · Efficient NLP · 9.4K views
4:54 · Locally Coherent Parallel Decoding in Diffusion Language Models - ICML2026 · Michael Hersche · 1 view
11:53 · Greedy? Min-p? Beam Search? How LLMs Actually Pick Words – Decoding Strategies Explained · AI Coffee Break with Letitia · 6.9K views
48:56 · Typical Decoding for Natural Language Generation (Get more human-like outputs from language models!) · Yannic Kilcher · 19.3K views
43:51 · Lec 58 Large Language Models: Tokenization, Generation, and Sampling · NPTEL - Indian Institute of Science, Bengaluru · 1.3K views
27:14 · Transformers, the tech behind LLMs | Deep Learning Chapter 5 · 3Blue1Brown · 10.3M views
12:28 · LLM Sampling Explained: Temperature, Top-p, Top-k (with Go demo) · AgenticCore Labs · 78 views
16:23 · [full] Contrastive Decoding Improves Reasoning in Large Language Models · Arxiv Papers · 665 views