4:58What is vLLM? Efficient AI Inference for Large Language ModelsIBM Technology82.6K viewsView & Download
2:12Optimize, deploy, and benchmark an open-source LLM with vLLMDeepLearningAI251 viewsView & Download
5:35Beyond Single-GPU: Orchestrating Open Source LLMs with kServe, llm-d, and vLLMllm-d Project948 viewsView & Download
12:42LLM Inference Engines: vLLM, KV Cache, Paged attention and Continuous Batching.The Cef Experience456 viewsView & Download
3:47AI Lab: Open-source inference with vLLM + SGLang | Optimizing KV cache with Crusoe Managed InferenceCrusoe AI8.2M viewsView & Download