7:05Direct Preference Optimization: An RL-free algorithm for training language models from preferences.Yousef Emami92 viewsView & Download
8:55Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explainedAI Coffee Break with Letitia40.8K viewsView & Download
21:15Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learningLuis Serrano Academy34.3K viewsView & Download
48:46Direct Preference Optimization (DPO) explained: Bradley-Terry model, log probabilities, mathUmar Jamil36.5K viewsView & Download
14:23Direct Preference Optimization: Fine-tuning Language Models Without Reinforcement LearningAI Papers Explained4 viewsView & Download
16:15Direct Preference Optimization Beats RLHF (Explained Visually), how DPO works?GeniPad237 viewsView & Download
2:45Direct Preference Optimization (DPO) Explained: AI AlignmentVLR Software Training17 viewsView & Download
10:44Direct Preference Optimization (DPO) - Learn how to fine-tune LLMs directly without RL.Audio Obsession2 viewsView & Download
36:25Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model ExplainedGabriel Mongaras19.5K viewsView & Download
6:04Fine-tuning OpenAI's GPT4O Using direct preference optimization (DPO)GenAI Insight595 viewsView & Download
37:16Hands-on 10: Large Language Model Alignment with Direct Preference OptimizationBrainOmega3.8K viewsView & Download
34:49An introduction to Direct Preference Optimization - April 2025Massimo Piccardi55 viewsView & Download
3:42Direct Preference Optimization: Your Language Model is Secretly a Reward ModelEmergent Mind5 viewsView & Download
19:47[2024 Best AI Paper] SimPO: Simple Preference Optimization with a Reference-Free RewardPaper With Video183 viewsView & Download