37:53 · Direct Preference Optimization (DPO) - math insight explained · Ricardo Calix · 368 views
8:55 · Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained · AI Coffee Break with Letitia · 40.8K views
48:46 · Direct Preference Optimization (DPO) explained: Bradley-Terry model, log probabilities, math · Umar Jamil · 36.5K views
21:15 · Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning · Luis Serrano Academy · 34.3K views
16:15 · Direct Preference Optimization Beats RLHF (Explained Visually), how DPO works? · GeniPad · 237 views
36:25 · Direct Preference Optimization (DPO): Your Language Model is Secretly a Reward Model Explained · Gabriel Mongaras · 19.5K views
3:58 · DPO - Direct Preference Optimization | How DPO saves computation explained · Paper in a Pod · 124 views
10:44 · Direct Preference Optimization (DPO) - Learn how to fine-tune LLMs directly without RL. · Audio Obsession · 2 views
14:23 · Direct Preference Optimization: Fine-tuning Language Models Without Reinforcement Learning · AI Papers Explained · 4 views
37:16 · Hands-on 10: Large Language Model Alignment with Direct Preference Optimization · BrainOmega · 3.8K views
1:18:44 · Stanford CS234 I Guest Lecture on DPO: Rafael Rafailov, Archit Sharma, Eric Mitchell I Lecture 9 · Stanford Online · 12.6K views
1:40:14 · Direct Preference Optimization (DPO) | ML@P Reading Group | Jinen Setpal · ML Purdue · 96 views
2:45 · Direct Preference Optimization (DPO) Explained: AI Alignment · VLR Software Training · 17 views