1:26:35  Execution Based Evaluation for Open Domain Code Generation (JetBrains Research, 224 views)
3:45  Code Generation & Execution | Quality Modeller 101 Tutorials - 5.5 | Curiosity Software (Curiosity Software, 239 views)
5:35  OpenAI Codex Harness: Agentic Coding AI Evaluation Framework and Automated Benchmarks (CosmoX, 225 views)
13:10  RAG vs Fine-Tuning vs Prompt Engineering: Optimizing AI Models (IBM Technology, 656.1K views)