1:27:12Ibragim Badertdinov: From Dentistry to AI, Coding Agents, SWE-rebench | JetBrains Research PodcastJetBrains Research624 viewsView & Download
6:17SWE-BENCH: CAN LANGUAGE MODELS RESOLVE REAL-WORLD GITHUB ISSUES?Machine Learning Daily450 viewsView & Download
5:08SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated EvaluationAleksandr Kovyazin190 viewsView & Download
9:27Introducing Open SWE: An Open-Source Asynchronous Coding AgentLangChain28.7K viewsView & Download
3:47Multi-SWE-bench: Testing LLMs on Real-World Code IssuesAI Research Roundup210 viewsView & Download
27:10The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier EvalsLatent Space4.2K viewsView & Download
1:10GLM-5.1 Beat GPT-5.4 on SWE-Bench Pro — Did China Just Win the Coding War?Data Ranked Geek46 viewsView & Download
1:19:04Paper Reading: SWE-bench: Can Language Models Resolve Real-world Github Issues? ICLR 2024Manikishan Ghantasala136 viewsView & Download