1:32ProgramBench: Can Language Models Rebuild Programs From Scratch?Emergent Mind2.8K viewsView & Download
20:26ProgramBench: Can Language Models Rebuild Programs From Scratch? (May 2026)AI Paper Slop161 viewsView & Download
5:57[CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark For Physical AIFengzhe Zhou10 viewsView & Download
8:32Every Frontier AI Just Scored ZERO on Meta's New BenchmarkDigital Dreamscapes12 viewsView & Download
1:10GLM-5.1 Beat GPT-5.4 on SWE-Bench Pro — Did China Just Win the Coding War?Data Ranked Geek42 viewsView & Download
34:358 Factor Producers to Scale Platform Engineering in an AI-First world with Abby BangserPure Performance6 viewsView & Download
55:43Keynote: Benchee: 9 Years of Benchmarking on the BEAM -Tobias Pfeiffer | Code BEAM Lite Sto 2024Code Sync909 viewsView & Download
55:12The Science of Benchmarking Panel (NeurIPS 2025 Tutorial)Michael Saxon (NLP & Generative AI research)1.4K viewsView & Download