Research@1.0 Leaderboard
# Spanning six major CS domains: OS, HPC, AI, DB, PL, and Security| Rank | Model Name | Pass@1 | Score@1 | Date |
|---|---|---|---|---|
| 1 | Gemini 2.5 Pro | 68.5% | 35.0% | 11/17/2025 |
| 2 | GPT 5 Thinking | 57.4% | 28.7% | 11/17/2025 |
| 3 | Claude Sonnet 4.5 | 44.2% | 22.5% | 11/17/2025 |
| 4 | Grok 4 | 34.5% | 19.7% | 11/17/2025 |
Send us an email to submit your results: qmang@berkeley.edu wenhao.chai@princeton.edu