
What will be the highest score achieved on SWE-Bench Verified in 2025?
<70: 8%
70-85 inclusive: 31%
>85: 61%
https://openai.com/index/introducing-swe-bench-verified/
https://www.swebench.com/
Resolves to the highest performance reported before 2026. Any run listed on https://www.swebench.com/ counts. Numbers reported by large AI companies count whether or not they are listed on swebench.com. Other claimed scores will generally not be counted unless verified by a third party.
This question is managed and resolved by Manifold.
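The answer ranges above can be read as a simple mapping from the best eligible score to one option. The sketch below is only an illustration of that reading, not Manifold's resolution logic; the function name `resolve_bucket` and the assumption that the score is given as a percentage are hypothetical.

```python
def resolve_bucket(best_score_pct: float) -> str:
    """Map the highest eligible SWE-Bench Verified score (in percent)
    to one of the market's three answer options.

    Hypothetical helper: the boundaries follow the option labels,
    with 70 and 85 both falling in the "inclusive" middle range.
    """
    if best_score_pct < 70:
        return "<70"
    if best_score_pct <= 85:
        return "70-85 inclusive"
    return ">85"


# Boundary values land in the middle option because it is inclusive.
for score in (69.9, 70.0, 85.0, 85.1):
    print(score, "->", resolve_bucket(score))
```

Which scores count as eligible is still a judgment call under the criteria above (listed runs, large-company reports, third-party verification), so the sketch takes the best score as an input rather than trying to decide eligibility.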
@JacobPfau Does "Introducing Codex" resolve <70 to NO? Annoyingly, they don't give a number, but in the plot codex-1 pass@1 is clearly above 70%.
@SanghyeonSeo I don't see an option to resolve individual options. IIRC there are two types of multiple choice questions.
Related questions
What will be the best performance on SWE-bench Verified by December 31st 2025?
Top SWE-Bench Verified score in 2025?
Top Multi-SWE-bench score in 2025?
Will SotA on PaperBench (Code-Dev) surpass 75% in 2025?
40% chance
When will SWE-bench be solved?
AI resolves at least X% on SWE-bench WITH assistance, by 2028?
AI resolves at least X% on SWE-bench without any assistance, by 2028?
What will be the best score (5/5 reliability) on ZeroBench by December 31st 2025?
What will be the best score on Cybench by December 31st 2025?
BIG-bench accuracy 75% #3: Will SOTA for a single model on BIG-bench pass 75% by the start of 2026?
86% chance