Will Claude Opus 4.5 achieve a SOTA score on SWE-rebench when it is first evaluated?
7
Ṁ5906 · Dec 31
97% chance
Resolves when Claude Opus 4.5 is evaluated and its score is visible on https://swe-rebench.com/
This question is managed and resolved by Manifold.
Related questions
Will Claude Opus 4.5 exceed 80% on SWE-Bench verified?
99% chance
Are these Claude Opus 4.5 leaked benchmark scores real?
1% chance
What will be the highest score achieved on SWE-Bench Verified in 2025?
BIG-bench accuracy 75% #3: Will SOTA for a single model on BIG-bench pass 75% by the start of 2026?
86% chance
How many parameters does the new possibly-SOTA large language model, Claude 3 Opus, have?
Will Claude 4 achieve over 95% on the MMLU-Pro benchmark by end of 2025?
1% chance
What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?
BIG-bench accuracy 75% #4: Will SOTA for a single model on BIG-bench pass 75% by the start of 2027?
86% chance
BIG-bench accuracy 75% #5: Will SOTA for a single model on BIG-bench pass 75% by the start of 2028?
87% chance
When will SOTA for Atari 100k pass human median and mean score on all 57 games?