Which benchmarks will GPT-5 be benchmarked against when it is announced?
10
Ṁ2650
2026
97% SimpleQA
9% GSM8K
71% HumanEval
82% MMLU
80% GPQA
62% MATH
47% MGSM
32% DROP
52% Big-Bench-Hard
87% SWE-Bench

There is some flexibility on variations of specific benchmarks, e.g. SWE-Bench-Hard would resolve SWE-Bench YES.


@bbb I can't add options; I might create a duplicate where I can in a bit.
