Will OpenAI's next-generation model score 65% or higher on the GPQA benchmark?
Plus
13
Ṁ803resolved Sep 16
Resolved
YES1D
1W
1M
ALL
Resolve to YES if OpenAI's next generation language model scores 65% or higher on the GPQA benchmark(extended set).
If OpenAI's existing model gets 65% or higher by post-training enhancements, that also counts.
There's room for improvement via prompt engineering after the release, but I don't know how long I should wait, so I will resolve this question as soon as OpenAI releases their next model.
This question is managed and resolved by Manifold.
Get
1,000and
3.00
Related questions
Related questions
Will OpenAI's next major LLM (after GPT-4) surpass 70% accuracy on the GPQA benchmark?
66% chance
Will OpenAI's next major LLM (after GPT-4) surpass 74% accuracy on the GPQA benchmark?
85% chance
Will OpenAI's next major LLM (after GPT-4) achieve over 50% resolution rate on the SWE-bench benchmark?
75% chance
Will OpenAI models achieve ≥90% on SimpleBench by the end of 2025?
42% chance
Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?
26% chance
Will the gap between open-weights and frontier models on GPQA Diamond be at most 7%?
35% chance
OpenAI's next major AI model will be more open than GPT-4 by June 30, 2025
Will a single model achieve superhuman performance on all OpenAI gym environments by 2025?
25% chance
Will AI image generating models score >= 90% on Winoground by June 1, 2025?
82% chance
Will OpenAI's o4 get above 50% on humanity's last exam?
53% chance