Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
32
Ṁ3236 Nov 8
61% chance
The 50% time horizon is a measure of AI autonomy based on the length of tasks an AI system can complete: roughly, it is the time humans take to complete tasks that the AI system succeeds at 50% of the time. See METR's "Measuring AI Ability to Complete Long Tasks" for the technical definition. Claude 3.7 Sonnet, released in February 2025, was the leading model, with a 50% time horizon of 59 minutes.
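
To make the rough definition above concrete, here is a minimal sketch of how such a number could be estimated. It assumes a plain logistic fit of success against log2 of human task time and reads off the task length where the fitted success probability crosses 50%. The function name `fit_time_horizon`, the `HYPOTHETICAL_RUNS` data, and the fitting details are illustrative assumptions, not METR's actual pipeline; the paper remains the reference for the real methodology.

```python
import math

# Hypothetical data, invented for illustration only:
# (minutes a human needs for the task, whether the AI succeeded).
HYPOTHETICAL_RUNS = [
    (2, True), (4, True), (8, True), (16, True),
    (32, True), (32, False), (64, True), (64, False),
    (128, False), (256, False), (512, False), (1024, False),
]

def fit_time_horizon(runs, steps=5000, lr=0.1):
    """Estimate the task length (minutes) at which success probability is 50%.

    Fits p(success) = sigmoid(a + b * (log2(minutes) - mean)) by
    gradient ascent on the log-likelihood. This is an assumed, simplified
    model, not a reproduction of METR's procedure.
    """
    xs = [math.log2(minutes) for minutes, _ in runs]
    ys = [1.0 if ok else 0.0 for _, ok in runs]
    n = len(runs)
    mean_x = sum(xs) / n
    centered = [x - mean_x for x in xs]  # centering keeps the fit well conditioned

    a, b = 0.0, 0.0
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for x, y in zip(centered, ys):
            p = 1.0 / (1.0 + math.exp(-(a + b * x)))
            grad_a += y - p
            grad_b += (y - p) * x
        a += lr * grad_a / n
        b += lr * grad_b / n

    # p = 0.5 where a + b * (x - mean_x) = 0, so x = mean_x - a / b.
    return 2.0 ** (mean_x - a / b)

horizon = fit_time_horizon(HYPOTHETICAL_RUNS)
print(f"Estimated 50% time horizon: {horizon:.1f} minutes")
```

Because the toy data is symmetric around tasks of roughly 45 minutes, the estimate lands near that value. Real evaluations involve many more tasks and repeated runs per task, so treat this only as an illustration of what "succeeds 50% of the time" means on a task-length scale.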

This question is managed and resolved by Manifold.
Related questions
Gemini 3.0 released before October 31?
30% chance
Gemini 3.0 released by October 15?
2% chance
Gemini 3.0 released before November 15?
59% chance
Gemini 3's 50% time horizon, per METR
Will we know GPT-5-Pro's METR 50% Time-Horizon before November?
37% chance
Gemini 3.0 released before November 30?
74% chance
GPT-5 Pro's 50% time horizon, per METR
Gemini-2.5-Pro remains at top of LMArena by end of October?
66% chance
Gemini 3's execution time-horizon?
Will Gemini 3.0 Pro Achieve Silver Medal Performance on the 2025 IMO?
40% chance