Gemini 3's execution time-horizon?
20
Ṁ1696Dec 31
Invalid contract
On the task described in https://arxiv.org/abs/2509.09677, what will be the length of tasks that Gemini 3 will be able to complete in one go?
I'm an author, and I will run the same setup above^ to resolve this.
Currently:
GPT-5 Thinking is 1024
Claude 4 Sonnet is 432
Grok-4 is 384
This question is managed and resolved by Manifold.
Get
1,000and
3.00
Related questions
Related questions
Gemini 3's 50% time horizon, per METR
Gemini 3 flash release in November 2025?
4% chance
Gemini 3 Deep Think available on API in 2025?
34% chance
Gemini 3.0 Pro outperforms GPT-5 on METR 50% time horizon?
78% chance
Before 2026, will Gemini 3.0 exceed GPT-5 in Metr estimated time horizon?
76% chance
Google Gemini 3 Flash release date?
-
How many will Gemini 3.0 achieve? [Read description]
-
Will GPT-5.1 have a longer METR time horizon than Gemini 3?
48% chance