Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on LiveBench?
Basic
1
Ṁ10Feb 2
55%
chance
1D
1W
1M
ALL
https://livebench.ai/ as of 11/4 the best performing Gemini model scores 54.94 and 3.5 Sonnet 10/22 scores 60.33
This question is managed and resolved by Manifold.
Get
1,000
and3.00
Related questions
Related questions
Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on Simple Bench?
55% chance
Will Gemini achieve a higher score on the SAT compared to GPT-4?
70% chance
Will Gemini exceed the performance of GPT-4 on the 2022 AMC 10 and AMC 12 exams?
72% chance
Will Gemini-1.5-Pro-Exp-0801 Score Above 1165 in Scale AI's Math Evaluation
48% chance
What will Claude 3.5 Opus's reported 0-shot performance on GPQA Diamond be upon release?
Will any model get above human level on the Simple Bench benchmark before September 1st, 2025.
55% chance
Will o1 (not preview) achieve a better score on LiveBench coding than Claude 3.5 Sonnet 10/22?
75% chance
Will "Gemini [Ultra, 1.0] smash GPT-4 by 5x"?
18% chance
Will Google release a model called Gemini 1.5 Ultra or Gemini 2.0 Ultra before the end of the year?
41% chance
Will Gemini Ultra outperform GPT-4V on visual reasoning by the end of 2024?
65% chance