Will there be another benchmark/test after "Humanity's Last Exam"? | Manifold

Will there be another benchmark/test after "Humanity's Last Exam"?

Plus

15

Ṁ2292

2027

89%

chance

1D

1W

1M

ALL

SafeAI is developing a benchmark that "aims to be the world’s most difficult AI test" For a question to qualify, all current models must fail at it. Is this truly "Humanity's Last Exam," or will there be another one after this?

https://www.safe.ai/blog/humanitys-last-exam

This question is managed and resolved by Manifold.

Get

1,000

and

3.00

Related questions

Will the first AI model that saturates Humanity's Last Exam be employable as a software engineer?

Humanity's Last Exam score in 2025?

What will be the best AI performance on Humanity's Last Exam by December 31st 2025?

When will humanity's last exam be saturated? (>80%)

Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2027?

Will OpenAI's o4 get above 50% on humanity's last exam?

-4% 1d27% chance

What will be o3's score on Humanity's Last Exam?

Will there be an LLM which scores above what a human can do in 2 hours on METR's eval suite before 2026?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2028?

Related questions

Will the first AI model that saturates Humanity's Last Exam be employable as a software engineer?

Will OpenAI's o4 get above 50% on humanity's last exam?

Humanity's Last Exam score in 2025?

What will be o3's score on Humanity's Last Exam?

What will be the best AI performance on Humanity's Last Exam by December 31st 2025?

Will there be an LLM which scores above what a human can do in 2 hours on METR's eval suite before 2026?

When will humanity's last exam be saturated? (>80%)

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2027?

Will Al achieve 85% or higher on the Humanity's Last Exam benchmark before 2027?

Will Al achieve 95% or higher on the Humanity's Last Exam benchmark before 2028?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules