Will a Large Language Model beat me at chess this year? | Manifold

Will a Large Language Model beat me at chess this year?

Basic

13

Ṁ749

Jan 1

4%

chance

1D

1W

1M

ALL

I’m rated around 1900 FIDE. At the end of the 2024 I’ll play a game against an LLM at a rapid time control, selected from the top 3 of the leaderboard (https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard). Resolves YES if I lose, NO if I win, and 50% for a draw.

This question is managed and resolved by Manifold.

#Technical AI Timelines

Get

1,000

and

3.00

Sort by:

we can only hope.

What prompt will you be using? I imagine that changes their performance quite a bit

Good point! On each move, I’ll provide it the moves played so far in PGN notation, as well as the current position in FEN notation. This way both ways of representing position would be in context and in a standard format.

I think that makes the model significantly worse than it could otherwise be. I'd recommend using whatever prompt someone that claims "SOTA LLM chess" or something came up with

I’m planning to use lichess to play the game, and those are the representations it provides. In a future market this might change.

bought Ṁ43 NO

When I tested this with ChatGPT 3.0 a while back, it couldn't even remember the board position and kept making illegal moves. How will you resolve if it does this?

Let’s say three illegal moves will result in a loss. Distinctions like Rad1 vs. Rd1 won’t count towards this, but I’ll ask it for clarification.

Related questions

Will a large language model beat a super grandmaster playing chess by 2028?

Will a Language Model under 10B parameters play chess at Grandmaster level by 2050?

Will an AI by OpenAI beat a super grandmaster playing chess by 2028?

Will end-to-end neural networks such as LLMs can beat the best human player in chess by 2028?

Will an Open Source Large Language Model Surpass GPT-4 in Elo Rating on Chatbot Arena by December 31, 2024?

When will a Large Language Model beat me at chess?

Will an LLM (a GPT-like text AI) defeat the World Champion at Chess before 2035?

Will any language model trained without large number arithmetic be able to generalize to large number arithmetic by 2026?

Will any OpenAI model win a chess match against IM by the end of 2024?

Will a Large Language Model be listed as an author on a peer-reviewed paper by the end of 2025?

Related questions

Will a large language model beat a super grandmaster playing chess by 2028?

When will a Large Language Model beat me at chess?

Will a Language Model under 10B parameters play chess at Grandmaster level by 2050?

Will an LLM (a GPT-like text AI) defeat the World Champion at Chess before 2035?

Will an AI by OpenAI beat a super grandmaster playing chess by 2028?

Will any language model trained without large number arithmetic be able to generalize to large number arithmetic by 2026?

Will end-to-end neural networks such as LLMs can beat the best human player in chess by 2028?

Will any OpenAI model win a chess match against IM by the end of 2024?

Will an Open Source Large Language Model Surpass GPT-4 in Elo Rating on Chatbot Arena by December 31, 2024?

Will a Large Language Model be listed as an author on a peer-reviewed paper by the end of 2025?

© Manifold Markets, Inc.•Terms + Mana-only Terms•Privacy•Rules