Grok 3.5 ‘leaked’ benchmark scores end up real?
28
Ṁ9268Jun 3
9%
chance
1D
1W
1M
ALL
This was shared across twitter.

Will it be confirmed real or completely made up? If they announce benchmark results that are all better than 1% less than the current leaked results, this market resolves YES.
It also resolves yes if the benchmark results were obtained with pass@1024 or something like that
This question is managed and resolved by Manifold.
Get
1,000and
3.00
Sort by:
the original resolution criteria was
Will it be confirmed real or completely made up? If they announce benchmark results within 1% of each of these except one which can be within 2%, even if they announce the results were for pass@64 or any other not ‘apples to apples’ comparison like that, the market resolves yes.
But when i updated it i accidentally removed the part that would have answered your question