Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| GPT 5.2 | 11 Dec 2025 | 7.52 | Medium reasoning; sigma 0.276; games 331; avg points 0.705 | No | Source | |
| GPT 5 | 07 Aug 2025 | 5.97 | Medium reasoning; sigma 0.212; games 556; avg points 0.611 | No | Source | |
| GPT 5 Mini | 07 Aug 2025 | 5.73 | Medium reasoning; sigma 0.226; games 465; avg points 0.603 | No | Source | |
| Claude Opus 4.5 | 24 Nov 2025 | 5.66 | 16K thinking; sigma 0.264; games 341; avg points 0.594 | No | Source | |
| Gemini 3 Flash Preview | 17 Dec 2025 | 5.66 | sigma 0.244; games 398; avg points 0.598 | No | Source | |
| Grok 3 Mini Beta | 19 Feb 2025 | 5.53 | High reasoning; sigma 0.216; games 511; avg points 0.597 | No | Source | |
| DeepSeek R1 (2025-05-28) | 28 May 2025 | 5.35 | sigma 0.204; games 565; avg points 0.581 | No | Source | |
| Claude 3.7 Sonnet | 24 Feb 2025 | 5.28 | 16K thinking; sigma 0.158; games 947; avg points 0.593 | No | Source | |
| Claude Opus 4.1 | 05 Aug 2025 | 5.21 | No reasoning; sigma 0.290; games 276; avg points 0.578 | No | Source | |
| Claude Sonnet 4.5 | 29 Sept 2025 | 5.19 | 16K thinking; sigma 0.262; games 348; avg points 0.559 | No | Source | |
| Grok 4 | 10 Jul 2025 | 5.15 | sigma 0.228; games 455; avg points 0.565 | No | Source | |
| GPT 4.5 | 27 Feb 2025 | 5.12 | sigma 0.217; games 499; avg points 0.597 | No | Source | |
| Claude 3.5 Sonnet (2024-10-22) | 22 Oct 2024 | 5.10 | sigma 0.180; games 731; avg points 0.597 | No | Source | |
| Grok 3 Beta | 19 Feb 2025 | 5.10 | No reasoning; sigma 0.207; games 539; avg points 0.579 | No | Source | |
| Gemini 3 Pro Preview | 18 Nov 2025 | 4.89 | sigma 0.271; games 324; avg points 0.544 | No | Source | |
| Gemini 2.5 Flash | 17 Jun 2025 | 4.73 | sigma 0.202; games 578; avg points 0.548 | No | Source | |
| Claude Sonnet 4 | 21 May 2025 | 4.64 | No reasoning; sigma 0.252; games 369; avg points 0.539 | No | Source | |
| MiniMax M2 | 27 Oct 2025 | 4.57 | sigma 0.280; games 291; avg points 0.522 | No | Source | |
| Qwen 3 Max Thinking | 26 Jan 2026 | 4.49 | sigma 0.285; games 286; avg points 0.516 | No | Source | |
| o3 | 16 Apr 2025 | 4.48 | Medium reasoning; sigma 0.192; games 656; avg points 0.523 | No | Source | |
| Claude Opus 4 | 21 May 2025 | 4.41 | No reasoning; sigma 0.292; games 273; avg points 0.527 | No | Source | |
| Qwen 3 A235 A22B Instruct 2507 | - | 4.41 | Instruct 2507; sigma 0.274; games 305; avg points 0.507 | No | Source | |
| o3 mini | 30 Jan 2025 | 4.37 | Medium reasoning; sigma 0.139; games 1194; avg points 0.531 | No | Source | |
| Kimi K2 Thinking | 06 Nov 2025 | 4.33 | 64K thinking; sigma 0.311; games 238; avg points 0.504 | No | Source | |
| GLM 4.5 | 28 Jul 2025 | 4.25 | sigma 0.251; games 368; avg points 0.505 | No | Source | |
| Mistral Large 2.0 | 24 Jul 2024 | 4.11 | sigma 0.137; games 1229; avg points 0.522 | No | Source | |
| DeepSeek V3 (2024-12-26) | 26 Dec 2024 | 4.07 | sigma 0.140; games 1180; avg points 0.518 | No | Source | |
| DeepSeek R1 | - | 4.06 | sigma 0.142; games 1165; avg points 0.512 | No | Source | |
| o1 | 17 Dec 2024 | 3.86 | Medium reasoning; sigma 0.171; games 798; avg points 0.510 | No | Source | |
| GPT OSS 120b | 05 Aug 2025 | 3.79 | sigma 0.210; games 519; avg points 0.462 | No | Source | |
| Mistral Large 3.0 | 02 Dec 2025 | 3.64 | sigma 0.262; games 337; avg points 0.456 | No | Source | |
| Llama 4 Maverick | 05 Apr 2025 | 3.63 | sigma 0.142; games 1146; avg points 0.474 | No | Source | |
| Grok 4.1 Thinking | 17 Nov 2025 | 3.62 | Fast reasoning; sigma 0.246; games 385; avg points 0.455 | No | Source | |
| Llama 3.3 70B Instruct | 06 Dec 2024 | 3.59 | sigma 0.166; games 836; avg points 0.493 | No | Source | |
| Nova Pro 1.0 | 04 Dec 2024 | 3.52 | sigma 0.135; games 1253; avg points 0.477 | No | Source | |
| Qwen 3 235B A22B | - | 3.52 | sigma 0.205; games 558; avg points 0.464 | No | Source | |
| Minimax Text 01 | 15 Jan 2025 | 3.45 | sigma 0.131; games 1335; avg points 0.471 | No | Source | |
| Kimi K2 (2025-09-05) | 05 Sept 2025 | 3.40 | sigma 0.252; games 378; avg points 0.455 | No | Source | |
| Mistral Small 3.0 | 30 Jan 2025 | 3.39 | sigma 0.161; games 889; avg points 0.475 | No | Source | |
| Grok 2 | 13 Aug 2024 | 3.35 | sigma 0.171; games 792; avg points 0.475 | No | Source | |
| GPT 4o Mini (2024-07-18) | 18 Jul 2024 | 3.30 | sigma 0.144; games 1114; avg points 0.462 | No | Source | |
| o4 Mini | 16 Apr 2025 | 3.30 | High reasoning; sigma 0.209; games 534; avg points 0.445 | No | Source | |
| Claude 3.5 Haiku | 04 Nov 2024 | 3.15 | sigma 0.139; games 1193; avg points 0.452 | No | Source | |
| Gemini 2.0 Pro Exp (2025-02-05) | 05 Feb 2025 | 3.07 | sigma 0.218; games 494; avg points 0.462 | No | Source | |
| Llama 3.1 405B Instruct | 23 Jul 2024 | 3.07 | sigma 0.172; games 784; avg points 0.459 | No | Source | |
| Phi 4 | 12 Dec 2024 | 2.84 | sigma 0.136; games 1254; avg points 0.425 | No | Source | |
| Gemini 2.0 Flash Thinking Exp (2025-01-21) | 21 Jan 2025 | 2.81 | sigma 0.213; games 522; avg points 0.449 | No | Source | |
| GLM 4.6 | 30 Sept 2025 | 2.69 | sigma 0.584; games 70; avg points 0.386 | No | Source | |
| Mistral Medium 3.0 | 07 May 2025 | 2.19 | sigma 0.216; games 510; avg points 0.374 | No | Source | |
| QwQ 32B | - | 2.05 | 16K; sigma 0.203; games 586; avg points 0.382 | No | Source | |
| Gemini 2.0 Flash | 05 Feb 2025 | 1.95 | sigma 0.177; games 757; avg points 0.376 | No | Source | |
| Qwen 3 30B A3B | - | 1.86 | sigma 0.212; games 535; avg points 0.358 | No | Source | |
| Mistral Medium 3.1 | 12 Aug 2025 | 0.30 | sigma 0.289; games 310; avg points 0.250 | No | Source |