Loading
AI Stats is fetching the latest data for this page. This usually only takes a moment.
If this screen doesn't disappear after a short while, you can refresh the page or use one of the links above to continue.
Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Gemini 3.0 Pro Preview | 18 Nov 2025 | 0.72 | - | Yes | Source | |
| Qwen3 A235 A22B Instruct 2507 | 21 Jul 2025 | 0.54 | - | Yes | Source | |
| Gemini 2.5 Pro Preview (2025-06-05) | 05 Jun 2025 | 0.54 | - | Yes | Source | |
| Gemini 2.5 Pro Preview (2025-05-06) | 06 May 2025 | 0.51 | - | Yes | Source | |
| Grok 3 Beta | 19 Feb 2025 | 0.44 | - | Yes | Source | |
| Kimi K2 Base | 11 Jul 2025 | 0.35 | Correct | Yes | Source | |
| Kimi K2 Instruct | 11 Jul 2025 | 0.31 | Correct | Yes | Source | |
| Gemini 2.0 Flash | 05 Feb 2025 | 0.30 | - | Yes | Source | |
| Gemini 2.5 Flash Preview (2025-04-17) | 17 Apr 2025 | 0.30 | - | Yes | Source | |
| Gemini 2.5 Flash Preview (2025-05-20) | 20 May 2025 | 0.27 | - | Yes | Source | |
| Grok 3 Mini Beta | 19 Feb 2025 | 0.22 | - | Yes | Source | |
| Gemini 2.5 Flash Lite Preview | 17 Jun 2025 | 0.13 | Thinking | Yes | Source | |
| Mistral Small 3.2 | 20 Jun 2025 | 0.12 | TotalAcc | Yes | Source |