Loading
AI Stats is fetching the latest data for this page. This usually only takes a moment.
If this screen doesn't disappear after a short while, you can refresh the page or use one of the links above to continue.
Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Gemini 3.0 Pro Preview | 18 Nov 2025 | 0.94 | Deep Think, Tools Off | Yes | Source | |
| Grok 4 Heavy | 10 Jul 2025 | 0.89 | - | Yes | Source | |
| o3 Preview | 20 Dec 2024 | 0.88 | - | Yes | Source | |
| Grok 4 | 10 Jul 2025 | 0.88 | No Tools | Yes | Source | |
| GPT 5 | 07 Aug 2025 | 0.87 | Pass @ 1 | Yes | Source | |
| Claude Opus 4.5 | 24 Nov 2025 | 0.87 | Avg@5, 64k Thinking | Yes | Source | |
| Gemini 2.5 Pro Preview (2025-06-05) | 05 Jun 2025 | 0.86 | Single Attempt | Yes | Source | |
| Claude 3.7 Sonnet | 24 Feb 2025 | 0.85 | - | Yes | Source | |
| Grok 3 Beta | 19 Feb 2025 | 0.85 | Think, Cons@64 | Yes | Source | |
| Grok 3 Mini Beta | 19 Feb 2025 | 0.84 | Think, Cons@64 | Yes | Source | |
| o3 Pro | 10 Jun 2025 | 0.84 | - | Yes | Source | |
| Claude Sonnet 4 | 21 May 2025 | 0.84 | - | Yes | - | |
| Claude Opus 4 | 21 May 2025 | 0.83 | - | Yes | - | |
| o3 | 16 Apr 2025 | 0.83 | - | Yes | Source | |
| Gemini 2.5 Pro Preview (2025-05-06) | 06 May 2025 | 0.83 | Pass@1 | Yes | Source | |
| Gemini 2.5 Flash Preview (2025-05-20) | 20 May 2025 | 0.83 | Pass@1 | Yes | Source | |
| GPT 5 mini | 07 Aug 2025 | 0.82 | High Reasoning Effort, No Tools | Yes | Source | |
| o4 Mini | 16 Apr 2025 | 0.81 | - | Yes | Source | |
| Qwen3 235B A22B Thinking 2507 | 25 Jul 2025 | 0.81 | - | Yes | Source | |
| Deepseek R1 (2025-05-28) | 28 May 2025 | 0.81 | - | Yes | Source | |
| GPT OSS 120b | 05 Aug 2025 | 0.81 | High Reasoning Effort, With Tools | Yes | Source | |
| Claude Opus 4.1 | 05 Aug 2025 | 0.81 | - | Yes | Source | |
| Grok 3 Mini | 18 Apr 2025 | 0.80 | High Reasoning Effort | Yes | Source | |
| o3 mini | 30 Jan 2025 | 0.80 | High Reasoning Effort | Yes | - | |
| Grok 3 | 18 Apr 2025 | 0.79 | - | Yes | Source | |
| o1 pro | 19 Mar 2025 | 0.79 | - | Yes | Source | |
| Gemini 2.5 Flash Preview (2025-04-17) | 17 Apr 2025 | 0.78 | Thinking, Single Attempt | Yes | Source | |
| Gemini 2.0 Flash | 05 Feb 2025 | 0.78 | Single Attempt | Yes | Source | |
| o1 | 17 Dec 2024 | 0.78 | - | Yes | Source | |
| Qwen3 A235 A22B Instruct 2507 | 21 Jul 2025 | 0.78 | - | Yes | Source | |
| Llama 3.1 Nemotron Ultra 253B v1 | 07 Apr 2025 | 0.76 | - | Yes | Source | |
| EXAONE 4.0 32B | 15 Jul 2025 | 0.75 | Reasoning | Yes | Source | |
| Kimi K2 Instruct | 11 Jul 2025 | 0.75 | Avg@8 | Yes | Source | |
| GPT OSS 20b | 05 Aug 2025 | 0.74 | High Reasoning Effort, With Tools | Yes | Source | |
| o1 preview | 12 Sept 2024 | 0.73 | - | Yes | Source | |
| Deepseek R1 (2025-01-20) | 20 Jan 2025 | 0.71 | - | No | Source | |
| GPT 4.5 | 27 Feb 2025 | 0.71 | - | Yes | - | |
| GPT 5 nano | 07 Aug 2025 | 0.71 | High Reasoning Effort, No Tools | Yes | Source | |
| Magistral Medium | 10 Jun 2025 | 0.71 | - | Yes | Source | |
| Phi 4 Reasoning Plus | 30 Apr 2025 | 0.69 | - | Yes | Source | |
| DeepSeek V3 (2025-03-24) | 25 Mar 2025 | 0.68 | - | Yes | Source | |
| Magistral Small | 10 Jun 2025 | 0.68 | - | Yes | Source | |
| Claude 3.5 Sonnet (2024-06-20) | 21 Jun 2024 | 0.67 | - | Yes | Source | |
| Llama 3.3 Nemotron Super 49B v1 | 18 Mar 2025 | 0.67 | - | Yes | Source | |
| Gemini 2.5 Flash Lite Preview | 17 Jun 2025 | 0.67 | Thinking | Yes | Source | |
| GPT 4.1 | 14 Apr 2025 | 0.66 | - | Yes | Source | |
| Phi 4 Reasoning | 30 Apr 2025 | 0.66 | - | Yes | Source | |
| Qwen3 30B A3B | 29 Apr 2025 | 0.66 | - | Yes | Source | |
| QwQ 32B Preview | 28 Nov 2024 | 0.65 | - | Yes | Source | |
| QwQ 32B | 05 Mar 2025 | 0.65 | - | Yes | Source | |
| GPT 4.1 Mini | 14 Apr 2025 | 0.65 | - | Yes | Source | |
| Claude 3.5 Sonnet (2024-10-22) | 22 Oct 2024 | 0.65 | - | Yes | - | |
| o1 mini | 12 Sept 2024 | 0.60 | - | Yes | Source | |
| DeepSeek V3 (2024-12-26) | 25 Dec 2024 | 0.59 | - | No | Source | |
| Nova Premier | 30 Apr 2025 | 0.57 | - | Yes | - | |
| Phi 4 | 12 Dec 2024 | 0.56 | - | Yes | Source | |
| Grok 2 | 13 Aug 2024 | 0.56 | - | Yes | Source | |
| Llama 3.1 Nemotron Nano 8B V1 | 18 Mar 2025 | 0.54 | - | Yes | Source | |
| Phi 4 Mini Reasoning | 30 Apr 2025 | 0.52 | - | Yes | Source | |
| EXAONE 4.0 1.2B | 15 Jul 2025 | 0.52 | Reasoning | Yes | Source | |
| Grok 2 Mini | 13 Aug 2024 | 0.51 | - | Yes | Source | |
| Llama 3.1 405B Instruct | 23 Jul 2024 | 0.51 | - | Yes | Source | |
| Llama 3.3 70B Instruct | 06 Dec 2024 | 0.51 | - | Yes | Source | |
| Claude 3 Opus | 04 Mar 2024 | 0.50 | - | Yes | Source | |
| GPT 4.1 Nano | 14 Apr 2025 | 0.50 | - | Yes | Source | |
| Qwen2.5 32B Instruct | 19 Sept 2024 | 0.49 | - | Yes | Source | |
| Qwen2.5 72B Instruct | 19 Sept 2024 | 0.49 | - | Yes | Source | |
| Kimi K2 Base | 11 Jul 2025 | 0.48 | Avg@8 | Yes | Source | |
| Qwen3 235B A22B | 29 Apr 2025 | 0.47 | - | Yes | Source | |
| Nova Pro 1.0 | 04 Dec 2024 | 0.47 | - | Yes | Source | |
| Mistral Small 3.2 | 20 Jun 2025 | 0.46 | - | Yes | Source | |
| Mistral Small 3.1 24B Instruct | 17 Mar 2025 | 0.46 | - | Yes | Source | |
| GPT 4o | 06 Aug 2024 | 0.46 | - | Yes | Source | |
| Qwen2.5 VL 32B Instruct | 28 Feb 2025 | 0.46 | - | Yes | Source | |
| Qwen2.5 14B Instruct | 19 Sept 2024 | 0.46 | - | Yes | Source | |
| Mistral Small 3 24B Instruct | 30 Jan 2025 | 0.45 | - | Yes | Source | |
| Qwen2 72B Instruct | 23 Jul 2024 | 0.42 | - | Yes | Source | |
| Gemma 3 27B | 12 Mar 2025 | 0.42 | - | Yes | Source | |
| Nova Lite 1.0 | 04 Dec 2024 | 0.42 | - | Yes | Source | |
| Llama 3.1 70B Instruct | 23 Jul 2024 | 0.42 | - | Yes | Source | |
| Claude 3.5 Haiku | 04 Nov 2024 | 0.42 | - | Yes | Source | |
| Gemma 3 12B | 12 Mar 2025 | 0.41 | - | Yes | Source | |
| Claude 3 Sonnet | 04 Mar 2024 | 0.40 | - | Yes | Source | |
| Gemini Diffusion | 20 May 2025 | 0.40 | Pass@1 | Yes | Source | |
| GPT 4o Mini (2024-07-18) | 18 Jul 2024 | 0.40 | - | Yes | Source | |
| Nova Micro 1.0 | 04 Dec 2024 | 0.40 | - | Yes | Source | |
| Jamba Large 1.6 | 06 Mar 2025 | 0.39 | - | No | Source | |
| Mistral Small 3.1 24B Base | 17 Mar 2025 | 0.38 | - | Yes | Source | |
| Jamba Large 1.5 | 22 Aug 2024 | 0.37 | - | Yes | Source | |
| Phi 3.5 MoE instruct | 23 Aug 2024 | 0.37 | - | Yes | Source | |
| Qwen2.5 7B Instruct | 19 Sept 2024 | 0.36 | - | Yes | Source | |
| Grok 1.5 | 28 Mar 2024 | 0.36 | - | Yes | Source | |
| Gemini 1.0 Ultra | 06 Dec 2023 | 0.36 | - | Yes | Source | |
| GPT 4 | 14 Mar 2023 | 0.36 | - | Yes | Source | |
| Mistral Small 3 24B Base | 30 Jan 2025 | 0.34 | - | Yes | Source | |
| Claude 3 Haiku | 13 Mar 2024 | 0.33 | - | Yes | Source | |
| Llama 3.2 3B Instruct | 25 Sept 2024 | 0.33 | - | Yes | Source | |
| Jamba Mini 1.5 | 22 Aug 2024 | 0.32 | - | Yes | Source | |
| Gemma 3 4B | 12 Mar 2025 | 0.31 | - | Yes | Source | |
| GPT 3.5 Turbo | 21 Mar 2023 | 0.31 | - | No | - | |
| Qwen2.5 Omni 7B | 27 Mar 2025 | 0.31 | - | Yes | Source | |
| Phi 3.5 mini instruct | 23 Aug 2024 | 0.30 | - | Yes | Source | |
| Llama 3.1 8B Instruct | 23 Jul 2024 | 0.30 | - | Yes | Source | |
| Jamba Mini 1.6 | 06 Mar 2025 | 0.30 | - | No | Source | |
| Gemini 1.0 Pro | 06 Dec 2023 | 0.28 | - | No | - | |
| Qwen2 7B Instruct | 23 Jul 2024 | 0.25 | - | Yes | Source | |
| Gemma 3 1B | 12 Mar 2025 | 0.19 | - | Yes | Source |