Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| o1 | 17 Dec 2024 | 6117 | - | No | Source | |
| o3 mini | 30 Jan 2025 | 4984 | High Reasoning Effort | No | Source | |
| GPT 4.5 | 27 Feb 2025 | 3066 | - | No | Source | |
| Claude 3.5 Sonnet (2024-10-22) | 22 Oct 2024 | 2734 | - | No | Source | |
| Grok 3 Beta | 19 Feb 2025 | 2347 | Reasoning | No | Source | |
| Claude 3.7 Sonnet | 24 Feb 2025 | 2233 | Thinking | No | Source | |
| o1 preview | 12 Sept 2024 | 1938 | - | No | Source | |
| Deepseek R1 (2025-01-20) | 20 Jan 2025 | 1505 | - | No | Source | |
| o1 mini | 12 Sept 2024 | 1488 | - | No | Source | |
| Claude 3.5 Sonnet (2024-06-20) | 21 Jun 2024 | 1423 | - | No | Source | |
| GPT 4.1 | 14 Apr 2025 | 1379 | - | No | Source | |
| Claude 3.5 Haiku | 04 Nov 2024 | 931 | - | No | Source | |
| Claude 3 Opus | 04 Mar 2024 | 931 | - | No | Source | |
| GPT 4.1 Nano | 14 Apr 2025 | 729 | - | No | Source |