Loading
AI Stats is fetching the latest data for this page. This usually only takes a moment.
If this screen doesn't disappear after a short while, you can refresh the page or use one of the links above to continue.
Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| GPT 5 | 07 Aug 2025 | 0.84 | With Thinking, Pass @ 1 | Yes | Source | |
| Gemini 2.5 Pro Preview (2025-06-05) | 05 Jun 2025 | 0.82 | Single Attempt | Yes | Source | |
| GPT 5 mini | 07 Aug 2025 | 0.82 | High Reasoning Effort | Yes | Source | |
| Claude Opus 4.5 | 24 Nov 2025 | 0.81 | Avg@5, 64k Thinking | Yes | Source | |
| Gemini 2.5 Flash Preview (2025-05-20) | 20 May 2025 | 0.80 | Pass@1 | Yes | Source | |
| Gemini 2.5 Pro Preview (2025-05-06) | 06 May 2025 | 0.80 | Pass@1 | Yes | Source | |
| Grok 3 Beta | 19 Feb 2025 | 0.78 | Think, Cons@64 | Yes | Source | |
| Claude Opus 4.1 | 05 Aug 2025 | 0.77 | - | Yes | Source | |
| Gemini 2.0 Flash | 05 Feb 2025 | 0.77 | Pass@1 | Yes | Source | |
| Gemini 2.5 Flash Preview (2025-04-17) | 17 Apr 2025 | 0.77 | Pass@1 | Yes | Source | |
| GPT 5 nano | 07 Aug 2025 | 0.76 | High Reasoning Effort | Yes | Source | |
| Gemini 2.5 Flash Lite Preview | 17 Jun 2025 | 0.73 | No Thinking | Yes | Source | |
| Magistral Medium | 10 Jun 2025 | 0.70 | - | Yes | Source | |
| Grok 3 Mini Beta | 19 Feb 2025 | 0.69 | - | Yes | Source | |
| Magistral Small | 10 Jun 2025 | 0.66 | - | Yes | Source | |
| Mistral Small 3.2 | 20 Jun 2025 | 0.63 | - | Yes | Source | |
| Grok 1.5V | 12 Apr 2024 | 0.54 | - | Yes | Source |