Individual benchmark scores plotted by date.
| Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|
| Claude Opus 4.5 | 24 Nov 2025 | 0.89 | Avg@5, 64k Thinking | Yes | Source |
| GPT 5 | 07 Aug 2025 | 0.81 | With Thinking, Pass @ 1 | Yes | Source |
| GPT 5 mini | 07 Aug 2025 | 0.78 | High Reasoning Effort | Yes | Source |
| Qwen3 235B A22B Thinking 2507 | 25 Jul 2025 | 0.72 | - | Yes | Source |
| Kimi K2 Instruct | 11 Jul 2025 | 0.71 | Avg@4 | Yes | Source |
| GPT 5 nano | 07 Aug 2025 | 0.62 | High Reasoning Effort | Yes | Source |