Individual benchmark scores plotted by date.
| Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|
| Claude Opus 4.5 | 24 Nov 2025 | 0.88 | Corrected | Yes | Source |
| GPT 5 | 07 Aug 2025 | 0.63 | With Thinking, Pass @ 1 | Yes | Source |
| GPT 5 mini | 07 Aug 2025 | 0.60 | High Reasoning Effort | Yes | Source |
| Qwen3 235B A22B Thinking 2507 | 25 Jul 2025 | 0.58 | - | Yes | Source |
| Kimi K2 Instruct | 11 Jul 2025 | 0.56 | Avg@4 | Yes | Source |
| GPT 5 nano | 07 Aug 2025 | 0.41 | High Reasoning Effort | Yes | Source |