Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Claude Opus 4.6 | 05 Feb 2026 | 8018 | Final balance | Yes | Source | |
| Claude Sonnet 4.6 | 17 Feb 2026 | 7204 | Max effort final balance | Yes | Source | |
| GLM 5.1 | - | 5634 | - | Yes | Source | |
| Gemini 3 Pro Image Preview (Nano Banana Pro) | 20 Nov 2025 | 5478 | inferred modality/version alias from gemini-3-pro-preview | Yes | Source | |
| Gemini 3 Pro Preview | 18 Nov 2025 | 5478 | - | Yes | Source | |
| Claude Opus 4.5 | 24 Nov 2025 | 4967 | 8k Thinking | No | Source | |
| Claude Sonnet 4.5 | 29 Sept 2025 | 3839 | - | No | Source | |
| Gemini 3 Flash Preview | 17 Dec 2025 | 3635 | Higher is better, value in dollars | Yes | Source | |
| Grok 4 | 10 Jul 2025 | 1999 | - | No | Source | |
| GPT 5.1 | 12 Nov 2025 | 1473 | - | No | Source | |
| Gemini 2.5 Pro Preview (2025-06-05) | 05 Jun 2025 | 574 | - | No | Source | |
| Grok 4.1 Non Thinking | 17 Nov 2025 | 351 | - | No | Source |