Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| | Claude Opus 4.6 | 05 Feb 2026 | 78.30 | LLM Stats (ZeroEval) | Yes | Source |
| | Grok 4.1 Thinking | 17 Nov 2025 | 34.00 | LLM Stats (ZeroEval); inferred modality/version alias from grok-4.1-thinking-2025-11-17 | Yes | Source |