Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| | GLM 5.1 | - | 95.30 | - | Yes | Source |
| | Qwen 3.6 Plus | 01 Apr 2026 | 95.30 | LLM Stats (ZeroEval) | Yes | Source |
| | Seed 2.0 Pro | 14 Feb 2026 | 94.20 | LLM Stats (ZeroEval) | Yes | Source |
| | Qwen 3.5 397B A17B | 16 Feb 2026 | 91.30 | - | Yes | Source |
| | Gemma 4 31B | 02 Apr 2026 | 89.20 | No tools | Yes | Source |
| | Seed 2.0 Lite | 14 Feb 2026 | 88.30 | LLM Stats (ZeroEval) | Yes | Source |
| | Gemma 4 26B A4B | 02 Apr 2026 | 88.30 | No tools | Yes | Source |
| | Seed 2.0 Mini | 14 Feb 2026 | 86.70 | Seed2 official benchmark table \| AIME 2026 | Yes | Source |