Individual benchmark scores plotted by date.
| Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|
| Kimi K2 Instruct | 11 Jul 2025 | 0.90 | Prompt Strict | Yes | Source |
| Qwen3 235B A22B Instruct 2507 | 21 Jul 2025 | 0.89 | - | Yes | Source |
| Qwen3 235B A22B Thinking 2507 | 25 Jul 2025 | 0.88 | - | Yes | Source |
| EXAONE 4.0 32B | 15 Jul 2025 | 0.85 | Non Reasoning | Yes | Source |
| Jamba Large 1.7 | 03 Jul 2025 | 0.84 | - | Yes | - |
| Jamba Mini 1.7 | 03 Jul 2025 | 0.76 | - | Yes | - |
| Jamba Large 1.6 | 06 Mar 2025 | 0.76 | - | Yes | - |
| EXAONE 4.0 1.2B | 15 Jul 2025 | 0.75 | Non Reasoning | Yes | Source |
| Jamba Mini 1.6 | 06 Mar 2025 | 0.68 | - | Yes | - |