Individual benchmark scores plotted by date.
| Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|
| Claude Opus 4.1 | 05 Aug 2025 | 0.82 | - | Yes | Source |
| Qwen3 Coder 480B A35B Instruct | 22 Jul 2025 | 0.78 | - | Yes | Source |
| Qwen3 235B A22B Instruct 2507 | 21 Jul 2025 | 0.71 | - | Yes | Source |
| GPT OSS 120b | 05 Aug 2025 | 0.68 | High Reasoning Effort | Yes | Source |
| EXAONE 4.0 32B | 15 Jul 2025 | 0.63 | Reasoning | Yes | Source |
| GPT OSS 20b | 05 Aug 2025 | 0.55 | High Reasoning Effort | Yes | Source |
| EXAONE 4.0 1.2B | 15 Jul 2025 | 0.28 | Reasoning | Yes | Source |