Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Trinity Large Thinking | 01 Apr 2026 | 91.90 | Hugging Face model card benchmark table (arcee-ai/Trinity-Large-Thinking) | Yes | Source | |
| MiMo V2 Omni | 18 Mar 2026 | 81.20 | LLM Stats (ZeroEval) | Yes | Source | |
| MiMo V2 TTS | 18 Mar 2026 | 81.00 | LLM Stats (ZeroEval) | inferred modality/version alias from mimo-v2-pro | Yes | Source | |
| MiMo V2 Pro | 18 Mar 2026 | 81.00 | LLM Stats (ZeroEval) | Yes | Source | |
| GLM 5V Turbo | 01 Apr 2026 | 80.70 | LLM Stats (ZeroEval) | Yes | Source |