Individual benchmark scores plotted by date.
| Organisation | Model | Reported | Top Score | Info | Self Reported | Source |
|---|---|---|---|---|---|---|
| Qwen 2.5 Omni 7B | - | 0.08 | LLM Stats (ZeroEval) | Yes | Source | |
| Qwen 2.5 Coder 7B | - | 0.08 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen2.5-omni-7b (score=0.4700; benches=45) | Yes | Source | |
| Qwen 2.5 Coder 3B | - | 0.08 | LLM Stats (ZeroEval) | inferred family alias from qwen2.5-omni-7b (score=0.3000; benches=45) | Yes | Source | |
| Qwen 2.5 Math 7B | - | 0.08 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen2.5-omni-7b (score=0.4767; benches=45) | Yes | Source | |
| Qwen 2.5 Math 7B PRM800K | - | 0.08 | LLM Stats (ZeroEval) | inferred family alias from qwen2.5-omni-7b (score=0.3696; benches=45) | Yes | Source | |
| Qwen 2.5 Math PRM 7B | - | 0.08 | LLM Stats (ZeroEval) | inferred family alias from qwen2.5-omni-7b (score=0.4092; benches=45) | Yes | Source | |
| Qwen 2.5 Omni 3B | - | 0.08 | LLM Stats (ZeroEval) | inferred high-confidence family alias from qwen2.5-omni-7b (score=0.4933; benches=45) | Yes | Source |