Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
MMLU-STEM - Benchmark Leaderboard & Model Performance | AI Stats
MMLU-STEM
Overview
Overview
Type: numerical
Chemistry
Recorded Results
1
Average Score
0.76
Score Range
0.76 - 0.76
Leading Model
0.76 - Qwen 14B
Scores Over Time
Individual benchmark scores plotted by date.
No scores available to display.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
Qwen
Qwen 14B
-
0.76
LLM Stats (ZeroEval) | inferred family alias from qwen-2.5-14b-instruct (score=0.3060; benches=16)
Yes
Source