Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
Multilingual MMLU - Benchmark Leaderboard & Model Performance | AI Stats
Multilingual MMLU
Overview
Overview
Type: percentage
General
Recorded Results
2
Average Score
65
Score Range
49.30 - 80.70
Leading Model
80.70 - o3 mini
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
OpenAI
o3 mini
30 Jan 2025
80.70
LLM Stats (ZeroEval)
Yes
Source
Microsoft
Phi 4 Mini
01 Feb 2025
49.30
LLM Stats (ZeroEval)
Yes
Source