Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
MMMU (validation) - Benchmark Leaderboard & Model Performance | AI Stats
MMMU (validation)
Overview
Overview
Type: numerical
General
Recorded Results
1
Average Score
0.73
Score Range
0.73 - 0.73
Leading Model
0.73 - Claude Haiku 4.5
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
Anthropic
Claude Haiku 4.5
15 Oct 2025
0.73
LLM Stats (ZeroEval) | inferred alias from claude-haiku-4-5-20251001
Yes
Source