Docs
Search
Ctrl K
Models
Playground
Compare
Providers
Apps
Rankings
Models
Playground
Compare
Providers
Apps
Rankings
Docs
Search
Ctrl K
Sign In
Sign In
MMLU Chat - Benchmark Leaderboard & Model Performance | AI Stats
MMLU Chat
Overview
Overview
Type: percentage
General
Recorded Results
1
Average Score
80.58%
Score Range
80.58% - 80.58%
Leading Model
80.58% - Llama 3.1 Nemotron 70B Instruct
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
Nvidia
Llama 3.1 Nemotron 70B Instruct
01 Oct 2024
80.58%
-
Yes
Source