GSM8K - Benchmark Leaderboard & Model Performance | AI Stats
GSM8K
Overview
Recorded Results: 2
Average Score: 0.60
Score Range: 0.57 - 0.63
Leading Model: Grok 1 (0.63)
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation  Model   Reported     Top Score  Info    Self Reported  Source
xAI           Grok 1  03 Nov 2023  0.63       8 Shot  Yes            Source
xAI           Grok 0  -            0.57       8 Shot  Yes            Source
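
The overview figures can be checked against the two recorded results listed on this page. The sketch below is illustrative only: it assumes the average is a plain arithmetic mean rounded to two decimals, which the page does not state, and the `results` list and `summarise` helper are hypothetical names, not part of any AI Stats API.

```python
from statistics import mean

# Scores copied from the results listed on this page (8 shot, self reported).
results = [
    {"organisation": "xAI", "model": "Grok 1", "score": 0.63},
    {"organisation": "xAI", "model": "Grok 0", "score": 0.57},
]


def summarise(rows):
    """Return the count, mean, range, and top entry for a list of score rows.

    Assumption: the average is an arithmetic mean rounded to two decimals.
    """
    scores = [row["score"] for row in rows]
    leader = max(rows, key=lambda row: row["score"])
    return {
        "recorded_results": len(rows),
        "average_score": round(mean(scores), 2),
        "score_range": (min(scores), max(scores)),
        "leading_model": leader["model"],
    }


summary = summarise(results)
print(summary)
```

Under that assumption the computed values agree with the overview: two results, an average of 0.60, a range of 0.57 to 0.63, and Grok 1 as the leading model.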