CommonSenseQA - Benchmark Leaderboard & Model Performance | AI Stats
CommonSenseQA
Overview
Type: numerical
Language
Recorded Results: 1
Average Score: 0.70
Score Range: 0.70 - 0.70
Leading Model: Mistral Nemo 12B (0.70)
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation: Mistral
Model: Mistral Nemo 12B
Reported: 18 Jul 2024
Top Score: 0.70
Info: inferred family alias from mistral-nemo-instruct-2407 (score=0.3250; benches=8)
Self Reported: Yes