Search...
Ctrl K
Models
Providers
Apps
Rankings
Playground
Models
Providers
Apps
Rankings
Playground
Search...
Ctrl K
Sign In
Sign In
MM IF-Eval - Benchmark Leaderboard & Model Performance | AI Stats
MM IF-Eval
Overview
Overview
Type: numerical
Multimodal
Recorded Results
1
Average Score
0.53
Score Range
0.53 - 0.53
Leading Model
0.53 - Pixtral 12B
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation
Model
Reported
Top Score
Info
Self Reported
Source
Mistral
Pixtral 12B
17 Sept 2024
0.53
LLM Stats (ZeroEval) | inferred version-family alias from pixtral-12b-2409
Yes
Source