Instruct HumanEval - Benchmark Leaderboard & Model Performance | AI Stats
Instruct HumanEval
Overview
Type: percentage
General
Recorded Results: 1
Average Score: 73.84
Score Range: 73.84 - 73.84
Leading Model: Llama 3.1 Nemotron 70B Instruct (73.84)
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark
Organisation: Nvidia
Model: Llama 3.1 Nemotron 70B Instruct
Reported: 01 Oct 2024
Top Score: 73.84
Info: LLM Stats (ZeroEval)
Self Reported: Yes