OfficeQA - Benchmark Leaderboard & Model Performance | AI Stats
OfficeQA
Overview
Type: percentage
Professional
Recorded Results: 3
Average Score: 69.50%
Score Range: 54.10% to 86.30%
Leading Model: Claude Opus 4.7 (86.30%)
Scores Over Time
Individual benchmark scores plotted by date.
Models Using This Benchmark

Organisation | Model | Reported | Top Score | Info | Self Reported
Anthropic | Claude Opus 4.7 | 16 Apr 2026 | 86.30% | - | Yes
OpenAI | GPT 5.4 | 05 Mar 2026 | 68.10% | - | Yes
OpenAI | GPT 5.5 | 23 Apr 2026 | 54.10% | Pro | Yes
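
The summary figures (recorded results, average, range, leading model) can be re-derived from the three recorded scores. The following is a minimal Python sketch, assuming the scores are transcribed by hand from the rows above; the dictionary and variable names are illustrative and are not part of any AI Stats API.

```python
from statistics import mean

# Top scores (in percent) transcribed from the leaderboard rows.
# The dictionary layout is an assumption made for this example.
scores = {
    "Claude Opus 4.7": 86.30,
    "GPT 5.4": 68.10,
    "GPT 5.5": 54.10,
}

recorded = len(scores)                     # number of recorded results
average = round(mean(scores.values()), 2)  # arithmetic mean, 2 decimals
lowest = min(scores.values())              # bottom of the score range
highest = max(scores.values())             # top of the score range
leader = max(scores, key=scores.get)       # model with the highest score

print(f"Recorded Results: {recorded}")
print(f"Average Score: {average:.2f}%")
print(f"Score Range: {lowest:.2f}% to {highest:.2f}%")
print(f"Leading Model: {leader} ({highest:.2f}%)")
```

All three results are self-reported, so the average is a plain unweighted mean over vendor-published numbers and not an independently verified figure.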