Loading...
Loading...
AI Stats
Home
Comparisons
Providers
Models
Benchmarks
Prices
Open menu
Tau Bench (Airline)
7
Total Models
46.26
Average Score
20.50 - 60.00
Score Range
1
Max Score Achievable
Top 10 Model Performance
7 models
Models Using This Benchmark
(7)
LG
(2 models)
EXAONE 4.0 32B
lg
51.50%
EXAONE 4.0 1.2B
lg
20.50%
OpenAI
(2 models)
gpt-oss-120b
openai
49.20%
gpt-oss-20b
openai
42.60%
Qwen
(2 models)
Qwen3 Coder 480B A35B Instruct
qwen
60.00%
Qwen3 A235 A22B Instruct 2507
qwen
44.00%
Anthropic
(1 model)
Claude Opus 4.1
anthropic
56.00%