Capabilities, modalities, and lifecycle fields pulled from the model database.
Comparative results across benchmarks shared by the selected models.
| Aider-Polyglot | 79.6% |
| Vending Bench 2 | 199946.0% |
| EQ-Bench 3 | 119320.0% |
| AIME 2025 | 91.7% |
| ARC-AGI-1 | 66.7% |
| ARC-AGI-2 | 16.2% |
| Confabulations | 1241.0% |
| Creative Story Writing | 769.0% |
| GPQA Diamond | 87.5% |
| HMMT 2025 | 90.0% |
| Elimation Game | 599.6% |
| MathArena Apex | 2.1% |
| Humanity's Last Exam | 25.4% |
| NYT Connections | 92.4% |
| Thematic Generalisation | 188.0% |
| USAMO 2025 | 37.5% |
Observed provider pricing per million tokens.
All unique meters observed across the selected models.
| Meter | Grok 4 |
|---|---|
| Input Text Tokens | $3.00 |
Providers that expose each model based on observed pricing data.
Plans that include each selected model, grouped by organisation.
2 plans
Maximum input and output token capacity.
Usage and distribution terms.
Model release chronology.
Most recent training data date (when available).
A deeper field-by-field view (including benchmarks, pricing, and links).
| General Information | |
| Context Window | Input: 256,000 Output: 256,000 |
| Modalities | In: Text Out: Text |
| Reasoning | - |
| Web access | - |
| Parameters | - |
| Training Tokens | - |
| License | Proprietary |
| Knowledge Cutoff | Nov 2024 |
| Status | Available |
| Release | Jul 2025 |
| Announced | Jul 2025 |
| Deprecation | - |
| Retirement | - |
| Links | |
| Operational Metrics | |
| Cost per 1M Tokens | Input: $3.00 Output: $15.00 |
| Latency | - |
| Throughput | - |
| Benchmarks | |
| AIME 2025 | |
| ARC-AGI-1 | |
| ARC-AGI-2 | |
| Aider-Polyglot | |
| Confabulations | |
| Creative Story Writing | |
| EQ-Bench 3 | |
| Elimation Game | |
| GPQA Diamond | |
| HMMT 2025 | |
| Humanity's Last Exam | |
| MathArena Apex | |
| NYT Connections | |
| Thematic Generalisation | |
| USAMO 2025 | |
| Vending Bench 2 | |