Capabilities, modalities, and lifecycle fields pulled from the model database.
Comparative results across benchmarks shared by the selected models.
| IF-Bench | 80.2% |
| MMMU Pro | 63.5% |
| OCRBench V2 | 64.5% |
| Tau 2 Retail | 92.7% |
| BFCL Overall FC V4 | 65.2% |
| Scale MCP Atlas | 61.6% |
| SWE-Bench | 70.0% |
| Terminal Bench | 41.3% |
| LongCodeBench 1M | 84.0% |
| Tau 2 Telecom | 94.5% |
| Tau 2 Airline | 77.7% |
| ScreenSpot | 88.1% |
| LiveCodeBench Coding | 74.6% |
| MMLU-Pro | 81.6% |
| MMLU Pro | 77.7% |
| GPQA Diamond | 81.4% |
| AIME 2025 | 92.3% |
| Realkie | 67.0% |
| QVHighlights | 76.7% |
Observed provider pricing per million tokens.
No providers found yet.
Maximum input and output token capacity.
Usage and distribution terms.
Model release chronology.
A deeper field-by-field view (including benchmarks, pricing, and links).
| General Information | |
| Context Window | Input: 1,000,000 Output: - |
| Modalities | In: Text, Vision, Video, Audio Out: Text |
| Reasoning | - |
| Web access | - |
| Parameters | - |
| Training Tokens | - |
| License | Proprietary |
| Knowledge Cutoff | - |
| Status | Available |
| Release | Dec 2025 |
| Announced | Dec 2025 |
| Deprecation | - |
| Retirement | - |
| Links | |
| Operational Metrics | |
| Cost per 1M Tokens | Input: - Output: - |
| Latency | - |
| Throughput | - |
| Benchmarks | |
| AIME 2025 | |
| BFCL Overall FC V4 | |
| GPQA Diamond | |
| IF-Bench | |
| LiveCodeBench Coding | |
| LongCodeBench 1M | |
| MMLU Pro | |
| MMLU-Pro | |
| MMMU Pro | |
| OCRBench V2 | |
| QVHighlights | |
| Realkie | |
| SWE-Bench | |
| Scale MCP Atlas | |
| ScreenSpot | |
| Tau 2 Airline | |
| Tau 2 Retail | |
| Tau 2 Telecom | |
| Terminal Bench | |