Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.
Weighted Avg Input Price
--
per 1M tokens (past hour)
Weighted Avg Output Price
--
per 1M tokens (past hour)
| Provider | Input $/Million | Output $/Million | Cache Token % |
|---|---|---|---|
Fireworks | $1.200 | $1.200 | -- |
Meter
Public apps observed in gateway request traffic for this model.
Core latency and throughput trends from recent traffic.
Start calling this model with endpoint-specific examples.
# 1) Set your key
export AI_STATS_API_KEY="aistats_***"
# 2) Send a request
curl -s https://api.phaseo.app/v1/responses \
-H "Authorization: Bearer $AI_STATS_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "cogito/cogito-671b-v2.1",
"input": "Give me one fun fact about cURL."
}'Latency
Throughput
Uptime
Total Context