Public apps observed in gateway request traffic for this model.
Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.
Weighted Avg Input Price: -- per 1M tokens (past hour)
Weighted Avg Output Price: -- per 1M tokens (past hour)
| Provider | Input ($/1M tokens) | Output ($/1M tokens) | Cache Token % |
|---|---|---|---|
| AtlasCloud | $0.216 | $0.880 | -- |
| Baseten | $0.770 | $0.770 | -- |
| DeepInfra | $0.200 | $0.770 | -- |
| GMICloud | $0.180 | $0.600 | -- |
| Nebius Token Factory | $0.750 | $2.250 | -- |
| NovitaAI | $0.270 | $1.120 | -- |
| Weights & Biases | -- | -- | -- |
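
The traffic-weighted average described above (total spend divided by total token volume, expressed per 1M tokens) can be sketched as follows. This is a minimal illustration, not the gateway's actual implementation: the `ProviderUsage` type and the token volumes are hypothetical, and only the two input prices are taken from the table.

```python
from dataclasses import dataclass


@dataclass
class ProviderUsage:
    name: str
    input_price_per_million: float  # USD per 1M input tokens (from the table)
    input_tokens: int               # hypothetical hourly volume, not real data


def weighted_avg_price(usages):
    """Total spend divided by total tokens, expressed as USD per 1M tokens."""
    total_tokens = sum(u.input_tokens for u in usages)
    if total_tokens == 0:
        return None  # no traffic in the window, so there is no average to report
    total_spend = sum(
        u.input_price_per_million * u.input_tokens / 1_000_000 for u in usages
    )
    return total_spend / total_tokens * 1_000_000


# Hypothetical traffic split: DeepInfra serves three times as many tokens.
usages = [
    ProviderUsage("DeepInfra", 0.200, 3_000_000),
    ProviderUsage("Nebius Token Factory", 0.750, 1_000_000),
]

if __name__ == "__main__":
    print(f"${weighted_avg_price(usages):.4f} per 1M tokens")
```

With these assumed volumes the result is about $0.34 per 1M tokens, which sits closer to the higher-traffic provider's price than the simple mean of the two prices ($0.475) does.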
Meter
Core latency and throughput trends from recent traffic.
Metrics shown: Latency, Throughput, Uptime, Total Context, Max Output.
Start calling this model with endpoint-specific examples.
# 1) Set your key
export AI_STATS_API_KEY="aistats_***"
# 2) Send a request
curl -s https://api.phaseo.app/v1/responses \
  -H "Authorization: Bearer $AI_STATS_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek/deepseek-v3-0324",
    "input": "Give me one fun fact about cURL."
  }'
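
The same request can be sketched in Python using only the standard library. The URL, headers, model ID, and payload fields mirror the curl example above; the `build_request` helper is a hypothetical name introduced for illustration. The response format is not documented here, so the sketch prints the raw response body rather than parsing it.

```python
import json
import os
import urllib.request

API_URL = "https://api.phaseo.app/v1/responses"  # endpoint from the curl example
MODEL = "deepseek/deepseek-v3-0324"              # model ID from the curl example


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    payload = json.dumps({"model": MODEL, "input": prompt}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    key = os.environ.get("AI_STATS_API_KEY")
    if key:  # only send when a key is configured
        req = build_request(key, "Give me one fun fact about cURL.")
        with urllib.request.urlopen(req, timeout=30) as resp:
            # Response shape is an unknown here, so print the raw body.
            print(resp.read().decode("utf-8"))
```

Separating request construction from sending makes it possible to inspect the headers and body without making a network call.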