Public apps observed in gateway request traffic for this model.
Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.
Weighted Avg Input Price
--
per 1M tokens (past hour)
Weighted Avg Output Price
--
per 1M tokens (past hour)
| Provider | Input $/Million | Output $/Million | Cache Token % |
|---|---|---|---|
DeepInfra | -- | -- | -- |
Meter
Core latency and throughput trends from recent traffic.
Latency
Throughput
Uptime
Total Context
Max Output
| Meter | Unit | Price | Conditions | Rule |
|---|---|---|---|---|
| total_tokens | Per 1M tokens | $0.18 | -- | 40b784ea-8fb2-455d-a602-e5c543a936c2 |
Start calling this model with endpoint-specific examples.
# 1) Set your key
export AI_STATS_API_KEY="aistats_***"
# 2) Send a request
curl -s https://api.phaseo.app/v1/responses \
-H "Authorization: Bearer $AI_STATS_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "meta/llama-guard-4-12b",
"input": "Give me one fun fact about cURL."
}'