API providers, route pricing, availability, and recent reliability signals.
Latency, throughput, and reliability signals from recent traffic.
Effective prices over the last 30 days, with current provider list prices for context.
Headline benchmark standings and comparison context.
Daily gateway activity over the last 30 days, with current UTC-day pace projection.
Public apps observed in gateway request traffic for this model.
Uptime trend for this model over the last 24 hours.
Start calling this model with endpoint-specific examples.
Key dates, capabilities, and model metadata.
Commercial plans and bundled access that currently include this model.
Google: Gemma 4 31B
Weighted provider pricing over the last 30 days, with recent route pricing history below.
02 Apr 2026·3 months ago
02 Apr 2026·3 months ago
Not listed
Not listed
Parameters: Not listed
License: Apache 2.0
Training Tokens: Not listed
Input
Output
Choose a supported endpoint, pick a main language, then select the example style you want to copy.
Supported API reference routes for this model.
import AIStats from '@ai-stats/sdk';

const client = new AIStats({
  apiKey: process.env.AI_STATS_API_KEY,
});

const response = await client.generateResponse({
  "model": "google/gemma-4-31b",
  "input": "Give me one fun fact about cURL."
});

// Flatten the content parts of every output item and take the first text part.
const outputText = response.output
  ?.flatMap((item) => item.content ?? [])
  .find((item) => item.type === "output_text")
  ?.text;

// Fall back to logging the whole response when no text part is present.
console.log(outputText ?? response);

Parameters
Aggregated across active providers for the responses route.
Routing will select a compatible provider when a parameter narrows availability, so this list stays model-facing instead of provider-facing.
| Parameter | Description |
|---|---|
| temperature | Controls how random token selection can be. |
| top_p | Applies nucleus sampling by limiting candidates to a probability mass threshold. |
| top_k | Restricts sampling to the top-k candidate tokens on providers that expose it. |
| max_tokens | Caps output length on endpoints and providers that use the max_tokens field name. |
| frequency_penalty | Discourages repeated tokens in proportion to how often they already appeared. |
| presence_penalty | Penalizes tokens once they have appeared at all, nudging the model toward new wording or topics. |
| seed | Requests deterministic sampling when the upstream provider supports seeded generation. |
| stop | Defines one or more sequences that terminate generation early. |
| logprobs | Requests token-level probability data in the response. |
| structured_outputs | Capability signal for reliable schema-constrained output workflows. |
| reasoning | Provider-specific reasoning configuration for reasoning-capable APIs. |
| logit_bias | Adjusts token selection bias directly when a provider exposes logit control. |
| top_logprobs | Limits how many alternative token probabilities are returned per position. |
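
As a rough illustration of how a few of the parameters above might be combined in one request body, the sketch below builds a payload for the responses route. The `buildRequest` helper and the `SamplingOptions` type are hypothetical names. Whether each field is accepted at the top level of the request under exactly these names is an assumption to check against the API reference; only `model` and `input` come from the example earlier on this page.

```typescript
// Hypothetical option bag: field names mirror the parameter table, but their
// placement in the request body is an assumption, not a confirmed API shape.
interface SamplingOptions {
  temperature?: number;
  top_p?: number;
  top_k?: number;
  max_tokens?: number;
  seed?: number;
  stop?: string[];
}

// Build a request body that contains only the options that were actually set,
// since a parameter can narrow which providers are eligible for routing.
function buildRequest(
  model: string,
  input: string,
  options: SamplingOptions = {},
): Record<string, unknown> {
  // Conventional ranges for these samplers; individual providers may differ.
  if (options.top_p !== undefined && (options.top_p <= 0 || options.top_p > 1)) {
    throw new RangeError("top_p must be in (0, 1]");
  }
  if (options.temperature !== undefined && options.temperature < 0) {
    throw new RangeError("temperature must be non-negative");
  }

  const body: Record<string, unknown> = { model, input };
  for (const key of Object.keys(options) as (keyof SamplingOptions)[]) {
    const value = options[key];
    if (value !== undefined) {
      body[key] = value;
    }
  }
  return body;
}

const request = buildRequest(
  "google/gemma-4-31b",
  "Give me one fun fact about cURL.",
  { temperature: 0.2, top_p: 0.9, seed: 42 },
);
console.log(request);
```

Omitting unset fields, rather than sending explicit defaults, keeps the request as permissive as possible for routing.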
Top benchmark results for google/gemma-4-31b.
Detailed benchmark comparisons now live in the Compare tool.
Weighted by routed usage over the last 30 days.
Weighted input price: -- per 1M tokens, across 0 input tokens
Weighted output price: -- per 1M tokens, across 0 output tokens
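
One plausible reading of "weighted by routed usage" is a token-weighted average of each provider's price. The sketch below assumes that definition; the `RouteUsage` shape and the `weightedPrice` function are hypothetical names, not the gateway's actual implementation. It returns `null` when no tokens were routed, which would correspond to the `--` placeholder shown when the token total is 0.

```typescript
// Hypothetical record of one provider route's usage in the 30-day window.
interface RouteUsage {
  pricePerMillion: number; // price per 1M tokens
  tokens: number; // tokens routed through this provider
}

// Token-weighted average price: sum(price * tokens) / sum(tokens).
// Returns null when nothing was routed, because the average is undefined.
function weightedPrice(routes: RouteUsage[]): number | null {
  let totalTokens = 0;
  let weightedSum = 0;
  for (const route of routes) {
    totalTokens += route.tokens;
    weightedSum += route.pricePerMillion * route.tokens;
  }
  return totalTokens === 0 ? null : weightedSum / totalTokens;
}

// Illustrative numbers only, not taken from this page.
const sample: RouteUsage[] = [
  { pricePerMillion: 0.1, tokens: 3e6 },
  { pricePerMillion: 0.2, tokens: 1e6 },
];
console.log(weightedPrice(sample));
console.log(weightedPrice([])); // null: no routed tokens
```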
| Provider | Input price | Output price | Usage share | Count |
|---|---|---|---|---|
| | -- | -- | 100.0% | 237 |
| | $0.14 | $0.4 | -- | 0 |
| | $0.99 | $1.49 | -- | 0 |
| CrofAI | $0.1 | $0.3 | -- | 0 |
| | $0.18 | $0.5 | -- | 0 |
| | $0.14 | $0.4 | -- | 0 |
| | $0.14 | $0.4 | -- | 0 |
| | $0.14 | $0.4 | -- | 0 |
| | $0.13 | $0.4 | -- | 0 |
| | $0.2 | $0.5 | -- | 0 |
| | $0.17 | $0.5 | -- | 0 |
| | $0.3 | $1.25 | -- | 0 |
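
To make the per-1M-token prices concrete, here is a small sketch that estimates the cost of one request. It assumes the two dollar figures in each row are the input and output prices per 1M tokens, in that order, and that billing is linear in token count with no minimums or cache discounts. The `estimateCost` function is a hypothetical helper, and the token counts are made up; the prices are the CrofAI row's $0.1 and $0.3.

```typescript
// Estimate request cost from per-1M-token prices (assumed linear billing).
function estimateCost(
  inputTokens: number,
  outputTokens: number,
  inputPricePerMillion: number,
  outputPricePerMillion: number,
): number {
  return (
    (inputTokens * inputPricePerMillion + outputTokens * outputPricePerMillion) /
    1_000_000
  );
}

// 2,000 input tokens and 500 output tokens at $0.1 / $0.3 per 1M tokens.
const cost = estimateCost(2000, 500, 0.1, 0.3);
console.log(cost.toFixed(6)); // "0.000350"
```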