z.AI: GLM 5

GLM 5 is Z.ai's flagship open-source foundation model engineered for complex systems design and long-horizon agent workflows. It accepts text inputs and produces text outputs.

Pricing

Effective pricing across providers over the past hour and 30-day pricing history by meter.

Effective Pricing

Effective prices are computed from the last hour of usage: total provider spend is divided by total token volume, and the result is expressed as USD per 1M tokens. Because the average is volume-weighted, providers with more traffic carry more weight.
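The calculation described above can be sketched as follows. This is a minimal illustration, not the site's actual implementation: the `ProviderUsage` type, the function name, and the spend and token figures are all hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class ProviderUsage:
    """Hypothetical per-provider usage for one meter over the past hour."""
    name: str
    spend_usd: float  # total spend on this meter in the window
    tokens: int       # total tokens billed on this meter in the window


def effective_price_per_million(usage: Iterable[ProviderUsage]) -> Optional[float]:
    """Total spend divided by total tokens, expressed as USD per 1M tokens."""
    rows = list(usage)
    total_tokens = sum(u.tokens for u in rows)
    if total_tokens == 0:
        return None  # no traffic in the window, so no effective price
    total_spend = sum(u.spend_usd for u in rows)
    return total_spend / total_tokens * 1_000_000


# Made-up traffic: provider A bills $1.00 per 1M tokens, provider B $0.60 per 1M.
sample = [
    ProviderUsage("provider-a", spend_usd=3.00, tokens=3_000_000),
    ProviderUsage("provider-b", spend_usd=0.60, tokens=1_000_000),
]
price = effective_price_per_million(sample)
print(f"{price:.3f}")  # prints 0.900
```

With three times the traffic, provider A pulls the result to 0.900, closer to its own price than the unweighted mean of 0.800.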

Provider	Input (USD per 1M tokens)	Output (USD per 1M tokens)	Cache Token %
Amazon Bedrock	$1.000	$3.200	--
AtlasCloud	$0.950	$3.150	--
Baseten	$0.950	$3.150	--
Canopy Wave	--	--	--
CrofAI	$0.480	$1.900	--
DeepInfra	$0.600	$2.080	--
DigitalOcean	$1.000	$3.200	--
Fireworks	--	--	--
Friendli	$1.000	$3.200	--
GMICloud	$1.000	$3.200	--
IonRouter	$1.200	$3.500	--
Nebius Token Factory	$1.000	$3.200	--
NovitaAI	$1.000	$3.200	--
SiliconFlow	$0.950	$2.550	--
Tensorix	--	--	--
Together	$1.000	$3.200	--
Venice	$1.000	$3.200	--
Venice (E2EE)	$1.100	$4.150	--
Weights & Biases	$1.000	$3.200	--
z.AI	$1.000	$3.200	--
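
To show how the per-1M-token prices translate into the cost of a single request, here is a small worked sketch. The function name and the token counts are hypothetical; the prices are the DeepInfra row from the table above, and the sketch assumes input and output tokens are billed independently at the listed rates with no cache discount.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Cost of one request given prices quoted in USD per 1M tokens."""
    return (
        input_tokens * input_price_per_m + output_tokens * output_price_per_m
    ) / 1_000_000


# Hypothetical request: 10,000 input tokens and 2,000 output tokens,
# priced at the DeepInfra rates listed above ($0.600 in, $2.080 out).
cost = request_cost_usd(10_000, 2_000, 0.600, 2.080)
print(f"${cost:.5f}")  # prints $0.01016
```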

Pricing History

No 30-day pricing history is available for the selected filters.