Quickstart

Start calling this model with endpoint-specific examples.

Benchmarks

Headline benchmark standings and comparison context.

About

Key dates, capabilities, and model metadata.

z.AI: GLM 4.6

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex.

Overview Playground Providers Pricing Performance Apps Activity Quickstart Benchmarks Family Timeline

Pricing

Effective pricing across providers over the past hour and 30-day pricing history by meter.

Effective Pricing

Based on the last hour of usage: we total provider spend, divide by total token volume, and express it as USD per 1M tokens. Providers with more traffic naturally carry more weight.

Weighted Avg Input Price

per 1M tokens (past hour)

Weighted Avg Output Price

per 1M tokens (past hour)

Provider	Input $/Million	Output $/Million	Cache Token %
AtlasCloud	$0.440	$1.740	--
Baseten	--	--	--
DeepInfra	$0.430	$1.740	--
GMICloud	$0.600	$2.000	--
NovitaAI	$0.550	$2.200	--
SiliconFlow	$0.390	$1.900	--
Venice	$0.850	$2.750	--
z.AI	$0.600	$2.200	--

Pricing History

Meter

No 30-day pricing history is available for the selected filters.