GLM 5.1

z-ai/glm-5.1
旗艦
來自 Z.ai · text in · text out · 200K ctx · 2026-04-07

Z.ai's strongest coding-and-agent model in the GLM-5 line; supports streaming tool calls and deep th…

INPUT$1.40/ 1M tokens
OUTPUT$4.40/ 1M tokens
p50 TTFT2.44 s7d
p95 TTFT7.66 s7d
TRAFFIC1.9Mtokens / 7d
程式碼範例
from openai import OpenAI

client = OpenAI(
    base_url="https://api.orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="z-ai/glm-5.1",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
價格
輸入 / 1M tokens$1.40
輸出 / 1M tokens$4.40
快取讀取 / 1M$0.260
貨幣USD
效能
TTFT p50
2.44 s
輸出速度
54.4 tok/s
TTFT p95
7.66 s
錯誤率
0%
公開基準測試
AA Coding
43.4
AA Intelligence
51.4
GPQA Diamond
86.8
Humanity's Last Exam
28.0
IFBench
76.3
Long-Context Recall
62.3
SciCode
43.8
TerminalBench Hard
43.2
τ²-Bench
97.7
Source: artificialanalysis.ai
常見問題
How much does Z.ai: GLM 5.1 cost on OrcaRouter?
Z.ai: GLM 5.1 is priced at $1.40 per 1M input tokens and $4.40 per 1M output tokens via OrcaRouter. Pricing is pulled live from the routing layer.
What is Z.ai: GLM 5.1's context window?
Z.ai: GLM 5.1 supports a context window of 200K tokens. Use long-context features (RAG, summarisation) up to that limit.
How do I call Z.ai: GLM 5.1 via the OpenAI SDK?
Set OpenAI base_url to https://api.orcarouter.ai/v1, supply your OrcaRouter API key, and pass model="z-ai/glm-5.1" in the chat.completions.create call.
Does OrcaRouter rate-limit Z.ai: GLM 5.1?
Per-model rate limits follow your OrcaRouter plan. Free tiers ship with conservative caps; paid tiers lift them. Check /pricing for current quotas.
嵌入此徽章
Z.ai: GLM 5.1$1.40/M in2444ms p50via OrcaRouter
HTML <a href="https://www.orcarouter.ai/models/z-ai/glm-5.1" target="_blank"> <img src="https://www.orcarouter.ai/embed/z-ai/glm-5.1.svg" alt="Z.ai: GLM 5.1 on OrcaRouter" /> </a>
Markdown [![Z.ai: GLM 5.1](https://www.orcarouter.ai/embed/z-ai/glm-5.1.svg)](https://www.orcarouter.ai/models/z-ai/glm-5.1)