glm-5.1 is the published flagship of Zhipu's (智谱) GLM family: strong on Chinese tasks and vision-capable. It supersedes the earlier glm-5 preview.

Pricing

Input          Cache read     Output
$1.40 / 1M     $0.26 / 1M     $4.40 / 1M
Cache write is billed at the input rate. See the pricing table for the live rate.
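
As a rough illustration of how the rates above combine, the sketch below estimates the cost of a single request. It assumes the "/ 1M" figures are per one million tokens and that cache-write tokens are counted with ordinary input, as stated above. The function name `estimate_cost` and the token counts are hypothetical, and the rates are hard-coded from this page rather than fetched from the live pricing table.

```shell
# Hypothetical helper: estimate the USD cost of one request from token counts.
# Rates are the listed figures on this page, assumed to be per 1M tokens.
estimate_cost() {
  # $1 = input tokens billed at the input rate (includes cache writes)
  # $2 = cache-read tokens
  # $3 = output tokens
  awk -v in_tok="$1" -v cache_tok="$2" -v out_tok="$3" 'BEGIN {
    cost = (in_tok * 1.40 + cache_tok * 0.26 + out_tok * 4.40) / 1000000
    printf "%.4f\n", cost
  }'
}

# Example: 100K fresh input, 400K cache reads, 20K output tokens
estimate_cost 100000 400000 20000   # prints 0.3320
```

In this example the cache reads make up most of the prompt but only about a third of the bill, which is the main effect of the lower cache-read rate.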

Protocols

Protocol                   Path
OpenAI Chat Completions    POST https://llm.bytespike.ai/v1/chat/completions

Quickstart

curl https://llm.bytespike.ai/v1/chat/completions \
  -H "Authorization: Bearer $BYTESPIKE_API_KEY" \
  -H "content-type: application/json" \
  -d '{
    "model": "glm-5-1",
    "messages": [{ "role": "user", "content": "Hello, ByteSpike." }]
  }'

Capabilities

Capability                  Supported
Chat Completions
Streaming (SSE)
Tools / function calling
JSON mode
Vision (image input)
Context window              128K tokens
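
The sketch below shows what request bodies for two of the listed capabilities, streaming and JSON mode, might look like. The field names (`stream`, `response_format`) come from the OpenAI Chat Completions format; that this endpoint accepts them exactly as written is an assumption. The helper names `stream_body`, `json_mode_body`, and `send_chat` are hypothetical, and the model ID is copied from the Quickstart.

```shell
# Streaming (SSE): OpenAI-style "stream": true (assumed to be honored here).
stream_body() {
  cat <<'EOF'
{
  "model": "glm-5-1",
  "stream": true,
  "messages": [{ "role": "user", "content": "Hello, ByteSpike." }]
}
EOF
}

# JSON mode: OpenAI-style response_format (assumed to be honored here).
json_mode_body() {
  cat <<'EOF'
{
  "model": "glm-5-1",
  "response_format": { "type": "json_object" },
  "messages": [{ "role": "user", "content": "Reply with a JSON object containing a greeting." }]
}
EOF
}

# Hypothetical wrapper around the Quickstart call; -N disables curl's
# output buffering so streamed chunks are shown as they arrive.
send_chat() {
  curl -sS -N https://llm.bytespike.ai/v1/chat/completions \
    -H "Authorization: Bearer $BYTESPIKE_API_KEY" \
    -H "content-type: application/json" \
    -d "$1"
}

# Usage (needs BYTESPIKE_API_KEY and network access):
#   send_chat "$(stream_body)"
#   send_chat "$(json_mode_body)"
```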

When to use

  • Chinese-language general chat — GLM-5.1 is competitive with Doubao on Chinese benchmarks.
  • Vision workloads in the CN region: good OCR and document understanding.
The earlier glm-5 preview is legacy and no longer the published flagship — use glm-5.1.
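
For the OCR and document-understanding use case above, a request might carry the image as an `image_url` content part, as in the OpenAI Chat Completions format. This is a sketch under that assumption, not a confirmed schema for this endpoint. The helper name `vision_body` and the example URL are hypothetical, and the model ID is copied from the Quickstart.

```shell
# Hypothetical helper: build a vision request body for a given image URL.
# The content-part layout follows the OpenAI format (an assumption here).
vision_body() {
  # $1 = publicly reachable image URL
  cat <<EOF
{
  "model": "glm-5-1",
  "messages": [{
    "role": "user",
    "content": [
      { "type": "text", "text": "Extract the text from this document." },
      { "type": "image_url", "image_url": { "url": "$1" } }
    ]
  }]
}
EOF
}

# Usage (needs BYTESPIKE_API_KEY and network access; the URL is a placeholder):
#   curl https://llm.bytespike.ai/v1/chat/completions \
#     -H "Authorization: Bearer $BYTESPIKE_API_KEY" \
#     -H "content-type: application/json" \
#     -d "$(vision_body "https://example.com/invoice.png")"
```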
