Z.ai: GLM 5 Turbo

z-ai/glm-5-turbo
Mar 15, 2026 · 202.8K context · 131.1K max output · $1.20/M in · $4.00/M out · Reasoning

Description

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows involving long execution chains, with improved complex instruction decomposition, tool use, scheduled and persistent execution, and overall stability across extended tasks.

Specifications

Provider
z-ai
Context Length
202.8K
Max Output
131.1K
Modality
In: text
Out: text

Pricing

Type         Price / 1M tokens
Input        $1.20
Output       $4.00
Cache Read   $0.24
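
As a rough sketch of how the per-million-token prices above translate into a per-request cost, the arithmetic can be written out as below. The function name `estimate_cost` and its parameters are hypothetical, and the code assumes that cached input tokens are billed at the Cache Read rate in place of the Input rate, which this page does not state explicitly.

```python
from decimal import Decimal

# USD prices per 1M tokens, copied from the pricing table above.
PRICE_PER_MILLION = {
    "input": Decimal("1.20"),
    "output": Decimal("4.00"),
    "cache_read": Decimal("0.24"),
}


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: cached_tokens is the part of input_tokens served from
    cache, billed at the cache-read rate instead of the input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K input tokens and 20K output tokens, nothing cached.
    print(estimate_cost(100_000, 20_000))
    # Same request with half of the input served from cache.
    print(estimate_cost(100_000, 20_000, cached_tokens=50_000))
```

Under these assumptions, 100K input tokens plus 20K output tokens comes to $0.12 + $0.08 = $0.20, and caching half of the input lowers the total to $0.152. `Decimal` is used so the dollar amounts do not pick up binary floating-point rounding.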

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-5-turbo",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
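
The same request can be sketched in Python using only the standard library. The endpoint, headers, and JSON body mirror the curl command above; the helper names `build_request` and `send` are hypothetical, and because the response format is not documented on this page, the sketch returns the decoded JSON without assuming any particular fields.

```python
import json
import urllib.request

API_URL = "https://api.ominigate.ai/v1/chat/completions"
MODEL_ID = "z-ai/glm-5-turbo"


def build_request(api_key, prompt):
    """Build the POST request from the curl example (hypothetical helper)."""
    body = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return the decoded JSON response.

    The shape of the response is not specified on this page, so the
    parsed object is returned as-is.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Replace the placeholder key with a real one before running.
    req = build_request("sk-omg-your-api-key", "Hello!")
    print(send(req))
```

Building the request separately from sending it keeps the payload easy to inspect or log before any network call is made.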