Back to Models

Z.ai: GLM 4.7 Flash

z-ai/glm-4.7-flash
Jan 19, 2026202.8K context$0.07/M in · $0.40/M outReasoning

Description

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.

Specifications

Provider
z-ai
Context Length
202.8K
Max Output
Modality
Intext
Outtext

Pricing

TypePrice / 1M tokens
Input$0.07
Output$0.40
Cache Read$0.01

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-4.7-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
Z.ai: GLM 4.7 Flash — Pricing & API on OminiGate