Z.ai: GLM 4.7 Flash
z-ai/glm-4.7-flash · Jan 19, 2026 · 202.8K context · $0.07/M in · $0.40/M out · Reasoning
Description
As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning, and tool collaboration, and has achieved leading performance among open-source models of the same size on several current public benchmark leaderboards.
Specifications
Provider
z-ai
Context Length
202.8K
Max Output
—
Modality
In: text
Out: text
Pricing
| Type | Price / 1M tokens |
|---|---|
| Input | $0.07 |
| Output | $0.40 |
| Cache Read | $0.01 |
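To make the per-token rates concrete, here is a minimal Python sketch that estimates the cost of a single request from the table above. The function name `estimate_cost` is hypothetical, and the billing rule is an assumption: the sketch treats cached tokens as a subset of the input tokens, charged at the Cache Read rate instead of the Input rate. The actual billing rules may differ.

```python
# Prices in USD per 1M tokens, copied from the pricing table above.
PRICE_PER_M = {"input": 0.07, "output": 0.40, "cache_read": 0.01}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    Assumption (not confirmed by the page): cached_tokens are part of
    input_tokens and are billed at the cache-read rate instead of the
    regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_M["input"]
        + cached_tokens * PRICE_PER_M["cache_read"]
        + output_tokens * PRICE_PER_M["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 100K input tokens (half served from cache) and 2K output tokens.
    print(f"${estimate_cost(100_000, 2_000, cached_tokens=50_000):.4f}")
```

Under these assumptions, one million uncached input tokens plus one million output tokens come to $0.47.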
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "z-ai/glm-4.7-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
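The same request can be sketched in Python using only the standard library. The URL, headers, and JSON body mirror the curl command above. The helper names `build_request` and `chat` are hypothetical, and the response parsing assumes an OpenAI-style payload (`choices[0].message.content`), which this page does not confirm.

```python
import json
import urllib.request

API_URL = "https://api.ominigate.ai/v1/chat/completions"
MODEL = "z-ai/glm-4.7-flash"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the same POST request as the curl command, without sending it."""
    body = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def chat(api_key: str, prompt: str, timeout: float = 60.0) -> str:
    """Send the request and return the reply text.

    Assumption: the response follows the OpenAI-style chat completions
    shape, with the text at choices[0].message.content.
    """
    request = build_request(api_key, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["choices"][0]["message"]["content"]
```

Calling `chat("sk-omg-your-api-key", "Hello!")` with a real key sends the request. Keeping `build_request` separate makes it possible to inspect the URL, headers, and body without any network access.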