Back to Models
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flashApr 27, 20261M context65.5K max output$0.25/M in · $1.50/M outReasoning
Description
Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in above 256K tokens. Prompt caching is supported, with both explicit cache read and cache creation pricing.
Specifications
Provider
qwen
Context Length
1M
Max Output
65.5K
Modality
Intextimagevideo
Outtext
Pricing
| Type | ≤256K | >256K |
|---|---|---|
| Input | $0.25 | $1.00 |
| Output | $1.50 | $4.00 |
| Cache Write | $0.31 | $0.31 |
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
-H "Authorization: Bearer sk-omg-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "qwen/qwen3.6-flash",
"messages": [{"role": "user", "content": "Hello!"}]
}'