Qwen: Qwen3.6 Flash

qwen/qwen3.6-flash
Apr 27, 2026 · 1M context · 65.5K max output · $0.25/M in · $1.50/M out · Reasoning

Description

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in above 256K tokens. Prompt caching is supported, with both explicit cache read and cache creation pricing.

Specifications

Provider: qwen
Context Length: 1M
Max Output: 65.5K
Modality (in): text, image, video
Modality (out): text
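
Since the model accepts image input, a request can mix text and an image in one message. The sketch below assumes the endpoint accepts OpenAI-style content parts (a list of "text" and "image_url" items); that schema is not stated on this page, and the helper name build_image_message and the example URL are hypothetical. It only builds and prints the payload.

```python
import json


def build_image_message(prompt: str, image_url: str) -> dict:
    """Return a user message that pairs a text prompt with one image.

    Assumption: OpenAI-style content parts. Check the provider's API
    reference before relying on this shape.
    """
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }


payload = {
    "model": "qwen/qwen3.6-flash",
    "messages": [
        # Placeholder URL, for illustration only.
        build_image_message("Describe this image.", "https://example.com/cat.png")
    ],
}

print(json.dumps(payload, indent=2))
```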

Pricing

Type (per 1M tokens)   ≤256K   >256K
Input                  $0.25   $1.00
Output                 $1.50   $4.00
Cache Write            $0.31   $0.31
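
To make the two tiers concrete, here is a rough cost estimator built from the input and output rates in the table above. It rests on two assumptions that this page does not confirm: that "256K" means 256,000 prompt tokens, and that a request whose prompt exceeds the threshold is billed entirely at the higher rate (rather than only the tokens above it). Cache pricing is left out. The names estimate_cost, RATES, and TIER_THRESHOLD are illustrative.

```python
# Assumption: "256K" means 256,000 prompt tokens.
TIER_THRESHOLD = 256_000

# USD per 1M tokens, copied from the pricing table.
RATES = {
    "base": {"input": 0.25, "output": 1.50},
    "long": {"input": 1.00, "output": 4.00},
}


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    Assumption: the tier is chosen by prompt size and applies to the
    whole request, input and output alike.
    """
    tier = "long" if input_tokens > TIER_THRESHOLD else "base"
    rate = RATES[tier]
    total = input_tokens * rate["input"] + output_tokens * rate["output"]
    return total / 1_000_000


# A 100K-token prompt falls in the base tier, a 500K-token prompt in the long tier.
print(f"${estimate_cost(100_000, 2_000):.4f}")
print(f"${estimate_cost(500_000, 2_000):.4f}")
```

Under these assumptions, crossing the threshold quadruples the input rate, so trimming a prompt that sits just above 256K tokens can reduce cost by far more than the tokens removed.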

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen/qwen3.6-flash",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
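
The same request can be sent from Python using only the standard library. This is a minimal sketch mirroring the curl call above: the URL, headers, and body come from that call, while the function names build_request and send and the OMINIGATE_API_KEY environment variable are hypothetical. The response is returned as parsed JSON without assuming its structure.

```python
import json
import os
import urllib.request

API_URL = "https://api.ominigate.ai/v1/chat/completions"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build the POST request for a single-turn chat completion."""
    body = json.dumps({
        "model": "qwen/qwen3.6-flash",
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.load(resp)


# Example call (needs network access and a real key):
#   key = os.environ["OMINIGATE_API_KEY"]  # hypothetical variable name
#   print(send(build_request("Hello!", key)))
```

Keeping request construction separate from sending makes the payload easy to inspect or unit-test without network access.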