Back to Models
Qwen: Qwen3.6 35B A3B
qwen/qwen3.6-35b-a3bApr 27, 2026262.1K context65.5K max output$0.16/M in · $0.97/M outReasoning
Description
Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architecture combining Gated DeltaNet linear attention with standard gated attention layers, enabling efficient inference at a fraction of the compute cost. The model supports a 262K token native context window (extensible to 1M via YaRN) and accepts text, image, and video inputs. It includes integrated thinking mode with reasoning traces preserved across multi-turn conversations, function calling, and structured output. Released under the Apache 2.0 license.
Specifications
Provider
qwen
Context Length
262.1K
Max Output
65.5K
Modality
Intextimagevideo
Outtext
Pricing
| Type | Price / 1M tokens |
|---|---|
| Input | $0.16 |
| Output | $0.97 |
| Cache Read | $0.16 |
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
-H "Authorization: Bearer sk-omg-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "qwen/qwen3.6-35b-a3b",
"messages": [{"role": "user", "content": "Hello!"}]
}'