Qwen: Qwen3.6 35B A3B

qwen/qwen3.6-35b-a3b
Apr 27, 2026 · 262.1K context · 65.5K max output · $0.16/M in · $0.97/M out · Reasoning

Description

Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters, of which 3 billion are active per token. It uses a hybrid sparse mixture-of-experts architecture that combines Gated DeltaNet linear attention with standard gated attention layers, so each token is processed at a fraction of the compute cost of a dense 35B model. The model supports a 262K-token native context window (extensible to 1M via YaRN) and accepts text, image, and video inputs. It includes an integrated thinking mode with reasoning traces preserved across multi-turn conversations, function calling, and structured output. Released under the Apache 2.0 license.

Specifications

Provider: qwen
Context Length: 262.1K
Max Output: 65.5K
Modality (in): text, image, video
Modality (out): text
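
The context and output limits listed above can be checked before a request is sent. The following is a minimal sketch, not part of the provider's tooling: the exact token counts (262144 and 65536) are assumptions inferred from the rounded figures 262.1K and 65.5K, the function name fits_budget is hypothetical, and it assumes that prompt and output tokens share the same context window.

```shell
# Assumed exact limits; the page only lists the rounded values 262.1K and 65.5K.
CONTEXT_LIMIT=262144
MAX_OUTPUT=65536

# fits_budget PROMPT_TOKENS OUTPUT_TOKENS
# Exits 0 if the requested output is within the output cap and
# prompt + output fits in the context window (assumed to be shared).
fits_budget() {
  [ "$2" -le "$MAX_OUTPUT" ] && [ $(( $1 + $2 )) -le "$CONTEXT_LIMIT" ]
}

if fits_budget 200000 60000; then
  echo "fits"
else
  echo "too large"
fi
```

If the provider counts output tokens separately from the context window, only the first comparison applies.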

Pricing

Type         Price / 1M tokens
Input        $0.16
Output       $0.97
Cache Read   $0.16
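
As a rough illustration of how these per-million-token prices translate into a per-request cost, the sketch below multiplies token counts by the listed input and output prices. The function name estimate_cost is hypothetical, and the calculation ignores cache reads and any rounding or minimums the provider may apply to billing.

```shell
# Prices in USD per 1M tokens, copied from the pricing table.
INPUT_PRICE=0.16
OUTPUT_PRICE=0.97

# estimate_cost INPUT_TOKENS OUTPUT_TOKENS
# Prints the estimated cost in USD with four decimal places.
estimate_cost() {
  awk -v in_tok="$1" -v out_tok="$2" \
      -v in_price="$INPUT_PRICE" -v out_price="$OUTPUT_PRICE" \
      'BEGIN { printf "%.4f\n", (in_tok * in_price + out_tok * out_price) / 1000000 }'
}

# A 200,000-token prompt with a 4,000-token reply.
estimate_cost 200000 4000   # prints 0.0359
```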

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen/qwen3.6-35b-a3b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
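
For repeated calls, the request above can be wrapped in a small shell helper. This is a sketch under stated assumptions: OMG_API_KEY is a hypothetical environment variable holding your key, the function names build_body and send_chat are made up for illustration, and the max_tokens field assumes the endpoint follows the common OpenAI-style chat completions schema, which this page does not confirm.

```shell
# build_body PROMPT MAX_TOKENS
# Prints the JSON request body. No JSON escaping is performed, so PROMPT
# must not contain double quotes, backslashes, or newlines.
build_body() {
  printf '{"model":"qwen/qwen3.6-35b-a3b","max_tokens":%d,"messages":[{"role":"user","content":"%s"}]}\n' "$2" "$1"
}

# send_chat PROMPT MAX_TOKENS
# Posts the body to the Quick Start endpoint, reading it from stdin.
# max_tokens is an assumed, OpenAI-style parameter.
send_chat() {
  build_body "$1" "$2" | curl -sS https://api.ominigate.ai/v1/chat/completions \
    -H "Authorization: Bearer ${OMG_API_KEY}" \
    -H "Content-Type: application/json" \
    -d @-
}

# Usage (requires network access and a valid key):
#   OMG_API_KEY=sk-omg-your-api-key send_chat "Hello!" 256
```

Keeping body construction separate from the network call makes it possible to inspect the payload with build_body alone before anything is sent.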