Back to Models
Qwen: Qwen3.5-Flash
qwen/qwen3.5-flash-02-23Feb 25, 20261M context65.5K max output$0.10/M in · $0.40/M outReasoning
Description
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.
Specifications
Provider
qwen
Context Length
1M
Max Output
65.5K
Modality
Intextimagevideo
Outtext
Pricing
| Type | Price / 1M tokens |
|---|---|
| Input | $0.10 |
| Output | $0.40 |
| Cache Write | $0.13 |
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
-H "Authorization: Bearer sk-omg-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "qwen/qwen3.5-flash-02-23",
"messages": [{"role": "user", "content": "Hello!"}]
}'