Qwen: Qwen3.5-Flash

qwen/qwen3.5-flash-02-23
Feb 25, 2026 · 1M context · 65.5K max output · $0.10/M in · $0.40/M out · Reasoning

Description

The Qwen3.5 native vision-language Flash models use a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts model to improve inference efficiency. Compared to the Qwen3 series, they deliver a substantial performance gain on both pure-text and multimodal tasks while keeping response times fast.

Specifications

Provider: qwen
Context Length: 1M
Max Output: 65.5K
Modality:
  In: text, image, video
  Out: text

Pricing

Type          Price / 1M tokens
Input         $0.10
Output        $0.40
Cache Write   $0.13
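
To make the per-token prices concrete, here is a minimal sketch that estimates the cost of a single request from its token counts. It uses only the Input and Output prices listed above and ignores Cache Write; the function name `estimate_cost` and the example token counts are illustrative assumptions, not part of the API.

```shell
# Per-million-token prices copied from the pricing list (USD).
INPUT_PRICE=0.10
OUTPUT_PRICE=0.40

# estimate_cost INPUT_TOKENS OUTPUT_TOKENS
# Hypothetical helper: prints the estimated cost in USD, four decimal places.
# Cache Write charges are not included.
estimate_cost() {
  awk -v in_tok="$1" -v out_tok="$2" \
      -v in_price="$INPUT_PRICE" -v out_price="$OUTPUT_PRICE" \
      'BEGIN { printf "%.4f\n", in_tok / 1e6 * in_price + out_tok / 1e6 * out_price }'
}

# 200,000 input tokens and 5,000 output tokens
estimate_cost 200000 5000   # prints 0.0220
```

At these rates, input dominates the bill for long-context requests: the 200K-token prompt above costs $0.02, while the 5K-token reply costs $0.002.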

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen/qwen3.5-flash-02-23",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
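
Since the model accepts image input, a request with an image might look like the sketch below. This page does not document the multimodal request format, so the `content` array with `text` and `image_url` parts is an assumption borrowed from the OpenAI-style chat completions convention that the endpoint path suggests; the helper name `build_image_request` and the example URL are hypothetical as well. Check the provider's API reference before relying on it.

```shell
# build_image_request PROMPT IMAGE_URL
# Hypothetical helper: prints a JSON request body to stdout.
# ASSUMPTION: the endpoint accepts OpenAI-style "image_url" content parts.
# The arguments are inserted verbatim, so they must not contain
# double quotes or backslashes.
build_image_request() {
  cat <<EOF
{
  "model": "qwen/qwen3.5-flash-02-23",
  "messages": [{
    "role": "user",
    "content": [
      {"type": "text", "text": "$1"},
      {"type": "image_url", "image_url": {"url": "$2"}}
    ]
  }]
}
EOF
}

# Usage (not executed here): pipe the body into curl, which reads it from stdin.
# build_image_request "Describe this image." "https://example.com/photo.png" |
#   curl https://api.ominigate.ai/v1/chat/completions \
#     -H "Authorization: Bearer sk-omg-your-api-key" \
#     -H "Content-Type: application/json" \
#     -d @-
```

Building the body in a function keeps the payload separate from the transport, so it can be inspected or validated before any request is sent.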