Qwen: Qwen3 Max
qwen/qwen3-max · Sep 23, 2025 · 262.1K context · 32.8K max output · $1.20/M in · $6.00/M out · Reasoning
Description
Qwen3-Max is an updated release built on the Qwen3 series. Compared to the January 2025 version, it offers major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage. It delivers higher accuracy on math, coding, logic, and science tasks, follows complex instructions in Chinese and English more reliably, hallucinates less, and produces higher-quality responses for open-ended Q&A, writing, and conversation. The model supports over 100 languages with stronger translation and commonsense reasoning, and is optimized for retrieval-augmented generation (RAG) and tool calling. It does not include a dedicated “thinking” mode.
Specifications
Provider
qwen
Context Length
262.1K
Max Output
32.8K
Modality
In: text
Out: text
Pricing
| Type (price per 1M tokens) | ≤32K | >32K to 128K | >128K |
|---|---|---|---|
| Input | $1.20 | $2.40 | $3.00 |
| Output | $6.00 | $12.00 | $15.00 |
| Cache Read | $0.24 | $0.48 | $0.60 |
| Cache Write | $1.50 | $3.00 | $3.75 |
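To make the tiered rates concrete, here is a rough cost estimator in Python. It is a sketch under assumptions the page does not confirm: that the tier is chosen by the prompt (input) token count of a single request, that the whole request is billed at that tier's rates, and that "32K" and "128K" mean 32,000 and 128,000 tokens. The names `TIERS`, `pick_tier`, and `estimate_cost` are hypothetical, and the cache read/write rates are left out for brevity.

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)

# Rates in dollars per 1M tokens, copied from the pricing table.
# Each entry is (upper bound on prompt tokens, rates); None means no upper bound.
# ASSUMPTION: boundaries are 32,000 and 128,000 tokens and apply to prompt size.
TIERS = [
    (32_000, {"input": Decimal("1.20"), "output": Decimal("6.00")}),
    (128_000, {"input": Decimal("2.40"), "output": Decimal("12.00")}),
    (None, {"input": Decimal("3.00"), "output": Decimal("15.00")}),
]


def pick_tier(prompt_tokens: int) -> dict:
    """Return the rate row whose upper bound covers the prompt size."""
    for upper, rates in TIERS:
        if upper is None or prompt_tokens <= upper:
            return rates
    raise RuntimeError("unreachable: the last tier has no upper bound")


def estimate_cost(prompt_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in dollars.

    ASSUMPTION: input and output tokens are both billed at the rates of the
    tier selected by the prompt size.
    """
    if prompt_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    rates = pick_tier(prompt_tokens)
    total = prompt_tokens * rates["input"] + output_tokens * rates["output"]
    return total / MILLION
```

Under these assumptions, `estimate_cost(10_000, 1_000)` works out to (10,000 × 1.20 + 1,000 × 6.00) / 1,000,000 = $0.018. `Decimal` is used instead of `float` so that small per-request amounts add up without rounding drift.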
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen/qwen3-max",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
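The same request can be sketched in Python using only the standard library. The URL, headers, and JSON body mirror the curl command; the helper names `build_request` and `send` are hypothetical, and the response format is not documented on this page, so the code returns the parsed JSON without assuming its shape.

```python
import json
import urllib.request

# Endpoint taken from the curl example.
API_URL = "https://api.ominigate.ai/v1/chat/completions"


def build_request(
    api_key: str, prompt: str, model: str = "qwen/qwen3-max"
) -> urllib.request.Request:
    """Build the POST request that the curl command sends."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send the request and return the decoded JSON body.

    Raises urllib.error.HTTPError on a non-2xx status.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A call would look like `send(build_request("sk-omg-your-api-key", "Hello!"))`. The `/v1/chat/completions` path suggests an OpenAI-style response, in which case the reply text would sit under `choices[0].message.content`, but that is an assumption worth checking against an actual response.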