Back to Models

Arcee AI: Maestro Reasoning

arcee-ai/maestro-reasoning
May 5, 2025131.1K context32K max output$0.90/M in · $3.30/M out

Description

Maestro Reasoning is Arcee's flagship analysis model: a 32 B‑parameter derivative of Qwen 2.5‑32 B tuned with DPO and chain‑of‑thought RL for step‑by‑step logic. Compared to the earlier 7 B preview, the production 32 B release widens the context window to 128 k tokens and doubles pass‑rate on MATH and GSM‑8K, while also lifting code completion accuracy. Its instruction style encourages structured "thought → answer" traces that can be parsed or hidden according to user preference. That transparency pairs well with audit‑focused industries like finance or healthcare where seeing the reasoning path matters. In Arcee Conductor, Maestro is automatically selected for complex, multi‑constraint queries that smaller SLMs bounce.

Specifications

Provider
arcee-ai
Context Length
131.1K
Max Output
32K
Modality
Intext
Outtext

Pricing

TypePrice / 1M tokens
Input$0.90
Output$3.30

Quick Start

curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "arcee-ai/maestro-reasoning",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
Arcee AI: Maestro Reasoning — Pricing & API on OminiGate