Back to Models
NVIDIA: Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2Sep 5, 2025131.1K context$0.04/M in · $0.16/M outReasoning
Description
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response.
The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.
Specifications
Provider
nvidia
Context Length
131.1K
Max Output
—
Modality
Intext
Outtext
Pricing
| Type | Price / 1M tokens |
|---|---|
| Input | $0.04 |
| Output | $0.16 |
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
-H "Authorization: Bearer sk-omg-your-api-key" \
-H "Content-Type: application/json" \
-d '{
"model": "nvidia/nemotron-nano-9b-v2",
"messages": [{"role": "user", "content": "Hello!"}]
}'