Google: Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview · Mar 3, 2026 · 1.0M context · 65.5K max output · $0.25/M in · $1.50/M out · Reasoning
Description
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. The model supports the full range of thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs, and it is priced at half the cost of Gemini 3 Flash.
Specifications
Provider
google
Context Length
1.0M
Max Output
65.5K
Modality
In: text, image, video, file, audio
Out: text
Pricing
| Type | Price / 1M tokens |
|---|---|
| Input | $0.25 |
| Output | $1.50 |
| Cache Read | $0.03 |
| Cache Write | $0.08 |
| Audio Input | $0.50 |
| Audio Cache | $0.05 |
| Reasoning | $1.50 |
| Image Input | $0.25 |
| Web Search | $0.01 / call |
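To make the per-token rates concrete, here is a minimal sketch of a cost estimate using the input, output, reasoning, and web search prices from the table. The function name `estimate_cost` and its parameters are hypothetical, and the sketch assumes the line items simply add up; how reasoning and cached tokens are actually metered on an invoice is not stated here, so treat the result as a rough estimate.

```python
from decimal import Decimal

# USD per 1M tokens, copied from the pricing table.
PRICE_PER_MILLION = {
    "input": Decimal("0.25"),
    "output": Decimal("1.50"),
    "reasoning": Decimal("1.50"),
}
WEB_SEARCH_PER_CALL = Decimal("0.01")  # USD per web search call
MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, reasoning_tokens=0, web_search_calls=0):
    """Return an estimated request cost in USD as a Decimal.

    Assumption: each line item is billed independently and summed.
    Cache and audio rates are left out to keep the sketch short.
    """
    token_cost = (
        input_tokens * PRICE_PER_MILLION["input"]
        + output_tokens * PRICE_PER_MILLION["output"]
        + reasoning_tokens * PRICE_PER_MILLION["reasoning"]
    ) / MILLION
    return token_cost + web_search_calls * WEB_SEARCH_PER_CALL
```

For example, `estimate_cost(200_000, 10_000)` adds $0.05 of input to $0.015 of output, for about $0.065 in total.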
Quick Start
curl https://api.ominigate.ai/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "google/gemini-3.1-flash-lite-preview",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
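The same request can be sketched in Python with only the standard library. The URL, headers, and JSON body mirror the curl command; the helper names `build_request` and `send` are hypothetical, and the shape of the response is not documented here, so the sketch returns the parsed JSON without assuming any particular fields.

```python
import json
import urllib.request

# Endpoint and model ID taken from the curl example.
API_URL = "https://api.ominigate.ai/v1/chat/completions"
MODEL_ID = "google/gemini-3.1-flash-lite-preview"


def build_request(api_key, prompt):
    """Build (but do not send) a POST request equivalent to the curl call."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request):
    """Send the request and return the parsed JSON body.

    Assumption: the endpoint replies with a JSON document.
    """
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `send(build_request("sk-omg-your-api-key", "Hello!"))` with a real key in place of the placeholder performs the request; building and sending are kept separate so the payload can be inspected without touching the network.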