Description
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. The model supports the full set of thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs, and it is priced at half the cost of Gemini 3 Flash.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. Model name: Google: Gemini 3.1 Flash Lite Preview. Model ID: google/gemini-3.1-flash-lite-preview
curl https://api.ridvay.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "google/gemini-3.1-flash-lite-preview",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the Google: Gemini 3.1 Flash Lite Preview model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'
Supported Modalities
- Text
- Images
- Audio
- Video
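Because the endpoint is described as OpenAI-compatible, a mixed text-and-image request can presumably be expressed with OpenAI-style content parts. The sketch below only builds such a request using Python's standard library. The `image_url` content-part shape follows the OpenAI chat format and is an assumption here: this page does not state how the endpoint accepts image, audio, or video input. The helper name `build_request` and the example image URL are hypothetical.

```python
import json
import urllib.request

API_URL = "https://api.ridvay.com/v1/chat/completions"
MODEL_ID = "google/gemini-3.1-flash-lite-preview"


def build_request(api_key: str, prompt: str, image_url: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request with text and one image."""
    payload = {
        "model": MODEL_ID,
        "messages": [
            {
                "role": "user",
                # Assumption: OpenAI-style content parts are accepted for images.
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
        "max_tokens": 1024,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


if __name__ == "__main__":
    request = build_request(
        "YOUR_API_KEY", "Describe this image.", "https://example.com/cat.png"
    )
    # Sending requires a valid key; uncomment to perform the call.
    # with urllib.request.urlopen(request) as response:
    #     print(json.load(response)["choices"][0]["message"]["content"])
    print(request.full_url)
```

Keeping request construction separate from sending makes the payload easy to inspect before any tokens are billed.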
API Pricing
- Input: $0.25 / 1M tokens
- Output: $1.50 / 1M tokens
- Image: $0 / image
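To make the per-million-token rates concrete, here is a small sketch that estimates the cost of a single request from the listed input and output prices. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes billing is strictly linear in token counts with no minimums, rounding, or tier discounts, none of which this page specifies.

```python
from decimal import Decimal

# Rates taken from the pricing list above (USD per 1M tokens).
INPUT_USD_PER_MILLION = Decimal("0.25")
OUTPUT_USD_PER_MILLION = Decimal("1.5")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate request cost in USD, assuming purely linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # 10,000 prompt tokens plus 1,024 completion tokens.
    print(estimate_cost_usd(10_000, 1_024))
```

`Decimal` is used instead of floats so that small per-request amounts add up exactly when summed over many calls.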
Token Limits
- Max Output: 65,536 tokens
- Max Context: 1,048,576 tokens
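A client can use these limits to pick a safe `max_tokens` value before sending a request. The sketch below assumes the context limit covers prompt and output tokens combined, which is a common convention but is not stated on this page. The helper `clamp_max_tokens` is hypothetical, and the prompt token count is expected to come from whatever tokenizer or estimate the caller has available.

```python
# Limits taken from the token limits list above.
MAX_OUTPUT_TOKENS = 65_536
MAX_CONTEXT_TOKENS = 1_048_576


def clamp_max_tokens(prompt_tokens: int, requested: int) -> int:
    """Return the largest usable max_tokens value not exceeding `requested`.

    Assumption: prompt and output tokens share the same context window.
    """
    if requested < 1:
        raise ValueError("requested must be at least 1")
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    if prompt_tokens >= MAX_CONTEXT_TOKENS:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = MAX_CONTEXT_TOKENS - prompt_tokens
    return min(requested, MAX_OUTPUT_TOKENS, remaining)


if __name__ == "__main__":
    print(clamp_max_tokens(prompt_tokens=1_000, requested=100_000))
```

Clamping on the client side avoids a round trip that would otherwise be rejected or truncated by the server.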
Subscription Tiers
- free
- pro
- ultimate