Gemini 3.1 Flash Lite logo

Gemini 3.1 Flash Lite

Chat

by Google

Overview

Most cost-efficient model in the Gemini 3 family, offering excellent performance at a fraction of the cost. Optimized for high throughput and cost-sensitive deployments with 1M token context window and configurable thinking capabilities.

Key Features
  • Most cost-efficient model in the Gemini 3 family at $0.25 input, $1.50 output per 1M tokens
  • 1M token context window with full multimodal support for text, audio, images, video, and documents
  • Optimized for high throughput and low-latency applications requiring speed and cost efficiency
  • Configurable thinking capabilities for flexible reasoning depth based on task complexity
  • Full tool support including Search Grounding, Code Execution, and Function Calling
Input Capabilities
Text
Audio
Image
Video
Document
Available Tools

Web Search

Search the web for current info.

Code Execution

Run Python code in a sandbox.

JSON Mode

Output responses in valid JSON format.

URL Context

Fetch and process content from URLs.

Google Maps

Ground responses in real-time Google Maps data for location-aware queries.

Image Generator

Generate images from text prompts.

10k FREE Credits50+ AI Models

Start Building with AI Today

Join thousands of developers using our unified platform to access 50+ premium AI models without multiple subscriptions.

OpenAI
Anthropic
Gemini
Grok
Meta
Runway
DeepMind
DeepSeek
Ideogram
ElevenLabs
Stability
Perplexity
Recraft