Gemini 3.1 Flash Lite

Chat

by Google

Overview

The most cost-efficient model in the Gemini 3 family. It is optimized for high-throughput, cost-sensitive deployments and offers a 1M-token context window and configurable thinking.

Key Features

Priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens
1M-token context window with full multimodal support for text, audio, images, video, and documents
Optimized for high-throughput, low-latency applications
Configurable thinking, so reasoning depth can be matched to task complexity
Full tool support, including Search Grounding, Code Execution, and Function Calling
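
The per-token prices in the feature list can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost_usd` is hypothetical, and it assumes billing is a simple linear function of input and output token counts at the listed rates. How thinking tokens, caching, or tool calls are billed is not stated on this page and is not modeled.

```python
# Prices quoted in the feature list, in USD per 1M tokens.
INPUT_USD_PER_MILLION = 0.25
OUTPUT_USD_PER_MILLION = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Rough request cost, assuming purely linear per-token billing (an assumption)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 4,000-token reply.
    print(f"${estimate_cost_usd(200_000, 4_000):.4f}")
```

Under these assumptions, output tokens cost six times as much as input tokens, so long generations dominate the bill sooner than long prompts do.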

Input Capabilities

Text

Audio

Image

Video

Document

Available Tools

Web Search

Search the web for current info.

Code Execution

Run Python code in a sandbox.

JSON Mode

Output responses in valid JSON format.

URL Context

Fetch and process content from URLs.

Google Maps

Ground responses in real-time Google Maps data for location-aware queries.

Image Generator

Generate images from text prompts.

Technical Specifications

Context Window

1,048,576 tokens

Max Output

65,536 tokens
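
The two limits above (1,048,576 = 2^20 and 65,536 = 2^16) can be checked before a request is sent. This is a minimal sketch with hypothetical names (`check_request`), and it assumes the context window bounds the prompt while the output cap applies separately. This page does not say whether output tokens also count against the context window, so treat that as an assumption.

```python
# Limits from the Technical Specifications section, interpreted as token counts.
CONTEXT_WINDOW_TOKENS = 1_048_576  # 2**20
MAX_OUTPUT_TOKENS = 65_536         # 2**16


def check_request(prompt_tokens: int, requested_output_tokens: int) -> dict:
    """Check a prompt against the context window and clamp the output request.

    Assumption: the two limits are independent of each other.
    """
    return {
        "prompt_fits": prompt_tokens <= CONTEXT_WINDOW_TOKENS,
        "output_tokens": min(requested_output_tokens, MAX_OUTPUT_TOKENS),
    }


if __name__ == "__main__":
    print(check_request(prompt_tokens=900_000, requested_output_tokens=100_000))
```

Token counts depend on the model's tokenizer, so in practice `prompt_tokens` would come from a token-counting call rather than from character or word counts.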

Model Information

Provider: Google

Model Code: gemini-3.1-flash-lite

Category: Chat

File Support

Max Files

Max Size

50MB

Supported Formats

.pdf

.txt

.html

.htm

.css

.js

+29 more
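
A client could pre-check uploads against the limits listed above before sending them. The sketch below is an assumption-laden illustration: `precheck_upload` is a hypothetical helper, "50MB" is interpreted as 50 MiB, and only the six extensions named here are known. Because 29 further formats are not listed on this page, an unlisted extension is reported as a warning rather than treated as a definite rejection.

```python
from pathlib import Path

# Assumption: "50MB" is read as 50 MiB; the page does not define the unit.
MAX_FILE_BYTES = 50 * 1024 * 1024

# Only the extensions named on this page; 29 more exist but are not listed.
LISTED_EXTENSIONS = {".pdf", ".txt", ".html", ".htm", ".css", ".js"}


def precheck_upload(filename: str, size_bytes: int) -> list[str]:
    """Return a list of potential problems; an empty list means none were found."""
    problems = []
    extension = Path(filename).suffix.lower()
    if extension not in LISTED_EXTENSIONS:
        problems.append(
            f"extension {extension!r} is not among the listed formats "
            "(it may still be one of the unlisted ones)"
        )
    if size_bytes > MAX_FILE_BYTES:
        problems.append(f"file is {size_bytes} bytes, above the {MAX_FILE_BYTES}-byte limit")
    return problems


if __name__ == "__main__":
    for name, size in [("report.PDF", 1_024), ("clip.mp4", 60 * 1024 * 1024)]:
        print(name, precheck_upload(name, size))
```

The maximum file count is not given in this listing, so the sketch does not enforce one.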

Other Google Models

Gemini 3.1 Pro

Google's most advanced reasoning model, with configurable thinking levels (minimal/low/medium/high) to balance cost and performance. It succeeds Gemini 3 Pro and adds enhanced reasoning and tool capabilities, including Google Maps grounding.

Gemini 3.5 Flash

Google's most intelligent stable model for coding tasks in the Gemini 3 family. Gemini 3.5 Flash combines frontier-class intelligence with Flash-tier speed, making it ideal for complex coding workflows and agentic tasks at a cost-effective price.

Gemini 3 Flash

Google's fastest frontier AI model built for speed at a fraction of the cost. Outperforms Gemini 2.5 Pro while being 3x faster. Excels at agentic coding (78% SWE-bench) and PhD-level reasoning (90.4% GPQA Diamond).