Gemini 2.5 Flash Lite

Chat

Deprecated

by Google

This model has been deprecated by Google and is no longer available for use. Please explore other available models below.

View All Google Models

Overview

Most cost-efficient model in the 2.5 family released August 2025, optimized for high throughput and cost-sensitive deployments. Maintains 2.5-level quality while delivering superior performance compared to 2.0 Flash-Lite across all benchmarks.

Key Features

Most cost-efficient model in Gemini 2.5 family at $0.10 input, $0.40 output per 1M tokens
Superior performance compared to 2.0 Flash-Lite across all key benchmarks with enhanced capabilities
Speed optimized with lower latency than both 2.0 Flash-Lite and standard Flash models
Controllable thinking capabilities available but disabled by default for maximum speed
Enterprise-ready with full Google Search grounding and comprehensive tool integration support

Input Capabilities

Text

Audio

Image

Video

Document

Available Tools

Web Search

Search the web for current info.

Code Execution

Run Python code in a sandbox.

JSON Mode

Output responses in valid JSON format.

URL Context

Fetch and process content from URLs.

Google Maps

Ground responses in real-time Google Maps data for location-aware queries.

Image Generator

Generate images from text prompts.

Technical Specifications

Context Window

1,048,576

Max Output

65,536

Model Information

Provider: Google

Model Code: gemini-2.5-flash-lite

Category: Chat

File Support

Max Files

Max Size

50MB

Supported Formats

.png

.jpeg

.jpg

.webp

.heic

.heif

+29 more

Other Google Models

Gemini 3.1 Pro

Google's most advanced reasoning model (Gemini 3.1 Pro) with configurable thinking levels (minimal/low/medium/high) for optimal cost-performance balance. Successor to Gemini 3 Pro with enhanced reasoning and tool capabilities including Google Maps grounding.

Gemini 3.5 Flash

Google's most intelligent stable model for coding tasks in the Gemini 3 family. Gemini 3.5 Flash combines frontier-class intelligence with Flash-tier speed, making it ideal for complex coding workflows and agentic tasks at a cost-effective price.

Gemini 3 Flash

Google's fastest frontier AI model built for speed at a fraction of the cost. Outperforms Gemini 2.5 Pro while being 3x faster. Excels at agentic coding (78% SWE-bench) and PhD-level reasoning (90.4% GPQA Diamond).