Most cost-efficient model in the 2.5 family released August 2025, optimized for high throughput and cost-sensitive deployments. Maintains 2.5-level quality while delivering superior performance compared to 2.0 Flash-Lite across all benchmarks.
- Most cost-efficient model in Gemini 2.5 family at $0.10 input, $0.40 output per 1M tokens
- Superior performance compared to 2.0 Flash-Lite across all key benchmarks with enhanced capabilities
- Speed optimized with lower latency than both 2.0 Flash-Lite and standard Flash models
- Controllable thinking capabilities available but disabled by default for maximum speed
- Enterprise-ready with full Google Search grounding and comprehensive tool integration support
Web Search
Search the web for current info.
Image Generator
Generate images from text prompts.
Model Information
Supported Formats
Google's most advanced reasoning model released 2025, featuring configurable thinking levels (low/high) for optimal cost-performance balance. Excels at complex tasks requiring broad knowledge and advanced reasoning with 1M token context window.
Google's fastest frontier AI model built for speed at a fraction of the cost. Outperforms Gemini 2.5 Pro while being 3x faster. Excels at agentic coding (78% SWE-bench) and PhD-level reasoning (90.4% GPQA Diamond).
Most advanced thinking model released March 2025, topping LMArena leaderboard with state-of-the-art reasoning capabilities. Features enhanced coding performance with 86.7% on AIME 2025 and 84% on GPQA Diamond, plus configurable thinking budgets up to 32K tokens.