Google AI Models
17 models across 3 categories
Chat
7 models
Input Capabilities
Google's most advanced reasoning model released 2025, featuring configurable thinking levels (low/high) for optimal cost-performance balance. Excels at complex tasks requiring broad knowledge and advanced reasoning with 1M token context window.
Input Capabilities
Google's fastest frontier AI model built for speed at a fraction of the cost. Outperforms Gemini 2.5 Pro while being 3x faster. Excels at agentic coding (78% SWE-bench) and PhD-level reasoning (90.4% GPQA Diamond).
Input Capabilities
Most advanced thinking model released March 2025, topping LMArena leaderboard with state-of-the-art reasoning capabilities. Features enhanced coding performance with 86.7% on AIME 2025 and 84% on GPQA Diamond, plus configurable thinking budgets up to 32K tokens.
Input Capabilities
Best price-performance model with hybrid reasoning capabilities for large-scale processing and agentic use cases. Offers well-rounded capabilities with thinking features optimized for low-latency, high-volume tasks requiring both speed and intelligence.
Input Capabilities
Most cost-efficient model in the 2.5 family released August 2025, optimized for high throughput and cost-sensitive deployments. Maintains 2.5-level quality while delivering superior performance compared to 2.0 Flash-Lite across all benchmarks.
Input Capabilities
Next-generation multimodal model released February 2025, built for the agentic era with native tool use and enhanced capabilities. Delivers significant performance improvements over 1.5 series with superior speed and 1M token context window.
Input Capabilities
Ultra-cost-effective variant of Gemini 2.0 Flash optimized for maximum efficiency and low latency deployment. Delivers Gemini 2.0 capabilities with the most competitive pricing for high-volume applications requiring speed over maximum intelligence.
Image Generation
5 models
Input Capabilities
Google's most advanced image generation model from the Gemini 3 family - ideal for high-quality image generation and editing tasks requiring superior detail and understanding
Input Capabilities
Gemini's native image generation model (Nano Banana) offering unique conversational editing capabilities and multimodal understanding - ideal for complex creative workflows requiring iterative refinement, image editing, composition, and style transfer
Input Capabilities
Best-in-class text-to-image model designed for wide range of creative tasks with exceptional text rendering and 2K resolution support - ideal for marketing assets, artistic compositions, and professional visual content creation
Input Capabilities
Premium precision model for users requiring exact prompt adherence and highest detail fidelity - perfect for professional design work, detailed artistic projects, and applications demanding strict creative control
Input Capabilities
Cost-effective high-quality model offering excellent balance of performance and pricing with strong prompt adherence - ideal for general creative applications, presentations, and diverse artistic styles without requiring complex prompt engineering
Video Generation
5 models
Input Capabilities
Premium flagship model with cutting-edge video generation capabilities - ideal for cinematic productions, storytelling, and professional content with frame control and reference image support
Input Capabilities
Fast variant of Veo 3.1 optimized for speed and cost - perfect for business applications, advertising content, and high-volume video generation requirements
Input Capabilities
Premium flagship model with revolutionary native audio generation - ideal for cinematic productions, storytelling, and professional content requiring synchronized audiovisual experiences
Input Capabilities
Speed-optimized variant maintaining Veo 3.0 quality at reduced cost - perfect for business applications, advertising content, and high-volume video generation requirements
Input Capabilities
Versatile model offering flexible duration and aspect ratio options - suitable for social media content, marketing videos, and general-purpose video generation with reliable quality