Fastest model that surpasses Claude 3 Opus intelligence on many benchmarks while maintaining speed and cost efficiency. Achieves 40.6% on SWE-bench Verified despite being optimized for rapid responses and high-volume applications.
- Superior intelligence outperforming Claude 3 Opus on numerous benchmarks while maintaining speed
- Exceptional coding capabilities for a compact model with 40.6% SWE-bench Verified performance
- Near-instant responses optimized for real-time applications and user-facing products
- Enhanced tool use and instruction following compared to predecessors
- Cost-effective solution ideal for high-volume data processing and automated workflows
Web Search
Search the web for current info.
Model Information
Supported Formats
State-of-the-art coding model released September 2025, achieving 77.2% on SWE-bench Verified (82% with high compute). Excels at autonomous multi-step tasks for 30+ hours with enhanced tool coordination, context management, and computer use capabilities. Most aligned frontier model with improved security against prompt injection.
Fast, cost-efficient model released October 2025, achieving 73.3% on SWE-bench Verified. Delivers performance comparable to Claude Sonnet 4 at one-third the cost and more than twice the speed. First Haiku model with extended thinking, computer use, and context awareness capabilities. ASL-2 safety classification with lower misalignment rates than larger models.
Premium model released November 2025, combining maximum intelligence with practical performance. Features 200K token context window with 64K max output, extended thinking support, and priority tier access. Knowledge cutoff March 2025 with training data through August 2025. Offers the best balance of intelligence and efficiency for complex reasoning tasks.