Gemini 3 Flash
FastestGoogle's speed-optimized model that closes surprising ground on intelligence. Released December 2025, Gemini 3 Flash scores 35 on the Artificial Analysis Intelligence Index — higher than several models that cost five to ten times more per token — while running at 170 tokens per second. At $0.50/$3.00 per 1M, it's genuinely cheap for high-volume API use. The 1M token context window and native video/audio/image input make it the practical go-to for multimodal pipelines that need throughput without paying Gemini 3 Pro prices.
Context window
1.0M tokens
API (blended)
$1.13/1M
Consumer access
Free
Multimodal
Yes
Strengths
- +170.2 t/s — the fastest model in this comparison by a wide margin
- +AA Intelligence Index 35 at $1.13/1M blended — exceptional price-to-performance
- +1M token context window, same as Gemini 3 Pro
- +Native multimodal: text, image, audio, video in a single API
- +Free via gemini.google.com with no hard usage cap
Weaknesses
- -AA Index 35 is notably below Gemini 3 Pro (48.44) — real capability gap for complex reasoning tasks
- -Prompts over 200K tokens billed at 2× — 1M context can get expensive at full capacity
- -Writing quality and nuance below Gemini 3 Pro and Claude
Best for
Not ideal for
Pricing details
Subscription plans
API pricing
Prices verified February 2026. LLM pricing changes frequently — verify at the provider's site before budgeting.
Last updated: February 2026