← Back to Calculator

Gemini 3.5 Flash vs Gemini Flash Lite

Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-06-29.

Cost at a Glance

Gemini 3.5 Flash is 3.3× more expensive than Gemini Flash Lite — switching could save 69.4% on your monthly API bill.

At 5M input + 2M output tokens per month:

Gemini 3.5 Flash$4.25/movsGemini Flash Lite$1.30/mo

Choosing Gemini Flash Lite could save you $2.95/mo

Price Comparison (per 1M tokens)

Input (per 1M tokens)
$0.25vs$0.10
Output (per 1M tokens)
$1.50vs$0.40
Cache Write (per 1M tokens)
vs
Cache Read (per 1M tokens)
$0.03vs$0.01
Batch Input (per 1M tokens)
vs
Batch Output (per 1M tokens)
vs

Prices are per 1 million tokens. Actual costs depend on your monthly token volume. Use the calculator →

Specifications

Context Window
1Mvs1M
Max Output Tokens
64Kvs64K
Knowledge Cutoff
2026-04vs2026-04
Modalities
Text, ImagevsText
Prompt Caching
YesvsYes
Batch API
NovsNo
Thinking / Reasoning
NovsNo

Benchmark Scores

SWE-bench Verified
65.0vs45.0
MMLU-Pro
80.0vs72.0
HumanEval
90.0vs82.0

Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.

Use Cases

Gemini 3.5 Flash

fastcheapmultimodal

Gemini Flash Lite

cheapestfastsimple

About the Providers

Gemini 3.5 Flash is developed by Google. Gemini Flash Lite is developed by Google. Both models are from the same provider, representing different tiers in the Google model lineup. For the most up-to-date pricing, always check the official API documentation from each provider.

Compare More AI Models