← Back to Calculator

GPT-5.4 vs Gemini 3.5 Flash

Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-06-29.

Cost at a Glance

GPT-5.4 is 10× more expensive than Gemini 3.5 Flash — switching could save 90% on your monthly API bill.

At 5M input + 2M output tokens per month:

GPT-5.4$42.50/movsGemini 3.5 Flash$4.25/mo

Choosing Gemini 3.5 Flash could save you $38.25/mo

Price Comparison (per 1M tokens)

Input (per 1M tokens)
$2.50vs$0.25
Output (per 1M tokens)
$15.00vs$1.50
Cache Write (per 1M tokens)
$3.13vs
Cache Read (per 1M tokens)
$0.25vs$0.03
Batch Input (per 1M tokens)
$1.25vs
Batch Output (per 1M tokens)
$7.50vs

Prices are per 1 million tokens. Actual costs depend on your monthly token volume. Use the calculator →

Specifications

Context Window
1Mvs1M
Max Output Tokens
128Kvs64K
Knowledge Cutoff
2025-12vs2026-04
Modalities
Text, ImagevsText, Image
Prompt Caching
YesvsYes
Batch API
YesvsNo
Thinking / Reasoning
NovsNo

Benchmark Scores

SWE-bench Verified
78.0vs65.0
MMLU-Pro
84.5vs80.0
HumanEval
93.0vs90.0

Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.

Use Cases

GPT-5.4

codinggeneralbalanced

Gemini 3.5 Flash

fastcheapmultimodal

About the Providers

GPT-5.4 is developed by OpenAI. Gemini 3.5 Flash is developed by Google. This is a cross-provider comparison between OpenAI and Google models. For the most up-to-date pricing, always check the official API documentation from each provider.

Compare More AI Models