Gemini 3.5 Flash vs Llama 4 Maverick

Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-06-29.

Cost at a Glance

Llama 4 Maverick is 1.9× more expensive than Gemini 3.5 Flash (46.9% savings if you switch).

At 5M input + 2M output tokens per month:

Gemini 3.5 Flash$4.25/movsLlama 4 Maverick$8.00/mo

Choosing Gemini 3.5 Flash could save you $3.75/mo

Price Comparison (per 1M tokens)

Metric	Gemini 3.5 Flash	Llama 4 Maverick
Input (per 1M tokens)	$0.25	$0.80
Output (per 1M tokens)	$1.50	$2.00
Cache Write (per 1M tokens)	—	—
Cache Read (per 1M tokens)	$0.03	—
Batch Input (per 1M tokens)	—	—
Batch Output (per 1M tokens)	—	—

Input (per 1M tokens)

$0.25vs$0.80

Output (per 1M tokens)

$1.50vs$2.00

Cache Write (per 1M tokens)

—vs—

Cache Read (per 1M tokens)

$0.03vs—

Batch Input (per 1M tokens)

—vs—

Batch Output (per 1M tokens)

—vs—

Prices are per 1 million tokens. Actual costs depend on your monthly token volume. Use the calculator →

Specifications

Spec	Gemini 3.5 Flash	Llama 4 Maverick
Context Window	1M	1M
Max Output Tokens	64K	64K
Knowledge Cutoff	2026-04	2026-03
Modalities	Text, Image	Text, Image
Prompt Caching	Yes	No
Batch API	No	No
Thinking / Reasoning	No	No

Context Window

1Mvs1M

Max Output Tokens

64Kvs64K

Knowledge Cutoff

2026-04vs2026-03

Modalities

Text, ImagevsText, Image

Prompt Caching

YesvsNo

Batch API

NovsNo

Thinking / Reasoning

NovsNo

Benchmark Scores

Benchmark	Gemini 3.5 Flash	Llama 4 Maverick
SWE-bench Verified	65.0	72.0
MMLU-Pro	80.0	81.0
HumanEval	90.0	89.0

SWE-bench Verified

65.0vs72.0

MMLU-Pro

80.0vs81.0

HumanEval

90.0vs89.0

Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.

Use Cases

Gemini 3.5 Flash

fastcheapmultimodal

Llama 4 Maverick

generalopen-weightmultimodal

About the Providers

Gemini 3.5 Flash is developed by Google. Llama 4 Maverick is developed by Meta. This is a cross-provider comparison between Google and Meta models. For the most up-to-date pricing, always check the official API documentation from each provider.

Gemini 3.5 Flash vs Llama 4 Maverick

Price Comparison (per 1M tokens)

Specifications

Benchmark Scores

Use Cases

About the Providers

Compare More AI Models