
Gemini Flash Lite vs Llama 4 Maverick

Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-06-29.

Cost at a Glance

Llama 4 Maverick costs about 6.2× as much as Gemini Flash Lite at the usage level below, so switching could save about 83.8% on your monthly API bill.

At 5M input + 2M output tokens per month:

Gemini Flash Lite: $1.30/mo vs Llama 4 Maverick: $8.00/mo

Choosing Gemini Flash Lite could save you $6.70/mo
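
The monthly figures above follow directly from the per-token prices listed on this page. A minimal sketch of that arithmetic, assuming simple linear billing with no caching, batch discounts, or free tiers (the function and dictionary names are illustrative, not part of any provider SDK):

```python
# Prices in USD per 1M tokens, copied from the price comparison on this page.
PRICES = {
    "Gemini Flash Lite": {"input": 0.10, "output": 0.40},
    "Llama 4 Maverick": {"input": 0.80, "output": 2.00},
}


def monthly_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the monthly cost in USD, assuming plain per-token billing."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# The page's example workload: 5M input + 2M output tokens per month.
cheap = monthly_cost("Gemini Flash Lite", 5_000_000, 2_000_000)
pricey = monthly_cost("Llama 4 Maverick", 5_000_000, 2_000_000)

ratio = pricey / cheap                          # how many times more expensive
savings_usd = pricey - cheap                    # absolute monthly savings
savings_pct = 100 * savings_usd / pricey        # savings relative to the pricier model

print(f"{cheap:.2f} vs {pricey:.2f} USD/mo")
print(f"ratio {ratio:.2f}x, saves {savings_usd:.2f} USD/mo ({savings_pct:.2f}%)")
```

The ratio comes out near 6.15 and the savings near 83.75%, which the page rounds to 6.2× and 83.8%. Changing the token counts shows how the gap scales with your own volume.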

Price Comparison (per 1M tokens)

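Values are listed as Gemini Flash Lite vs Llama 4 Maverick.

Input (per 1M tokens): $0.10 vs $0.80
Output (per 1M tokens): $0.40 vs $2.00
Cache Write (per 1M tokens): not listed vs not listed
Cache Read (per 1M tokens): $0.01 vs not listed
Batch Input (per 1M tokens): not listed vs not listed
Batch Output (per 1M tokens): not listed vs not listed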

Prices are per 1 million tokens. Actual costs depend on your monthly token volume.
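
Only Gemini Flash Lite lists a cache read price ($0.01 per 1M tokens), so prompt caching can lower its bill further. The sketch below is a rough estimate under stated assumptions: a fixed fraction of input tokens is billed at the cache read rate instead of the input rate, and cache write and storage charges are ignored because this page lists no price for them. How a provider actually bills cached tokens may differ, and `cost_with_cache` is a hypothetical helper name.

```python
# Gemini Flash Lite prices in USD per 1M tokens, as listed on this page.
INPUT_PRICE = 0.10
CACHE_READ_PRICE = 0.01
OUTPUT_PRICE = 0.40


def cost_with_cache(input_tokens: int, output_tokens: int,
                    cached_fraction: float) -> float:
    """Estimate monthly cost in USD when part of the input is served from cache.

    Assumption: cached input tokens are billed at the cache read rate and
    nothing else; cache write and storage fees are not modeled.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    total = (fresh * INPUT_PRICE
             + cached * CACHE_READ_PRICE
             + output_tokens * OUTPUT_PRICE)
    return total / 1_000_000


# Same workload as the page's example, with half the input assumed cached.
no_cache = cost_with_cache(5_000_000, 2_000_000, 0.0)
half_cached = cost_with_cache(5_000_000, 2_000_000, 0.5)
print(f"{no_cache:.3f} USD/mo without caching, {half_cached:.3f} USD/mo at 50% cached")
```

Because output tokens dominate this workload's cost, even a high cache hit rate trims the total only modestly; input-heavy workloads would see a larger effect.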

Specifications

Context Window: 1M vs 1M
Max Output Tokens: 64K vs 64K
Knowledge Cutoff: 2026-04 vs 2026-03
Modalities: Text vs Text, Image
Prompt Caching: Yes vs No
Batch API: No vs No
Thinking / Reasoning: No vs No

Benchmark Scores

SWE-bench Verified: 45.0 vs 72.0
MMLU-Pro: 72.0 vs 81.0
HumanEval: 82.0 vs 89.0

Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.

Use Cases

Gemini Flash Lite

cheapest, fast, simple

Llama 4 Maverick

general, open-weight, multimodal

About the Providers

Gemini Flash Lite is developed by Google. Llama 4 Maverick is developed by Meta. This is a cross-provider comparison between Google and Meta models. For the most up-to-date pricing, always check the official API documentation from each provider.
