← Back to Calculator

Gemini 2.5 Flash-Lite vs Llama 4 Scout

Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-07-01.

Cost at a Glance

Gemini 2.5 Flash-Lite is 1.2× more expensive than Llama 4 Scout (15.4% savings if you switch).

At 5M input + 2M output tokens per month:

Gemini 2.5 Flash-Lite$1.30/movsLlama 4 Scout$1.10/mo

Choosing Llama 4 Scout could save you $0.20/mo

Price Comparison (per 1M tokens)

Input (per 1M tokens)
$0.10vs$0.10
Output (per 1M tokens)
$0.40vs$0.30
Cache Write (per 1M tokens)
vs
Cache Read (per 1M tokens)
$0.01vs
Batch Input (per 1M tokens)
vs
Batch Output (per 1M tokens)
vs

Prices are per 1 million tokens. Actual costs depend on your monthly token volume. Use the calculator →

Specifications

Context Window
1Mvs10M
Max Output Tokens
64Kvs16K
Knowledge Cutoff
2026-04vs2025-04
Modalities
TextvsText, Image
Prompt Caching
YesvsNo
Batch API
NovsNo
Thinking / Reasoning
NovsNo

Benchmark Scores

SWE-bench Verified
45.0vs
MMLU-Pro
72.0vs74.3
HumanEval
82.0vs32.8

Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.

Use Cases

Gemini 2.5 Flash-Lite

cheapestfastsimple

Llama 4 Scout

open-weightcheaplong-context

About the Providers

Gemini 2.5 Flash-Lite is developed by Google. Llama 4 Scout is developed by Meta. This is a cross-provider comparison between Google and Meta models. For the most up-to-date pricing, always check the official API documentation from each provider.

Compare More AI Models