Gemini 3.5 Flash vs Llama 4 Maverick
Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-06-29.
Cost at a Glance
At 5M input + 2M output tokens per month, Llama 4 Maverick costs about 1.9× as much as Gemini 3.5 Flash ($8.00 vs $4.25).
Choosing Gemini 3.5 Flash could save you $3.75/mo (46.9%).
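The headline figures can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: it uses the input and output prices from the price comparison table, and the `PRICES` dictionary and `monthly_cost` helper are names made up for this example, not part of any provider SDK.

```python
# USD per 1M tokens, copied from the price comparison table.
PRICES = {
    "Gemini 3.5 Flash": {"input": 0.25, "output": 1.50},
    "Llama 4 Maverick": {"input": 0.80, "output": 2.00},
}


def monthly_cost(model: str, input_tokens_m: float, output_tokens_m: float) -> float:
    """Return the monthly cost in USD for token volumes given in millions."""
    price = PRICES[model]
    return input_tokens_m * price["input"] + output_tokens_m * price["output"]


# Example workload from this page: 5M input + 2M output tokens per month.
gemini = monthly_cost("Gemini 3.5 Flash", 5, 2)
llama = monthly_cost("Llama 4 Maverick", 5, 2)

savings = llama - gemini               # absolute saving in USD per month
ratio = llama / gemini                 # how many times more Llama costs
savings_pct = savings / llama * 100    # saving relative to the Llama bill

print(f"Gemini 3.5 Flash: ${gemini:.2f}/mo")
print(f"Llama 4 Maverick: ${llama:.2f}/mo")
print(f"Savings: ${savings:.2f}/mo ({savings_pct:.1f}%), ratio {ratio:.1f}x")
```

Both the ratio and the percentage depend on the mix of input and output tokens, so they should be recomputed for your own volumes.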
Price Comparison (per 1M tokens)
| Metric | Gemini 3.5 Flash | Llama 4 Maverick |
|---|---|---|
| Input (per 1M tokens) | $0.25 | $0.80 |
| Output (per 1M tokens) | $1.50 | $2.00 |
| Cache Write (per 1M tokens) | — | — |
| Cache Read (per 1M tokens) | $0.03 | — |
| Batch Input (per 1M tokens) | — | — |
| Batch Output (per 1M tokens) | — | — |
Prices are per 1 million tokens. Actual costs depend on your monthly token volume.
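The table lists a cache read price only for Gemini 3.5 Flash, so prompt caching can lower its input bill further. The sketch below estimates that effect under loudly simplified assumptions: a fixed fraction of input tokens (`cache_hit_rate`, a hypothetical parameter) is billed at the cache read rate, and cache writes and storage are ignored because the table gives no price for them. Real billing rules may differ, so treat this as a rough model.

```python
# USD per 1M tokens for Gemini 3.5 Flash, copied from the table above.
INPUT_PRICE = 0.25
CACHE_READ_PRICE = 0.03


def gemini_input_cost(input_tokens_m: float, cache_hit_rate: float) -> float:
    """Estimate monthly input cost in USD with a given cache hit rate.

    Assumption: cached tokens are billed at the cache read rate and all
    other input tokens at the normal input rate. Cache write and storage
    charges are not modeled.
    """
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    cached = input_tokens_m * cache_hit_rate
    uncached = input_tokens_m - cached
    return uncached * INPUT_PRICE + cached * CACHE_READ_PRICE


# 5M input tokens per month at several hypothetical hit rates.
for rate in (0.0, 0.5, 0.9):
    print(f"hit rate {rate:.0%}: ${gemini_input_cost(5, rate):.2f}/mo input")
```

Llama 4 Maverick has no cache read price in the table, so its input cost stays at the listed input rate in this model.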
Specifications
| Spec | Gemini 3.5 Flash | Llama 4 Maverick |
|---|---|---|
| Context Window | 1M | 1M |
| Max Output Tokens | 64K | 64K |
| Knowledge Cutoff | 2026-04 | 2026-03 |
| Modalities | Text, Image | Text, Image |
| Prompt Caching | Yes | No |
| Batch API | No | No |
| Thinking / Reasoning | No | No |
Benchmark Scores
| Benchmark | Gemini 3.5 Flash | Llama 4 Maverick |
|---|---|---|
| SWE-bench Verified | 65.0 | 72.0 |
| MMLU-Pro | 80.0 | 81.0 |
| HumanEval | 90.0 | 89.0 |
Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.
About the Providers
Gemini 3.5 Flash is developed by Google and Llama 4 Maverick by Meta, so this is a cross-provider comparison. For the most up-to-date pricing, always check each provider's official API documentation.