← Back to Calculator

Gemini 2.5 Flash-Lite vs Qwen 3.7 Max

Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-07-01.

Cost at a Glance

Qwen 3.7 Max is 21.2× more expensive than Gemini 2.5 Flash-Lite — switching could save 95.3% on your monthly API bill.

At 5M input + 2M output tokens per month:

Gemini 2.5 Flash-Lite$1.30/movsQwen 3.7 Max$27.50/mo

Choosing Gemini 2.5 Flash-Lite could save you $26.20/mo

Price Comparison (per 1M tokens)

Input (per 1M tokens)
$0.10vs$2.50
Output (per 1M tokens)
$0.40vs$7.50
Cache Write (per 1M tokens)
vs
Cache Read (per 1M tokens)
$0.01vs
Batch Input (per 1M tokens)
vs$1.25
Batch Output (per 1M tokens)
vs$3.75

Prices are per 1 million tokens. Actual costs depend on your monthly token volume. Use the calculator →

Specifications

Context Window
1Mvs1M
Max Output Tokens
64Kvs128K
Knowledge Cutoff
2026-04vs2026-05
Modalities
TextvsText
Prompt Caching
YesvsYes
Batch API
NovsYes
Thinking / Reasoning
NovsYes

Benchmark Scores

SWE-bench Verified
45.0vs80.4
MMLU-Pro
72.0vs89.6
HumanEval
82.0vs87.2

Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.

Use Cases

Gemini 2.5 Flash-Lite

cheapestfastsimple

Qwen 3.7 Max

codingreasoningagentic

About the Providers

Gemini 2.5 Flash-Lite is developed by Google. Qwen 3.7 Max is developed by Alibaba. This is a cross-provider comparison between Google and Alibaba models. For the most up-to-date pricing, always check the official API documentation from each provider.

Compare More AI Models