Gemini 2.5 Flash-Lite vs Qwen 3.7 Max
Detailed comparison of API pricing, technical specifications, and benchmark scores. Updated 2026-07-01.
Cost at a Glance
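At 5M input + 2M output tokens per month, Qwen 3.7 Max costs $27.50 and Gemini 2.5 Flash-Lite costs $1.30, making Qwen 3.7 Max about 21.2× more expensive.
Choosing Gemini 2.5 Flash-Lite would save $26.20/mo (95.3%) at that volume.
The ratio depends on your input/output mix: per token, Qwen 3.7 Max is 25× more expensive on input and 18.75× more expensive on output.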
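The monthly figures follow directly from the standard input and output rates in the price table. Here is a minimal sketch of that arithmetic, assuming flat per-1M-token rates with no caching or batch discounts. The `RATES` dictionary and the `monthly_cost` function are illustrative names, not part of any provider SDK.

```python
# Illustrative rates in USD per 1M tokens, copied from the price table.
RATES = {
    "Gemini 2.5 Flash-Lite": {"input": 0.10, "output": 0.40},
    "Qwen 3.7 Max": {"input": 2.50, "output": 7.50},
}

def monthly_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the monthly cost in USD at flat per-1M-token rates."""
    r = RATES[model]
    return (input_tokens * r["input"] + output_tokens * r["output"]) / 1_000_000

# Sample workload from the text: 5M input + 2M output tokens per month.
gemini = monthly_cost("Gemini 2.5 Flash-Lite", 5_000_000, 2_000_000)
qwen = monthly_cost("Qwen 3.7 Max", 5_000_000, 2_000_000)

print(f"Gemini 2.5 Flash-Lite: ${gemini:.2f}/mo")
print(f"Qwen 3.7 Max:          ${qwen:.2f}/mo")
print(f"Difference:            ${qwen - gemini:.2f}/mo")
print(f"Ratio:                 {qwen / gemini:.1f}x")
print(f"Savings:               {(qwen - gemini) / qwen:.1%}")
```

Changing the two token counts shows how the ratio shifts between 18.75× (output-only) and 25× (input-only) as the mix changes.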
Price Comparison (per 1M tokens)
| Metric | Gemini 2.5 Flash-Lite | Qwen 3.7 Max |
|---|---|---|
| Input (per 1M tokens) | $0.10 | $2.50 |
| Output (per 1M tokens) | $0.40 | $7.50 |
| Cache Write (per 1M tokens) | — | — |
| Cache Read (per 1M tokens) | $0.01 | — |
| Batch Input (per 1M tokens) | — | $1.25 |
| Batch Output (per 1M tokens) | — | $3.75 |
Prices are per 1 million tokens. Actual costs depend on your monthly token volume.
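The batch and cache-read rows can change the totals for workloads that qualify. The sketch below is rough and rests on two simplifying assumptions: the batch rates replace the standard rates for the whole workload, and a chosen number of input tokens is billed at the cache-read rate. Real billing rules (cache eligibility, storage fees, batch limits) vary by provider and are not modeled here. The function `cost_usd` and its parameters are hypothetical.

```python
def cost_usd(input_tokens, output_tokens, input_rate, output_rate,
             cached_input_tokens=0, cache_read_rate=None):
    """Cost in USD; all rates are USD per 1M tokens.

    cached_input_tokens is the (assumed) part of the input billed at
    cache_read_rate instead of input_rate.
    """
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")
    if cached_input_tokens and cache_read_rate is None:
        raise ValueError("no cache-read rate listed for this model")
    fresh_tokens = input_tokens - cached_input_tokens
    total = fresh_tokens * input_rate + output_tokens * output_rate
    if cached_input_tokens:
        total += cached_input_tokens * cache_read_rate
    return total / 1_000_000

# Qwen 3.7 Max at its listed batch rates ($1.25 in / $3.75 out).
qwen_batch = cost_usd(5_000_000, 2_000_000, 1.25, 3.75)

# Gemini 2.5 Flash-Lite, assuming 3M of the 5M input tokens are cache reads.
gemini_cached = cost_usd(5_000_000, 2_000_000, 0.10, 0.40,
                         cached_input_tokens=3_000_000, cache_read_rate=0.01)

print(f"Qwen 3.7 Max (batch):            ${qwen_batch:.2f}/mo")
print(f"Gemini 2.5 Flash-Lite (cached):  ${gemini_cached:.2f}/mo")
```

Under these assumptions, batch pricing halves the Qwen 3.7 Max bill for the sample workload, but it remains well above the Gemini 2.5 Flash-Lite total.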
Specifications
| Spec | Gemini 2.5 Flash-Lite | Qwen 3.7 Max |
|---|---|---|
| Context Window | 1M | 1M |
| Max Output Tokens | 64K | 128K |
| Knowledge Cutoff | 2026-04 | 2026-05 |
| Modalities | Text | Text |
| Prompt Caching | Yes | Yes |
| Batch API | No | Yes |
| Thinking / Reasoning | No | Yes |
Benchmark Scores
| Benchmark | Gemini 2.5 Flash-Lite | Qwen 3.7 Max |
|---|---|---|
| SWE-bench Verified | 45.0 | 80.4 |
| MMLU-Pro | 72.0 | 89.6 |
| HumanEval | 82.0 | 87.2 |
Benchmark scores from official provider reports. Higher is better. Missing scores indicate the provider has not published that benchmark yet.
About the Providers
Gemini 2.5 Flash-Lite is developed by Google, and Qwen 3.7 Max is developed by Alibaba. Because the two models come from different providers, pricing terms and feature definitions may not match exactly. For current pricing, check each provider's official API documentation.