How much can Gemini users save?
Start with a $1,000/month Gemini 2.5 Flash input bill. CPEN shows the projected bill, monthly savings, and same-GPQA routes before any table.
Same GPQA, different price.
Start with the bill you already know: $1,000/month on reference-model input becomes a concrete CPEN route, savings percent, and dollar delta before the detailed $/1M table.
At $1,000/month current input spend
Default view uses input price per 1M tokens. Output and total modes are estimates for comparing token mixes, not metered billing.
Monthly data analysis / extraction
Gemini 2.5 Flash → CPEN Middle
Savings scenarios as you scroll
Each model shows how the bill changes when moved to a same-GPQA CPEN preset.
Up to 97% savings
| Model | GPQA | Input value score | Input $/1M | CPEN input $/1M | Input comparison |
|---|---|---|---|---|---|
| GPT-5.4 nanoOpenAI | 76 | 38.0 | $0.25 | $0.05 | 80% |
| GPT-5.4 miniOpenAI | 88 | 17.0 | $0.75 | $0.08 | 89% |
| GPT-5.4OpenAI | 92 | 5.56 | $2.50 | $0.12 | 95% |
| Claude Haiku 4.5Anthropic | 78 | 100 | $0.10 | $0.05 | 50% |
| Claude Sonnet 4.6Anthropic | 87.5 | 4.19 | $3.00 | $0.08 | 97% |
| Claude Opus 4.7Anthropic | 91.4 | 2.75 | $5.00 | $0.12 | 98% |
| Gemini 2.5 FlashGoogle | 82.8 | 37.6 | $0.30 | $0.08 | 73% |
| Gemini 3 FlashGoogle | 90.4 | 26.9 | $0.50 | $0.12 | 76% |
| Gemini 3.5 FlashGoogle | 92.2 | 9.31 | $1.50 | $0.12 | 92% |
- Input $/1M
- $0.25
- CPEN input $/1M
- $0.05
- Input comparison
- 80%
- Input $/1M
- $0.75
- CPEN input $/1M
- $0.08
- Input comparison
- 89%
- Input $/1M
- $2.50
- CPEN input $/1M
- $0.12
- Input comparison
- 95%
- Input $/1M
- $0.10
- CPEN input $/1M
- $0.05
- Input comparison
- 50%
- Input $/1M
- $3.00
- CPEN input $/1M
- $0.08
- Input comparison
- 97%
- Input $/1M
- $5.00
- CPEN input $/1M
- $0.12
- Input comparison
- 98%
- Input $/1M
- $0.30
- CPEN input $/1M
- $0.08
- Input comparison
- 73%
- Input $/1M
- $0.50
- CPEN input $/1M
- $0.12
- Input comparison
- 76%
Input value score excludes success rate, output price, and total token cost. Input comparison uses CPEN preset input prices; final cost depends on token mix.
Three presets. One API.
Low, Middle, High are convenience presets grouped by GPQA score. Override price, RPM, and strategy per workload.
Bulk extraction and classification.
Use cpen/lowBalanced reasoning for invoices and forms.
Use cpen/middleFrontier reasoning at a fraction of the cost.
Use cpen/highCap first. Route second. Reject if over.
Set a USD ceiling per 1M tokens. CPEN picks a live route at a matching GPQA score inside your cap. If none fits, the request is rejected — never silently overspent.
Declare max USD per 1M tokens before dispatch.
CPEN selects a live model at a matching GPQA score inside your cap.
No fit? Request fails. No silent escalation.
Savings vs reference models within ±2 GPQA points. Final cost depends on token mix and route availability.
Three presets. One API.
Low, Middle, High are convenience presets grouped by GPQA score. Override price, RPM, and strategy per workload.