Gemini 3.1 Pro vs Claude Sonnet 4.6: the developer API cost face-off

Coding workhorses and reasoning model showdown.

For complex coding and agentic reasoning, Gemini 3.1 Pro and Claude Sonnet 4.6 are the top contenders. Their cost structures depend heavily on prompt caching efficiency.

Cost Comparison

Based on 100,000 input tokens (50% cached), 5,000 output tokens, and 100 requests.

Gemini 3.5 Flash
$5.25
Claude Haiku 4.5
$8.00
Gemini 3.1 Pro (<=200k)
$17.00
GPT-5.4
$21.25
Claude Sonnet 4.6
$24.00
Claude Opus 4.7
$40.00
GPT-5.5
$42.50
Option A
Gemini 3.1 Pro (<=200k)
Wins 3 of 4 compared specs
Option B
Claude Sonnet 4.6
Wins 0 of 4 compared specs

Side-by-side specs

SpecGemini 3.1 Pro (<=200k)Claude Sonnet 4.6
Input Cost (per M)$2.00 (better on this spec)$3.00
Output Cost (per M)$12.00 (better on this spec)$15.00
Cached Input (per M)$0.20 (better on this spec)$0.30
Batch Discount50%50%

How they differ

Gemini 3.1 Pro costs $2.00 per million input tokens and $12.00 per million output tokens, with a 90% cache discount. Claude Sonnet 4.6 is priced at $3.00 per million input tokens and $15.00 per million output tokens, also with a 90% cache discount.

Verdict

Gemini 3.1 Pro is more economical across the board for both base input and output costs. Claude Sonnet 4.6 is preferred if you need Anthropic's specific coding and instruction-following strengths.

Which should you pick?

Choose Gemini 3.1 Pro (<=200k)

Massive context size processing (up to 2M tokens) and general cost savings.

Choose Claude Sonnet 4.6

State-of-the-art multi-file code editing, tool use, and agentic loops.

Related comparisons

GPT-5.4 vs Claude Sonnet 4.6
The workhorse model pricing showdown.
Read comparison ➜
Claude Opus 4.7 vs DeepSeek V4 Pro
Frontier reasoning versus optimized price-performance.
Read comparison ➜
DeepSeek V4 Pro vs Mistral Large 3
Serverless pricing versus flagship open weights.
Read comparison ➜
Gemini 3.5 Flash vs GPT-5.4 Mini
Fast, lightweight multimodal models comparison.
Read comparison ➜
Gemini 3.5 Flash vs Claude Sonnet 4.6
Speedy utility model versus premium reasoning flagship.
Read comparison ➜
Gemini 3.5 Flash vs GPT-5.4
Utility cost versus premium flagship performance.
Read comparison ➜
GPT-4o vs Claude Sonnet 4.5
Optimized premium intelligence confrontation.
Read comparison ➜
GPT-5.4 Mini vs Claude Haiku 4.5
Rapid response utility models compared.
Read comparison ➜
GPT-5.4 Nano vs DeepSeek V4 Flash
Ultra-low-cost utility endpoints comparison.
Read comparison ➜
GPT-5.5 vs Claude Opus 4.7
The frontier intelligence showdown.
Read comparison ➜
GPT-5.5 vs DeepSeek V4 Pro
Premium frontier intelligence versus budget-optimized utility.
Read comparison ➜
GPT-5.5 Pro vs o3 Pro
Pro-tier specialized reasoning comparison.
Read comparison ➜
Claude Sonnet 4.6 vs DeepSeek V4 Pro
Premier coding engine versus price-performance champion.
Read comparison ➜
o4-mini vs GPT-5.4 Mini
Reasoning capabilities versus standard speed-optimized utility.
Read comparison ➜
GPT-5.4 Nano vs Gemini 2.5 Flash-Lite
High-frequency entry-level endpoints comparison.
Read comparison ➜
GPT-5.2 Codex vs Mistral Codestral
Developer-focused auto-complete and refactoring endpoints.
Read comparison ➜
Gemini 3.1 Flash Live vs GPT-5.4 Nano
Low-latency streaming models compared.
Read comparison ➜
Gemini 3.1 Pro vs Claude Opus 4.7
Large-context reasoning versus frontier flagship reasoning.
Read comparison ➜
Mistral Small 4 vs Mistral Large 3
Utility-scale model versus flagship Europe-hosted logic.
Read comparison ➜
Mistral Small 4 vs Mistral Medium 3.5
Lightweight utility versus balanced medium-scale logic.
Read comparison ➜