Gemini 3.5 Flash API Pricing & Cost Calculator

Gemini 3.5 Flash is Google's fast multimodal model, designed for low-latency responses and high-volume workloads.

Input tokens: 30,000 per request (50% cached)
Output tokens: 3,000 per request
Volume: 1,000 requests per month
API: Standard


Cost Comparison

Based on 30,000 input tokens (50% cached) and 3,000 output tokens per request, at 1,000 requests per month.

Gemini 3.5 Flash: $20.25
Claude Haiku 4.5: $31.50
Gemini 3.1 Pro (<=200k): $69.00
GPT-5.4: $86.25
Claude Sonnet 4.6: $94.50
Claude Opus 4.7: $157.50
GPT-5.5: $172.50

How this is calculated

Gemini 3.5 Flash is priced at $0.50 per million input tokens and $3.00 per million output tokens. Cached input tokens receive a 50% discount ($0.25 per million), and batch requests receive a 50% discount ($0.25 per million input tokens, $1.50 per million output tokens).
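
The arithmetic behind the $20.25 figure can be sketched in a few lines of Python. This is a minimal illustration built only from the prices quoted above; the function name monthly_cost and its parameters are hypothetical, not part of any Google SDK. The page does not say whether the batch and caching discounts can be combined, so the sketch assumes they cannot and rejects that combination.

```python
# Prices quoted on this page, in USD per million tokens.
INPUT_PER_M = 0.50
CACHED_INPUT_PER_M = 0.25   # 50% caching discount on input tokens
OUTPUT_PER_M = 3.00
BATCH_DISCOUNT = 0.50       # 50% off input and output for batch requests


def monthly_cost(input_tokens, output_tokens, requests,
                 cached_fraction=0.0, batch=False):
    """Estimate the monthly cost in USD for a fixed per-request token budget.

    Hypothetical helper: names and defaults are illustrative only.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    if batch and cached_fraction > 0:
        # Assumption: the page does not state how the two discounts stack.
        raise ValueError("batch plus caching is not modelled in this sketch")

    input_rate = INPUT_PER_M * (BATCH_DISCOUNT if batch else 1.0)
    output_rate = OUTPUT_PER_M * (BATCH_DISCOUNT if batch else 1.0)

    cached = input_tokens * cached_fraction
    uncached = input_tokens - cached

    # Cost of one request, scaled by 1e6 (rates are per million tokens).
    per_request_scaled = (uncached * input_rate
                          + cached * CACHED_INPUT_PER_M
                          + output_tokens * output_rate)
    return per_request_scaled * requests / 1_000_000


# Scenario from this page: 30,000 input tokens (50% cached),
# 3,000 output tokens, 1,000 requests per month.
print(f"${monthly_cost(30_000, 3_000, 1_000, cached_fraction=0.5):.2f}")  # $20.25
```

Per request, this works out to $0.0075 for 15,000 uncached input tokens, $0.00375 for 15,000 cached input tokens, and $0.009 for 3,000 output tokens, or $0.02025 in total.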

Verdict

At $20.25 for this workload, Gemini 3.5 Flash is the cheapest model in the comparison above, roughly 36% below the next cheapest option, Claude Haiku 4.5 ($31.50). It is well suited to high-volume multimodal analysis and low-latency completions.


Frequently asked questions

Does Gemini 3.5 Flash support prompt caching?
Yes. Google offers a 50% discount on cached input tokens, which reduces the input price to $0.25 per million tokens for cache hits.