Gemini 3.5 Flash API Pricing & Cost Calculator

Q: Does Gemini 3.5 Flash support prompt caching?

Yes, Google offers a 50% discount on cached input tokens, reducing the input price to $0.25 per million for matches.

Gemini 3.5 Flash is one of the most cost-effective fast models on the market. It is ideal for high-volume multimodal analysis and low-latency completions.

Gemini 3.5 Flash is Google's ultra-fast multimodal model, designed for extreme speed and high scalability.

By TechCompare · Updated July 2026

Input tokens

30,000

50% cached

Output tokens

3,000

per request

Volume

1,000 / monthly

Standard API

Cost Comparison

Based on 30,000 input tokens (50% cached), 3,000 output tokens, and 1,000 requests.

Claude Haiku 4.5

$31.50

Gemini 3.5 Flash

$55.12

Gemini 3.1 Pro (<=200k)

$69.00

GPT-5.4

$86.25

Claude Sonnet 4.6

$94.50

Claude Opus 4.8

$157.50

GPT-5.5

$172.50

Open full calculator to adjust tokens, caching, or add other models

How this is calculated

Gemini 3.5 Flash is priced at $1.50 per million input tokens and $9.00 per million output tokens, supporting a 75% caching discount ($0.38 per million) and a 50% batch discount ($0.75 per million inputs, $4.50 per million outputs).

Verdict

Gemini 3.5 Flash is one of the most cost-effective fast models on the market. It is ideal for high-volume multimodal analysis and low-latency completions.