google/gemini-3.5-flash

google/gemini-3.5-flash
by google
INPUT$1.50/ 1M tokens
OUTPUT$9.00/ 1M tokens
p50 TTFT1.57 s7d
p95 TTFT5.98 s7d
TRAFFIC3.3Mtokens / 7d
Code samples
from openai import OpenAI

client = OpenAI(
    base_url="https://api.orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemini-3.5-flash",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
Pricing
Input / 1M tokens$1.50
Output / 1M tokens$9.00
Cache read / 1M$0.150
Cache write / 1M$0.083
CurrencyUSD
Performance
TTFT p50
1.57 s
Output speed
1194 tok/s
TTFT p95
5.98 s
Error rate
0.44%
Public benchmarks
No public benchmark scores ingested yet.
FAQ
How much does google/gemini-3.5-flash cost on OrcaRouter?
google/gemini-3.5-flash is priced at $1.50 per 1M input tokens and $9.00 per 1M output tokens via OrcaRouter. Pricing is pulled live from the routing layer.
What is google/gemini-3.5-flash's context window?
google/gemini-3.5-flash supports a context window of — tokens. Use long-context features (RAG, summarisation) up to that limit.
How do I call google/gemini-3.5-flash via the OpenAI SDK?
Set OpenAI base_url to https://api.orcarouter.ai/v1, supply your OrcaRouter API key, and pass model="google/gemini-3.5-flash" in the chat.completions.create call.
Does OrcaRouter rate-limit google/gemini-3.5-flash?
Per-model rate limits follow your OrcaRouter plan. Free tiers ship with conservative caps; paid tiers lift them. Check /pricing for current quotas.
Embed this badge
google/gemini-3.5-flash$1.50/M in1572ms p50via OrcaRouter
HTML <a href="https://www.orcarouter.ai/models/google/gemini-3.5-flash" target="_blank"> <img src="https://www.orcarouter.ai/embed/google/gemini-3.5-flash.svg" alt="google/gemini-3.5-flash on OrcaRouter" /> </a>
Markdown [![google/gemini-3.5-flash](https://www.orcarouter.ai/embed/google/gemini-3.5-flash.svg)](https://www.orcarouter.ai/models/google/gemini-3.5-flash)