google/gemini-3.1-flash-lite

google/gemini-3.1-flash-lite
by google
INPUT$0.25/ 1M tokens
OUTPUT$1.50/ 1M tokens
p50 TTFT623 ms7d
p95 TTFT623 ms7d
TRAFFIC6.6Mtokens / 7d
Code samples
from openai import OpenAI

client = OpenAI(
    base_url="https://api.orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemini-3.1-flash-lite",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
Pricing
Input / 1M tokens$0.250
Output / 1M tokens$1.50
Cache read / 1M$0.025
CurrencyUSD
Performance
p50 TTFT
623 ms
Output speed
p95 TTFT
623 ms
Error rate
0.68%
Public benchmarks
No public benchmark scores ingested yet.
FAQ
How much does google/gemini-3.1-flash-lite cost on OrcaRouter?
google/gemini-3.1-flash-lite is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens via OrcaRouter. Pricing is pulled live from the routing layer.
What is google/gemini-3.1-flash-lite's context window?
google/gemini-3.1-flash-lite supports a context window of — tokens. Use long-context features (RAG, summarisation) up to that limit.
How do I call google/gemini-3.1-flash-lite via the OpenAI SDK?
Set OpenAI base_url to https://api.orcarouter.ai/v1, supply your OrcaRouter API key, and pass model="google/gemini-3.1-flash-lite" in the chat.completions.create call.
Does OrcaRouter rate-limit google/gemini-3.1-flash-lite?
Per-model rate limits follow your OrcaRouter plan. Free tiers ship with conservative caps; paid tiers lift them. Check /pricing for current quotas.
Embed this badge
google/gemini-3.1-flash-lite$0.25/M in623ms p50via OrcaRouter
HTML <a href="https://www.orcarouter.ai/models/google/gemini-3.1-flash-lite" target="_blank"> <img src="https://www.orcarouter.ai/embed/google/gemini-3.1-flash-lite.svg" alt="google/gemini-3.1-flash-lite on OrcaRouter" /> </a>
Markdown [![google/gemini-3.1-flash-lite](https://www.orcarouter.ai/embed/google/gemini-3.1-flash-lite.svg)](https://www.orcarouter.ai/models/google/gemini-3.1-flash-lite)