google/gemini-3.1-flash-lite

Name: google/gemini-3.1-flash-lite API
Brand: google

by google

Endpoints:/v1/chat/completions /v1beta/models/{model}:generateContent

p50 TTFT759 ms

from openai import OpenAI

client = OpenAI(
    base_url="https://api.orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

INPUT$0.25/ 1M tokens

OUTPUT$1.50/ 1M tokens

p50 TTFT759 ms7d

p95 TTFT1.37 s7d

TRAFFIC4.4Mtokens / 7d

Get the google/gemini-3.1-flash-lite API →▶ Try in playground </> Use via API

Code samples

Call from any SDK

OpenAI-compatible — keep the SDK you already use

OpenAI SDKhttps://api.orcarouter.ai/v1
Gemini SDKhttps://api.orcarouter.ai

from openai import OpenAI

client = OpenAI(
    base_url="https://api.orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemini-3.1-flash-lite",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)

Pricing

Input / 1M tokens	$0.250
Output / 1M tokens	$1.50
Cache read / 1M	$0.025
Currency	USD

Cost calculator

Tokens / month10MM

70%

Estimate based on list price

Token & cost estimator

Expected output tokens

Input tokens: 20Cost per request: $0.000755

Estimate only — actual token counts depend on the provider's tokenizer.

Performance

last 7 days

p50 TTFT

759 ms

Output speed

19.0 tok/s

p95 TTFT

1.37 s

Error rate

Public benchmarks

pending

How Design Arena works

Source: Design Arena

How it compares

	google/gemini-3.1-flash-lite	Gemini 3.1 Pro Preview	Gemini 3.1 Pro Preview Custom Tools	Gemini 3 Flash Preview
Input $/M	$0.25	$2.00	$4.00	$0.50
Output $/M	$1.50	$12.00	$18.00	$3.00
Context	—	1.0M	1.0M	1.0M
Quality	5/10	10/10	10/10	9/10
Compare side-by-side		Compare side-by-side	Compare side-by-side	Compare side-by-side

More from google

See all models from google →

Gemini 3.5 FlashCheapest

google/gemini-3.5-flash

$1.50 in · $9.00 out / 1M

1.05M ctx· quality 9/10

Compare side-by-side

Gemini 3.6 FlashNew

google/gemini-3.6-flash

$1.50 in · $7.50 out / 1M

1.05M ctx· quality 8/10

Compare side-by-side

google/gemini-pro-latest

google/gemini-pro-latest

$4.00 in · $18.00 out / 1M

· quality 8/10

Compare side-by-side

FAQ

How much does google/gemini-3.1-flash-lite cost on OrcaRouter?

google/gemini-3.1-flash-lite is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens via OrcaRouter. Pricing is pulled live from the routing layer.

What is google/gemini-3.1-flash-lite's context window?

google/gemini-3.1-flash-lite supports a context window of — tokens. Use long-context features (RAG, summarisation) up to that limit.

How do I call google/gemini-3.1-flash-lite via the OpenAI SDK?

Set OpenAI base_url to https://api.orcarouter.ai/v1, supply your OrcaRouter API key, and pass model="google/gemini-3.1-flash-lite" in the chat.completions.create call.

Does OrcaRouter rate-limit google/gemini-3.1-flash-lite?

Per-model rate limits follow your OrcaRouter plan. Free tiers ship with conservative caps; paid tiers lift them. Check /pricing for current quotas.

Embed this badge

Paste into your blog post

google/gemini-3.1-flash-lite•$0.25/M in•759ms p50•via OrcaRouter

HTML <a href="https://www.orcarouter.ai/models/google/gemini-3.1-flash-lite" target="_blank"> <img src="https://www.orcarouter.ai/embed/google/gemini-3.1-flash-lite.svg" alt="google/gemini-3.1-flash-lite on OrcaRouter" /> </a>

Markdown [![google/gemini-3.1-flash-lite](https://www.orcarouter.ai/embed/google/gemini-3.1-flash-lite.svg)](https://www.orcarouter.ai/models/google/gemini-3.1-flash-lite)

Model card as data

GET /api/public/models/google/gemini-3.1-flash-liteOpen

Machine-readable:/llms.txt /llms-full.txt