Gemini 3.1 Pro Preview

Name: Google: Gemini 3.1 Pro Preview API
Brand: Google

google/gemini-3.1-pro-preview

FlagshipFeatured

by Google · 2026-02-19

Google Gemini 3.1 Pro Preview: flagship multimodal model with 1M context window and 95.6 τ²-Bench score, accessed via OrcaRouter API.

Endpoints:/v1/chat/completions /v1beta/models/{model}:generateContent

ctx1.05M tokens

Inputaudio + file + image + text + video

Outputtext

p50 TTFT5.71 s

from openai import OpenAI

client = OpenAI(
    base_url="https://api.orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

INPUT$2.00/ 1M tokens

OUTPUT$12.00/ 1M tokens

p50 TTFT5.71 s7d

p95 TTFT10.00 s7d

TRAFFIC5.1Mtokens / 7d

Get the Gemini 3.1 Pro Preview API →▶ Try in playground </> Use via API

What is Google Gemini 3.1 Pro Preview?

Google Gemini 3.1 Pro Preview is a flagship model from Google, offered in preview form. It is a multimodal model capable of processing text, image, video, audio, and file inputs. The model is categorized as flagship tier, indicating that it is designed for high-demand, complex applications where performance and capacity are critical. As a preview, it may have limitations in stability or availability compared to stable releases. Access is provided through the OrcaRouter API.

Who should use this model?

This model is intended for developers and enterprises that need to handle large context windows up to 1,048,576 tokens and require multimodal understanding. Use cases include long-document analysis, video moderation, advanced chatbots with memory of entire conversations, and complex data extraction from mixed media. The preview status makes it suitable for experimentation and early integration, but production deployments should evaluate stability. It is also ideal for teams already using OrcaRouter’s OpenAI-compatible API who want to test Google’s latest flagship capabilities.

What are the key specifications?

The model supports a context window of 1,048,576 tokens (input) and a maximum output of 65,536 tokens. It accepts input in multiple modalities: audio, file (e.g., PDF, code files), image, text, and video. The headline benchmark score is 95.6 on τ²-Bench, a metric that measures task completion performance. The model is classified as flagship tier by its provider, Google. It is accessed via OrcaRouter’s API at base URL https://api.orcarouter.ai/v1 with model ID "google/gemini-3.1-pro-preview".

How does it compare to other Gemini previews?

As a preview version of Gemini 3.1 Pro, this model sits at the top of Google’s current lineup among preview releases. It offers a significantly larger context window (1M tokens) and higher output limits (65K tokens) compared to earlier Gemini 2.0 models or Gemini 3.0 previews. The τ²-Bench score of 95.6 provides a quantitative benchmark for task-oriented performance. Compared to other preview models from Google, this one targets the most demanding use cases where both breadth of context and depth of reasoning are required.

Code samples

from openai import OpenAI

client = OpenAI(
    base_url="https://api.orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="google/gemini-3.1-pro-preview",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)

Pricing

Tier	Input / 1M tokens	Output / 1M tokens	Cache read / 1M	Cache write / 1M
≤ 200K	$2.00	$12.00	$0.200	$0.375
≤ ∞	$4.00	$18.00	$0.400	$0.375
Tier selected by input token count of each request

Performance

last 7 days

p50 TTFT

5.71 s

Output speed

429 tok/s

p95 TTFT

10.00 s

Error rate

Public benchmarks

Last evaluated 2026-02-19

55.5

AA Coding

Better than 75% of models compared

57.2

AA Intelligence

Better than 80% of models compared

GPQA Diamond

94.1

Humanity's Last Exam

44.7

IFBench

77.1

Long-Context Recall

72.7

SciCode

58.9

TerminalBench Hard

53.8

τ²-Bench

95.6

Source: artificialanalysis.ai

More from Google

See all models from google →

Gemini 3.1 Pro Preview Custom Tools

google/gemini-3.1-pro-preview-customtools

$4.00 in · $18.00 out / 1M

1.05M ctx· quality 10/10

Gemini 3 Flash PreviewCheapest

google/gemini-3-flash-preview

$0.50 in · $3.00 out / 1M

1.05M ctx· quality 9/10

Gemini 3.5 Flash

google/gemini-3.5-flash

$1.50 in · $9.00 out / 1M

1.05M ctx· quality 9/10

FAQ

What is the cost of using Gemini 3.1 Pro Preview on OrcaRouter?

Pricing details are not provided in the available facts. As a flagship model, it is typically priced higher per token than smaller models. Costs depend on input and output token usage. Check OrcaRouter’s current pricing page for exact rates.

How large is the context window?

The model supports a context window of 1,048,576 tokens (input). This means you can submit very long documents, code, or conversation history in a single request. The maximum output is 65,536 tokens.

What are the main strengths of this model?

Its main strengths include a massive context window (1M tokens), high output limit (65K tokens), multimodal input (audio, file, image, text, video), and a strong τ²-Bench score of 95.6, indicating high accuracy on agentic tasks.

How does Gemini 3.1 Pro Preview compare to Gemini 2.0 models?

It offers a much larger context window (1M vs. up to 32K) and higher output limit (65K vs. 8K). It also supports more input modalities. However, it is a preview version and may have less stability than Gemini 2.0 stable releases.

Does OrcaRouter handle data privacy for requests?

Data handling policies are not specified in the provided facts. Users should review OrcaRouter’s data processing and privacy documentation to understand how input and output data are treated.

How do I call this model via an OpenAI-compatible API?

Use the base URL https://api.orcarouter.ai/v1 and set the model ID to "google/gemini-3.1-pro-preview". The API follows the standard OpenAI chat completions format. Authentication requires an API key from OrcaRouter.

What is the τ²-Bench score and why does it matter?

The model scored 95.6 on τ²-Bench, a benchmark that measures task completion performance. This quantitative metric reflects the model’s ability to handle complex, multi-step tasks accurately.

Can I use this model for production?

As a preview model, it is intended for testing and experimentation. It may have lower rate limits, less reliability, and ongoing changes. For production, consider using a stable, non-preview model.

What input modalities are supported?

The model supports audio, file (e.g., PDFs, code files), image, text, and video inputs. All can be included in a single request for cross-modal reasoning.

Is the model available for streaming responses?

The available facts do not specify streaming support. OrcaRouter’s API likely supports streaming for compatible models, but for this preview, check the documentation for stream parameter availability.

Embed this badge

Paste into your blog post

Google: Gemini 3.1 Pro Preview•$2.00/M in•5707ms p50•via OrcaRouter

HTML <a href="https://www.orcarouter.ai/models/google/gemini-3.1-pro-preview" target="_blank"> <img src="https://www.orcarouter.ai/embed/google/gemini-3.1-pro-preview.svg" alt="Google: Gemini 3.1 Pro Preview on OrcaRouter" /> </a>

Markdown [![Google: Gemini 3.1 Pro Preview](https://www.orcarouter.ai/embed/google/gemini-3.1-pro-preview.svg)](https://www.orcarouter.ai/models/google/gemini-3.1-pro-preview)

Gemini 3.1 Pro Preview

What is Google Gemini 3.1 Pro Preview?

Who should use this model?

What are the key specifications?

How does it compare to other Gemini previews?

What input modalities does it handle?

How large of a context window does it offer?

What are the best use cases for this model?

When should a cheaper model be used instead?

What is the τ²-Bench score and what does it mean?

What is known about speed and latency?

What are the model’s strengths based on available data?

What are honest limitations to consider?

How is pricing structured for this model on OrcaRouter?

What cost trade-offs should users consider?

Is caching available to reduce costs?

What is the likely cost relative to other Google models?

How do I call the model via OrcaRouter’s API?

Which parameters can I configure?

How can I migrate from another provider to OrcaRouter?

What should I know about the preview status when using the API?

How does Gemini 3.1 Pro Preview compare to other Google models?

How does it compare to models from other providers like GPT or Claude?

When should I choose Gemini 3.1 Pro Preview over other models?

When should I choose an alternative model instead?

Code samples

Pricing

Performance

Public benchmarks

More from Google

FAQ

Embed this badge

Gemini 3.1 Pro Preview

Model details

What is Google Gemini 3.1 Pro Preview?

Who should use this model?

What are the key specifications?

How does it compare to other Gemini previews?

What input modalities does it handle?

How large of a context window does it offer?

What are the best use cases for this model?

When should a cheaper model be used instead?

What is the τ²-Bench score and what does it mean?

What is known about speed and latency?

What are the model’s strengths based on available data?

What are honest limitations to consider?

How is pricing structured for this model on OrcaRouter?

What cost trade-offs should users consider?

Is caching available to reduce costs?

What is the likely cost relative to other Google models?

How do I call the model via OrcaRouter’s API?

Which parameters can I configure?

How can I migrate from another provider to OrcaRouter?

What should I know about the preview status when using the API?

How does Gemini 3.1 Pro Preview compare to other Google models?

How does it compare to models from other providers like GPT or Claude?

When should I choose Gemini 3.1 Pro Preview over other models?

When should I choose an alternative model instead?

Code samples

Pricing

Performance

Public benchmarks

More from Google

FAQ

Embed this badge