OrcaRouter directs every request to the right model: hard reasoning goes to frontier models, routine work goes to open source. You get frontier quality where you need it and pay open-source prices everywhere else. No markup. No migration. One API.
```diff
- client = OpenAI(api_key="sk-...")
+ client = OpenAI(
+     base_url="https://api.orcarouter.ai/v1",
+     api_key="sk-orca-..."
+ )

  # Everything else stays the same.
  response = client.chat.completions.create(
      model="gpt-4o",
      messages=[{"role": "user", "content": "..."}]
  )
  # → routed to cheapest live backend, zero markup ✓
```
Your traffic flows to the upstream provider with no shadow proxy in between. The provider you see in your logs is the one that served the request.
Each upstream provider's data and usage terms apply directly to your traffic. Pick the provider that matches your policy.
Every call records the provider, published price, and route taken. Reproduce any routing decision later from the dashboard.
One URL change. Your existing SDK, model names, and streaming all work exactly as before.
Set base_url to api.orcarouter.ai/v1 and swap your API key. No other code changes needed.
Each request checks live provider health and prices in under 1ms. We pick the cheapest available direct backend for that model.
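The selection step can be pictured as a simple cost minimization over healthy backends. This is a minimal sketch under assumed data structures, not OrcaRouter's actual routing code; the providers and prices are illustrative.

```python
from dataclasses import dataclass

@dataclass
class Backend:
    provider: str
    input_per_m: float   # $ per million input tokens (published rate)
    output_per_m: float  # $ per million output tokens
    healthy: bool        # from the live health check

def pick_backend(backends, in_tokens=1000, out_tokens=1000):
    """Return the cheapest healthy backend by estimated request cost."""
    live = [b for b in backends if b.healthy]
    if not live:
        raise RuntimeError("no healthy backend for this model")
    def cost(b):
        return (in_tokens * b.input_per_m + out_tokens * b.output_per_m) / 1e6
    return min(live, key=cost)

backends = [
    Backend("ProviderA", 5.00, 25.00, True),
    Backend("ProviderB", 0.56, 1.12, True),
    Backend("ProviderC", 0.50, 3.00, False),  # unhealthy -> skipped
]
print(pick_backend(backends).provider)  # → ProviderB
```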
Traffic goes direct to the provider. You pay their published rate to them. We add exactly $0 on top — our fee is on the plan, not your tokens.
Real-time routing to the cheapest live backend. Prices refresh every 60s.
| Model | Routed to | Input /M | Output /M | Context | Quality |
|---|---|---|---|---|---|
| claude-opus-4-7 | Anthropic Direct | $5.00 | $25.00 | 1M | 10.0 |
| claude-sonnet-4-6 | Anthropic Direct | $3.00 | $15.00 | 1M | 7.0 |
| gpt-5.5 | OpenAI Direct | $5.00 | $30.00 | 1M | 10.0 |
| gemini-3.1-pro-preview | Google Direct | $4.00 | $18.00 | 1M | 10.0 |
| deepseek-v4-pro | DeepSeek | $0.56 | $1.12 | 1M | 9.0 |
| qwen3.6-plus | Alibaba Cloud | $0.50 | $3.00 | 1M | 8.0 |
| kimi-k2.6 | Moonshot | $0.90 | $3.75 | 256K | 9.0 |
| seedance-2.0 | ByteDance | from $0.07 /sec | — | — | 10.0 |
| + 194 more models · Prices update every 60 seconds | | | | | |
Everything you need to run AI in production without managing multiple provider integrations.
Provider prices refresh every 60s. Every request hits the cheapest available backend automatically.
Provider goes down mid-stream? We switch transparently. Your app sees zero errors.
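Failover at its simplest is a retry loop over backends in cost order, where the caller only ever sees the final successful response. The sketch below shows that request-level pattern with stand-in backend functions; real mid-stream switching also has to resume partial output, which this simplification omits.

```python
def complete_with_failover(backends, prompt):
    """Try each backend in (cheapest-first) order; surface only success."""
    last_err = None
    for call in backends:
        try:
            return call(prompt)          # success: earlier failures are invisible
        except ConnectionError as err:
            last_err = err               # record and fall through to next backend
    raise RuntimeError("all backends failed") from last_err

def flaky(prompt):
    raise ConnectionError("provider down")  # simulates an outage

def stable(prompt):
    return f"ok: {prompt}"

print(complete_with_failover([flaky, stable], "hi"))  # → ok: hi
```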
Issue keys per team or service with spend caps, model allowlists, and rate limits built in.
See exactly what every request cost, which provider handled it, and how much you saved.
Change one line. Same SDK, same model names, same streaming format. Zero migration effort.
Hard and soft limits per key, team, or org. Auto-resets monthly. Slack + webhook alerts.
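A per-key policy combining the controls above (caps, allowlists, limits) might be enforced like this. The field names and `is_allowed` helper are assumptions for illustration, not OrcaRouter's documented schema.

```python
# Hypothetical per-key policy record; soft cap triggers an alert,
# hard cap blocks further requests until the monthly reset.
key_policy = {
    "key": "sk-orca-team-search",
    "spend_cap_usd": {"soft": 400.0, "hard": 500.0},
    "reset": "monthly",
    "allowed_models": ["deepseek-v4-pro", "qwen3.6-plus"],
    "rate_limit_rpm": 600,
    "alerts": ["slack", "webhook"],
}

def is_allowed(policy, model, spent_usd):
    """Block requests for disallowed models or keys past their hard cap."""
    return (model in policy["allowed_models"]
            and spent_usd < policy["spend_cap_usd"]["hard"])

print(is_allowed(key_policy, "deepseek-v4-pro", 120.0))  # → True
print(is_allowed(key_policy, "gpt-5.5", 120.0))          # → False
```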
Every request shows you the provider that served it, the published price they charged, and the route taken. Verifiable per call, reproducible later.
Each completion is tagged with the upstream provider — Anthropic Direct, OpenAI Direct, Bedrock, Vertex — and surfaced in your dashboard and headers.
Every token charge equals the provider's public list price. Audit any request against the provider's own pricing page in seconds.
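Such an audit is simple arithmetic. Using deepseek-v4-pro's rates from the table above ($0.56/M input, $1.12/M output) and hypothetical token counts:

```python
# Recompute a request's cost from published list prices.
input_tokens, output_tokens = 12_000, 3_500
cost = (input_tokens * 0.56 + output_tokens * 1.12) / 1_000_000
print(f"${cost:.6f}")  # → $0.010640
```

If the dashboard shows a different figure for the same token counts, something is off; with zero markup the two numbers must match exactly.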
Failover events, health swaps, and cheapest-backend choices are logged with timestamps. Reproduce any request's routing path.
We never take a cut of your token spend. Our revenue comes from optional team features.
Sign up with GitHub — $5 in tokens free. No credit card required. Swap one line of code and you're live.