Frontier quality. Open-source prices.

API واحد. كل نموذج. بدون وسطاء.

يوجّه OrcaRouter كل طلب إلى النموذج المناسب: الاستدلال الصعب إلى نماذج الجبهة، والعمل الروتيني إلى المصدر المفتوح. تحصل على جودة الجبهة حيث تحتاجها وتدفع أسعار المصدر المفتوح في كل مكان آخر. بدون هامش. بدون ترحيل. API واحد.

- client = OpenAI(api_key="sk-...")
+ client = OpenAI(
+ base_url="https://api.orcarouter.ai/v1",
+ api_key="sk-orca-..."
+ )
 
# Everything else stays the same.
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "..."}]
)
# → routed to cheapest live backend, zero markup ✓
claude-opus-4-7$5.00 in·$25.00 outAnthropic Direct
claude-sonnet-4-6$3.00 in·$15.00 outAnthropic Direct
gpt-5.5$5.00 in·$30.00 outOpenAI Direct
gemini-3.1-pro-preview$4.00 in·$18.00 outGoogle Direct
deepseek-v4-pro$0.560 in·$1.12 outDeepSeek
qwen3.6-plus$0.500 in·$3.00 outAlibaba Cloud
kimi-k2.6$0.900 in·$3.75 outMoonshot
seedance-2.0from $0.07 /sec·ByteDance
claude-opus-4-7$5.00 in·$25.00 outAnthropic Direct
claude-sonnet-4-6$3.00 in·$15.00 outAnthropic Direct
gpt-5.5$5.00 in·$30.00 outOpenAI Direct
gemini-3.1-pro-preview$4.00 in·$18.00 outGoogle Direct
deepseek-v4-pro$0.560 in·$1.12 outDeepSeek
qwen3.6-plus$0.500 in·$3.00 outAlibaba Cloud
kimi-k2.6$0.900 in·$3.75 outMoonshot
seedance-2.0from $0.07 /sec·ByteDance
0%
routing markup. ever.
~40%
avg cost reduction vs list price
200+
models across 10+ providers
<1ms
routing overhead added
تبني باستخدامه؟ تواصل معنا.
ملاحظاتك تشكّل الإصدار القادم.
How routing works

Pass-through by design.

Direct provider routing

Your traffic flows to the upstream provider with no shadow proxy in between. The provider you see in your logs is the one that served the request.

Provider terms apply end-to-end

Each upstream provider's data and usage terms apply directly to your traffic. Pick the provider that matches your policy.

🧾

Per-request auditability

Every call records the provider, published price, and route taken. Reproduce any routing decision later from the dashboard.


Setup

Live in 60 seconds.

One URL change. Your existing SDK, model names, and streaming all work exactly as before.

Step 1
🔗

Point your SDK at us

Set base_url to api.orcarouter.ai/v1 and swap your API key. No other code changes needed.

Step 2

We route in real-time

Each request checks live provider health and prices in under 1ms. We pick the cheapest available direct backend for that model.

Step 3

You pay provider cost

Traffic goes direct to the provider. You pay their published rate to them. We add exactly $0 on top — our fee is on the plan, not your tokens.


Live pricing

Every model.
Best available rate.

Real-time routing to the cheapest live backend. Prices refresh every 60s.

View all 200+ models →
ModelRouted toInput /MOutput /MContextQuality
claude-opus-4-7Anthropic Direct$5.00$25.001M10.0
claude-sonnet-4-6Anthropic Direct$3.00$15.001M7.0
gpt-5.5OpenAI Direct$5.00$30.001M10.0
gemini-3.1-pro-previewGoogle Direct$4.00$18.001M10.0
deepseek-v4-proDeepSeek$0.560$1.121M9.0
qwen3.6-plusAlibaba Cloud$0.500$3.001M8.0
kimi-k2.6Moonshot$0.900$3.75256K9.0
seedance-2.0ByteDancefrom $0.07 /sec10.0
+ 194 more models · Prices update every 60 seconds
Platform

Production-grade from day one.

Everything you need to run AI in production without managing multiple provider integrations.

Real-time price routing

Provider prices refresh every 60s. Every request hits the cheapest available backend automatically.

Automatic failover

Provider goes down mid-stream? We switch transparently. Your app sees zero errors.

🔑

API key management

Issue keys per team or service with spend caps, model allowlists, and rate limits built in.

$

Per-request cost tracking

See exactly what every request cost, which provider handled it, and how much you saved.

OpenAI-compatible

Change one line. Same SDK, same model names, same streaming format. Zero migration effort.

🛡

Budget enforcement

Hard and soft limits per key, team, or org. Auto-resets monthly. Slack + webhook alerts.


Differentiator

Glass-box routing.

Every request shows you the provider that served it, the published price they charged, and the route taken. Verifiable per call, reproducible later.

🔍

Per-request provider attribution

Each completion is tagged with the upstream provider — Anthropic Direct, OpenAI Direct, Bedrock, Vertex — and surfaced in your dashboard and headers.

📒

Published-rate ledger

Every token charge equals the provider's public list price. Audit any request against the provider's own pricing page in seconds.

Routing decisions you can replay

Failover events, health swaps, and cheapest-backend choices are logged with timestamps. Reproduce any request's routing path.


Pricing

Routing is free.
Pay for features.

We never take a cut of your token spend. Our revenue comes from optional team features.

Zero markup guarantee
You pay providers directly at their published rates. We add nothing on top of token costs. Routing is free; the optional Team plan funds the platform.
$0.00routing fee

Hacker

Free
Forever. Zero markup on all tokens.
✓ 3 API keys
✓ All 200+ models
✓ Auto failover
✓ Basic dashboard
Start free

Enterprise

Custom
SLA commitments + private deployment.
✓ Private deployment option
✓ Custom routing rules
✓ 99.99% uptime SLA
✓ Dedicated support
✓ Audit logs & compliance

Start routing smarter.

Sign up with GitHub — $5 in tokens free. No credit card required. Swap one line of code and you're live.

© 2026 OrcaRouter