Cheapest AI models — Best $/M tokens in 2026
The cheapest AI models per million tokens in 2026, ranked by blended price (weighted input + output). OrcaRouter routes at provider cost, with zero markup.
Top cheap models (quality-adjusted)
Ordered by MMLU-Pro points per dollar spent, quality-adjusted. Lower-quality models that need 5× the tokens to produce a usable answer rank lower than mid-tier models that solve the task in one shot.
- gemini-3.1-flash — $0.10 / $0.40 per 1M (in / out). MMLU-Pro 78. Best $/quality at the cheap end — ~10× cheaper than gpt-5.5 at ~85% of the quality.
- claude-haiku-4-5 — $0.20 / $1.00 per 1M. MMLU-Pro 76. Anthropic's cheap tier; better at instruction-following than gemini-flash, slightly weaker on math.
- gpt-5.5-mini — $0.15 / $0.60 per 1M. MMLU-Pro 80. OpenAI's cheap-fast tier; balanced. Strong default for 'just run something cheap' use cases.
- deepseek-v4-pro — $0.27 / $1.10 per 1M. MMLU-Pro 84. Cheapest model that crosses the GPT-4-equivalent quality bar. Open weights, so the price floor is bounded by hardware cost rather than a vendor's margin.
- qwen3.6-plus — $0.30 / $1.20 per 1M. MMLU-Pro 82. Best multilingual quality at this price point; competitive with Western frontier models on Chinese, Japanese, Korean.
- gemini-3.1-flash-8b — $0.04 / $0.15 per 1M. MMLU-Pro 65. Cheapest model on this list. Use only for high-volume classification, embeddings, or lightweight extraction.
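The blended prices the intro refers to can be reproduced with a small calculation. This is a sketch: the 3:1 input/output token mix is an assumed workload, not OrcaRouter's published weighting, and the prices and scores are simply the figures from the list above.

```python
# Blended $/1M price for each model in the list above.
# Assumption: a workload that is 75% input tokens / 25% output tokens;
# OrcaRouter's actual weighting may differ.
MODELS = {
    # name: (input $/1M, output $/1M, MMLU-Pro)
    "gemini-3.1-flash":    (0.10, 0.40, 78),
    "claude-haiku-4-5":    (0.20, 1.00, 76),
    "gpt-5.5-mini":        (0.15, 0.60, 80),
    "deepseek-v4-pro":     (0.27, 1.10, 84),
    "qwen3.6-plus":        (0.30, 1.20, 82),
    "gemini-3.1-flash-8b": (0.04, 0.15, 65),
}

def blended_price(price_in: float, price_out: float,
                  in_ratio: float = 0.75) -> float:
    """Weighted $/1M tokens for a workload that is `in_ratio` input tokens."""
    return in_ratio * price_in + (1 - in_ratio) * price_out

for name, (pin, pout, mmlu) in MODELS.items():
    print(f"{name:22s} ${blended_price(pin, pout):.3f}/1M  MMLU-Pro {mmlu}")
```

Note that a raw quality-per-dollar sort of these numbers would put gemini-3.1-flash-8b first; the quality-adjusted ranking above demotes it precisely because low one-shot success inflates real cost, which the next section quantifies.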
Why cheapest ≠ best value
A model that costs $0.04 per million tokens but takes 5 attempts to produce a correct answer ends up at $0.20 per task. A model that costs $0.20 per million tokens but nails it in one shot also ends up at $0.20 per task, with far lower latency. The right metric is dollars-per-completed-task, not dollars-per-token. The ranking above weights MMLU-Pro accuracy heavily because it correlates with one-shot success rate on real workloads.
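The arithmetic above generalizes to a simple expected-cost formula. A sketch, using the illustrative figures from the paragraph (1M tokens per attempt, retries modeled as independent trials, so expected attempts = 1/p):

```python
def cost_per_task(price_per_1m: float, tokens_per_attempt: int,
                  one_shot_success: float) -> float:
    """Expected dollars per completed task.

    With independent retries, the number of attempts is geometric,
    so the expected attempt count is 1 / one_shot_success.
    """
    expected_attempts = 1 / one_shot_success
    return price_per_1m * tokens_per_attempt / 1_000_000 * expected_attempts

# $0.04/1M model that succeeds only 1 in 5 attempts (p = 0.2):
cheap = cost_per_task(0.04, 1_000_000, 0.2)   # ≈ $0.20 per task
# $0.20/1M model that succeeds in one shot (p = 1.0):
solid = cost_per_task(0.20, 1_000_000, 1.0)   # ≈ $0.20 per task
```

Same dollars per task, but the one-shot model finishes in a fifth of the wall-clock time.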
OrcaRouter routing for cost
OrcaRouter automatically routes each request to the cheapest live backend serving the requested model. If you call gpt-5.5-mini and OpenAI is mid-incident, OrcaRouter retries against the next-cheapest provider serving that model — so your cost stays at the OpenAI rate during normal operations and degrades gracefully (same model, a slightly higher provider rate) during outages.
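The fallback behavior described above can be sketched as an ordered-retry loop. This is an illustration only: the provider names, prices, and the `send` callable are hypothetical, and OrcaRouter's real routing logic is internal to the service.

```python
from dataclasses import dataclass

@dataclass
class Backend:
    provider: str
    model: str
    price_in: float   # $ per 1M input tokens
    price_out: float  # $ per 1M output tokens

class BackendDown(Exception):
    """Raised when a provider is mid-incident."""

# Hypothetical backends serving the same model, with illustrative prices.
BACKENDS = [
    Backend("openai", "gpt-5.5-mini", 0.15, 0.60),
    Backend("azure", "gpt-5.5-mini", 0.17, 0.66),
    Backend("fallback-host", "gpt-5.5-mini", 0.20, 0.75),
]

def route(prompt: str, send) -> str:
    """Try backends cheapest-first; fall through to the next on an incident."""
    last_error = None
    for backend in sorted(BACKENDS, key=lambda b: b.price_in + b.price_out):
        try:
            return send(backend, prompt)
        except BackendDown as exc:
            last_error = exc  # provider down; try the next-cheapest
    raise RuntimeError("all backends down") from last_error
```

In normal operation the loop exits on the first (cheapest) backend, so you pay the base rate; only during an incident does traffic spill to the next rate up.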