DeepSeek V4 Flash efficient MoE — 284B total / 13B active params, 1M context, optimized for fast eve…
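import os

from openai import OpenAI

# The API key is read from the ORCAROUTER_API_KEY environment variable;
# a literal "$ORCAROUTER_API_KEY" string is not expanded by Python.
client = OpenAI(
    base_url="https://orcarouter.ai/v1",
    api_key=os.environ["ORCAROUTER_API_KEY"],
)

response = client.chat.completions.create(
    model="deepseek/deepseek-v4-flash",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)

| Pricing | Value |
| --- | --- |
| Input / 1M tokens | $0.190 |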
| Output / 1M tokens | $0.370 |
| Cache read / 1M | $0.0037 |
| Currency | USD |
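
As a rough illustration of how the per-token rates above combine, the sketch below estimates the cost of a single request. The helper name `estimate_cost_usd` and its parameters are hypothetical, and it assumes that cached input tokens are billed at the cache-read rate in place of the regular input rate, which the listing does not spell out.

```python
# Prices in USD per 1M tokens, copied from the pricing rows above.
INPUT_PER_M = 0.190
OUTPUT_PER_M = 0.370
CACHE_READ_PER_M = 0.0037


def estimate_cost_usd(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Hypothetical helper: estimate the USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens served from cache,
    billed at the cache-read rate instead of the input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * INPUT_PER_M
        + cached_tokens * CACHE_READ_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: 200k input tokens (half of them cached) and 5k output tokens.
    print(f"${estimate_cost_usd(200_000, 5_000, cached_tokens=100_000):.6f}")
```

Actual billing may round or meter differently, so treat the result as an estimate and compare it against the usage reported on real responses.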