Qwen3 Max — proprietary flagship chat model, 256k context, thinking mode + function calling.
from openai import OpenAI
client = OpenAI(
base_url="https://orcarouter.ai/v1",
api_key="$ORCAROUTER_API_KEY",
)
response = client.chat.completions.create(
model="qwen/qwen3-max",
messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)| Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| ≤ 32K | $0.359 | $1.43 |
| ≤ 128K | $0.574 | $2.29 |
| ≤ 256K | $1.00 | $4.01 |
| Tier selected by input token count of each request | ||