Qwen3 Max preview — proprietary chat preview, 256k context, thinking mode + function calling.
from openai import OpenAI
client = OpenAI(
base_url="https://orcarouter.ai/v1",
api_key="$ORCAROUTER_API_KEY",
)
response = client.chat.completions.create(
model="qwen/qwen3-max-preview",
messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)| Tier | Input / 1M tokens | Output / 1M tokens |
|---|---|---|
| ≤ 32K | $0.861 | $3.44 |
| ≤ 128K | $1.43 | $5.74 |
| ≤ 256K | $2.15 | $8.60 |
| Tier selected by input token count of each request | ||