Qwen3-VL 8B Thinking — open-weight small vision-language reasoning model, 8B params, 128k context.
from openai import OpenAI
client = OpenAI(
base_url="https://orcarouter.ai/v1",
api_key="$ORCAROUTER_API_KEY",
)
response = client.chat.completions.create(
model="qwen/qwen3-vl-8b-thinking",
messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)| Input / 1M tokens | $0.180 |
| Output / 1M tokens | $2.10 |
| Currency | USD |