Qwen3 VL 8B Thinking

qwen/qwen3-vl-8b-thinking
by Qwen · text+image+video in · text out · 131.1K ctx · 2025-10-14

Qwen3-VL 8B Thinking — open-weight small vision-language reasoning model, 8B params, 128k context.

INPUT$0.18/ 1M tokens
OUTPUT$2.10/ 1M tokens
p50 TTFT5.41 s7d
p95 TTFT19.02 s7d
TRAFFIC62.3Ktokens / 7d
Code samples
from openai import OpenAI

client = OpenAI(
    base_url="https://orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="qwen/qwen3-vl-8b-thinking",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
Pricing
Input / 1M tokens$0.180
Output / 1M tokens$2.10
CurrencyUSD
Performance
TTFT p50
5.41 s
Output speed
609 tok/s
TTFT p95
19.02 s
Error rate
20.0%
Public benchmarks
No public benchmark scores ingested yet.
FAQ
How much does Qwen: Qwen3 VL 8B Thinking cost on OrcaRouter?
Qwen: Qwen3 VL 8B Thinking is priced at $0.18 per 1M input tokens and $2.10 per 1M output tokens via OrcaRouter. Pricing is pulled live from the routing layer.
What is Qwen: Qwen3 VL 8B Thinking's context window?
Qwen: Qwen3 VL 8B Thinking supports a context window of 131K tokens. Use long-context features (RAG, summarisation) up to that limit.
How do I call Qwen: Qwen3 VL 8B Thinking via the OpenAI SDK?
Set OpenAI base_url to https://orcarouter.ai/v1, supply your OrcaRouter API key, and pass model="qwen/qwen3-vl-8b-thinking" in the chat.completions.create call.
Does OrcaRouter rate-limit Qwen: Qwen3 VL 8B Thinking?
Per-model rate limits follow your OrcaRouter plan. Free tiers ship with conservative caps; paid tiers lift them. Check /pricing for current quotas.
Embed this badge
Qwen: Qwen3 VL 8B Thinking$0.18/M in5405ms p50via OrcaRouter
HTML <a href="https://www.orcarouter.ai/models/qwen/qwen3-vl-8b-thinking" target="_blank"> <img src="https://www.orcarouter.ai/embed/qwen/qwen3-vl-8b-thinking.svg" alt="Qwen: Qwen3 VL 8B Thinking on OrcaRouter" /> </a>
Markdown [![Qwen: Qwen3 VL 8B Thinking](https://www.orcarouter.ai/embed/qwen/qwen3-vl-8b-thinking.svg)](https://www.orcarouter.ai/models/qwen/qwen3-vl-8b-thinking)