Qwen3 VL 8B Instruct

qwen/qwen3-vl-8b-instruct
by Qwen · text+image+video in · text out · 131.1K ctx · 2025-10-14

Qwen3-VL 8B Instruct — open-weight small vision-language model, 8B params, 128k context, no thinking…

INPUT$0.18/ 1M tokens
OUTPUT$0.70/ 1M tokens
p50 TTFT1.12 s7d
p95 TTFT1.74 s7d
TRAFFIC16.3Ktokens / 7d
Code samples
from openai import OpenAI

client = OpenAI(
    base_url="https://orcarouter.ai/v1",
    api_key="$ORCAROUTER_API_KEY",
)

response = client.chat.completions.create(
    model="qwen/qwen3-vl-8b-instruct",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
Pricing
Input / 1M tokens$0.180
Output / 1M tokens$0.700
CurrencyUSD
Performance
TTFT p50
1.12 s
Output speed
18.6 tok/s
TTFT p95
1.74 s
Error rate
2.9%
Public benchmarks
AA Coding
7.3
AA Intelligence
14.3
AA Math
27.3
AIME 2025
27.3
GPQA Diamond
42.7
Humanity's Last Exam
2.9
IFBench
32.3
LiveCodeBench
33.2
Long-Context Recall
15.3
MMLU-Pro
68.6
SciCode
17.4
TerminalBench Hard
2.3
τ²-Bench
29.2
Source: artificialanalysis.ai
FAQ
How much does Qwen: Qwen3 VL 8B Instruct cost on OrcaRouter?
Qwen: Qwen3 VL 8B Instruct is priced at $0.18 per 1M input tokens and $0.70 per 1M output tokens via OrcaRouter. Pricing is pulled live from the routing layer.
What is Qwen: Qwen3 VL 8B Instruct's context window?
Qwen: Qwen3 VL 8B Instruct supports a context window of 131K tokens. Use long-context features (RAG, summarisation) up to that limit.
How do I call Qwen: Qwen3 VL 8B Instruct via the OpenAI SDK?
Set OpenAI base_url to https://orcarouter.ai/v1, supply your OrcaRouter API key, and pass model="qwen/qwen3-vl-8b-instruct" in the chat.completions.create call.
Does OrcaRouter rate-limit Qwen: Qwen3 VL 8B Instruct?
Per-model rate limits follow your OrcaRouter plan. Free tiers ship with conservative caps; paid tiers lift them. Check /pricing for current quotas.
Embed this badge
Qwen: Qwen3 VL 8B Instruct$0.18/M in1118ms p50via OrcaRouter
HTML <a href="https://www.orcarouter.ai/models/qwen/qwen3-vl-8b-instruct" target="_blank"> <img src="https://www.orcarouter.ai/embed/qwen/qwen3-vl-8b-instruct.svg" alt="Qwen: Qwen3 VL 8B Instruct on OrcaRouter" /> </a>
Markdown [![Qwen: Qwen3 VL 8B Instruct](https://www.orcarouter.ai/embed/qwen/qwen3-vl-8b-instruct.svg)](https://www.orcarouter.ai/models/qwen/qwen3-vl-8b-instruct)