qwen/qwen3.5-flash

by qwen · text+image+video in · text out · 1.05M ctx

Qwen3.5 Flash — multimodal chat (text/image/video) optimized for cost, 1M context.

INPUT $0.10 / 1M tokens
OUTPUT $0.40 / 1M tokens
p50 TTFT 3.65 s (7d)
p95 TTFT 117.27 s (7d)
TRAFFIC 147.1K tokens / 7d
Code samples
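import os

from openai import OpenAI

# Point the OpenAI SDK at OrcaRouter's OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://orcarouter.ai/v1",
    # Read the key from the environment; a literal "$ORCAROUTER_API_KEY"
    # string would be sent as-is and rejected.
    api_key=os.environ["ORCAROUTER_API_KEY"],
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="qwen/qwen3.5-flash",
    messages=[{"role": "user", "content": "Hello"}],
)
# Print the text of the first returned choice.
print(response.choices[0].message.content)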
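Since the listing advertises image input, a request with an image might look like the sketch below. It assumes OrcaRouter accepts the OpenAI-style `image_url` content part for this model, which this page does not confirm. `build_image_message` is a hypothetical helper and the example URL is a placeholder.

```python
def build_image_message(prompt: str, image_url: str) -> dict:
    """Build one user message with a text part and an image part.

    Uses the OpenAI chat-completions content-part layout. Whether
    OrcaRouter forwards it unchanged to this model is an assumption.
    """
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": prompt},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }


# Placeholder URL for illustration only.
message = build_image_message(
    "Describe this picture.", "https://example.com/cat.png"
)

# With the client from the sample above, the call would be:
# response = client.chat.completions.create(
#     model="qwen/qwen3.5-flash",
#     messages=[message],
# )
```

Keeping the payload construction separate from the network call makes it easy to inspect or log the message before sending it.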
Pricing
Input / 1M tokens: $0.100
Output / 1M tokens: $0.400
Currency: USD
Performance
TTFT p50: 3.65 s
Output speed: 3513 tok/s
TTFT p95: 117.27 s
Error rate: 15.4%
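As a rough illustration of what these figures imply, the sketch below estimates end-to-end response time as time-to-first-token plus generation time. It takes the listed numbers at face value and assumes a constant output speed, which real traffic will not match exactly. `estimate_latency_s` is a hypothetical helper.

```python
# Figures copied from the Performance listing above.
TTFT_P50_S = 3.65
TTFT_P95_S = 117.27
OUTPUT_TOK_PER_S = 3513


def estimate_latency_s(output_tokens: int, ttft_s: float) -> float:
    """Estimate total seconds: first-token wait plus generation time."""
    return ttft_s + output_tokens / OUTPUT_TOK_PER_S


# A 1,000-token reply at the median and at the 95th-percentile TTFT.
print(round(estimate_latency_s(1_000, TTFT_P50_S), 2))  # 3.93
print(round(estimate_latency_s(1_000, TTFT_P95_S), 2))  # 117.55
```

At these numbers the wait for the first token dominates, so a client timeout is probably better sized against the p95 TTFT than against reply length.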
Public benchmarks
No public benchmark scores ingested yet.
FAQ
How much does qwen/qwen3.5-flash cost on OrcaRouter?
qwen/qwen3.5-flash is priced at $0.10 per 1M input tokens and $0.40 per 1M output tokens via OrcaRouter. Pricing is pulled live from the routing layer.
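To make the per-token pricing concrete, here is a minimal cost estimate using the listed rates. `estimate_cost_usd` is a hypothetical helper, and because prices are described as live, the constants may go stale.

```python
# Rates copied from the listing (USD per 1M tokens); they may change.
INPUT_USD_PER_M = 0.10
OUTPUT_USD_PER_M = 0.40


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Example: a 200,000-token prompt with a 2,000-token reply.
print(f"${estimate_cost_usd(200_000, 2_000):.4f}")  # $0.0208
```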
What is qwen/qwen3.5-flash's context window?
qwen/qwen3.5-flash supports a context window of roughly 1M tokens (listed as 1.05M). Long-context workloads such as RAG and summarisation can use prompts up to that limit.
How do I call qwen/qwen3.5-flash via the OpenAI SDK?
Set the OpenAI client's base_url to https://orcarouter.ai/v1, supply your OrcaRouter API key as api_key, and pass model="qwen/qwen3.5-flash" in the chat.completions.create call.
Does OrcaRouter rate-limit qwen/qwen3.5-flash?
Per-model rate limits follow your OrcaRouter plan. Free tiers ship with conservative caps; paid tiers lift them. Check /pricing for current quotas.
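If a request does hit a rate limit, one common client-side response is to retry with exponential backoff. The sketch below is a generic pattern, not an OrcaRouter feature. `call_with_backoff` and its parameters are hypothetical, and it catches every exception for brevity, where real code would narrow this to the SDK's rate-limit error (for the OpenAI Python SDK, `openai.RateLimitError`).

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_backoff(
    fn: Callable[[], T],
    max_attempts: int = 5,
    base_delay_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying failures with exponential backoff plus jitter."""
    for attempt in range(max_attempts):
        try:
            return fn()
        except Exception:  # Assumption: narrow this in real code.
            if attempt == max_attempts - 1:
                raise  # Out of attempts: surface the last error.
            delay = base_delay_s * (2 ** attempt)  # 1 s, 2 s, 4 s, ...
            # Jitter of up to 50% keeps clients from retrying in lockstep.
            sleep(delay + random.uniform(0, delay / 2))
    raise ValueError("max_attempts must be at least 1")


# Usage with the client from the code sample:
# reply = call_with_backoff(
#     lambda: client.chat.completions.create(
#         model="qwen/qwen3.5-flash",
#         messages=[{"role": "user", "content": "Hello"}],
#     )
# )
```

The injectable `sleep` argument exists only so the helper can be exercised without real delays. The OpenAI SDK also has its own `max_retries` client option, which may be enough on its own.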
Embed this badge
qwen/qwen3.5-flash · $0.10/M in · 3654 ms p50 · via OrcaRouter
HTML <a href="https://www.orcarouter.ai/models/qwen/qwen3.5-flash" target="_blank"> <img src="https://www.orcarouter.ai/embed/qwen/qwen3.5-flash.svg" alt="qwen/qwen3.5-flash on OrcaRouter" /> </a>
Markdown [![qwen/qwen3.5-flash](https://www.orcarouter.ai/embed/qwen/qwen3.5-flash.svg)](https://www.orcarouter.ai/models/qwen/qwen3.5-flash)