Note: direct via Alibaba, a Chinese entity - privacy and logging guarantees may be limited. Qwen3 Vision‑Language model (235B MoE, ≈22B active) tuned for instruction following and grounded visual QA. Excels at image understanding, dense OCR, charts and diagrams, and multi‑image context. Use this variant when you want concise, direct answers grounded in the visuals.
Added Sep 25, 2025
Context Window
N/A
Max Output
32.8K
Input Price (Auto)
$0.50/1M
Output Price (Auto)
$1.20/1M
Capabilities
Performance metrics and benchmarks
Sourced from Artificial Analysis.
Intelligence Index
20.8
Auto routing is available for this model. Explicit provider selection is not available.
Loading provider options…
Coding Index
16.5
GPQA Diamond
Graduate-level scientific reasoning
71.2%
Better than 64% of models compared
HLE
Humanity's Last Exam
6.3%
Better than 56% of models compared
IFBench
Instruction-following benchmark
42.7%
Better than 53% of models compared
T²-Bench Telecom
Conversational AI agents in dual-control scenarios
35.1%
Better than 51% of models compared
AA-LCR
Long context reasoning evaluation
31.7%
Better than 52% of models compared
SciCode
Python programming for scientific computing
35.9%
Better than 67% of models compared
Terminal-Bench Hard
Agentic coding and terminal use
6.8%
Better than 43% of models compared
AIME 2025
American Invitational Mathematics Examination 2025
70.7%
Better than 65% of models compared
MMLU-Pro
Professional and academic subject knowledge
82.3%
Better than 80% of models compared
Last updated May 15, 2026
Artificial AnalysisLiveCodeBench
Contamination-free coding benchmark
59.4%
Better than 66% of models compared