这页解决什么
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It e
- 阿里云 · 通义 · qwen/qwen3-vl-30b-a3b-instruct
- text+image->text · 中国模型路线
- 131,072 context · US$0.13 input