O que esta página responde
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interle
- Alibaba Cloud · Qwen · qwen/qwen3-vl-8b-instruct
- text+image->text · rota de modelo da China
- 131.072 context · US$ 0,08 input