ماذا تجيب هذه الصفحة
Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interle
- Alibaba Cloud · Qwen · qwen/qwen3-vl-8b-instruct
- text+image->text · مسار نموذج الصين
- 131,072 context · 0.08 US$ input