このページで分かること
Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysi
- Alibaba Cloud · Qwen · qwen/qwen2.5-vl-32b-instruct
- text+image->text · 中国モデルルート
- 128,000 context · $0.20 input