这页解决什么
Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysi
- 阿里云 · 通义 · qwen/qwen2.5-vl-32b-instruct
- text+image->text · 中国模型路线
- 128,000 context · US$0.20 input