이 페이지가 답하는 것
Qwen2.5-VL-32B is a multimodal vision-language model fine-tuned through reinforcement learning for enhanced mathematical reasoning, structured outputs, and visual problem-solving capabilities. It excels at visual analysi
- Alibaba Cloud · Qwen · qwen/qwen2.5-vl-32b-instruct
- text+image->text · 중국 모델 경로
- 128,000 context · US$0.20 input