이 페이지가 답하는 것
GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results i
- Zhipu AI (GLM) · z-ai/glm-4.5v
- text+image->text · 중국 모델 경로
- 65,536 context · US$0.60 input