What this page covers
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports a context window of up to 128K tokens, processes complex page layouts...
- Zhipu AI · z-ai/glm-4.6v
- text+image+video->text · Chinese model track
- 131,072-token context · US$0.30 input
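
The "128K" in the description and the 131,072 figure in the listing are the same number (128 × 1,024). A minimal sketch of a budget check against that window follows. The function name `fits_in_context` and the idea that input and reserved output share one window are assumptions for illustration, not something the listing states, and the token counts are expected to come from whatever tokenizer the caller already uses.

```python
# Context window from the listing: 128K tokens = 128 * 1024 = 131,072.
CONTEXT_WINDOW_TOKENS = 128 * 1024


def fits_in_context(prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """Return True if the prompt plus reserved output fits in the window.

    Assumption (hypothetical helper): input and output tokens are counted
    against a single shared window; the listing does not confirm this.
    """
    if prompt_tokens < 0 or reserved_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return prompt_tokens + reserved_output_tokens <= CONTEXT_WINDOW_TOKENS
```

For example, `fits_in_context(120_000, reserved_output_tokens=8_000)` checks whether a 120,000-token prompt still leaves room for an 8,000-token reply.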