What this page answers
GLM-4.6V is a large multimodal model designed for high-fidelity visual understanding and long-context reasoning across images, documents, and mixed media. It supports up to 128K tokens, processes complex page layouts...
- Zhipu AI (GLM) · z-ai/glm-4.6v
- text+image+video->text · Chinese model line
- 131,072-token context · US$0.30 input
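
The context and price figures above lend themselves to a quick budget check. The following is a minimal sketch, not an official calculator. It assumes the listed input price applies per 1,000,000 input tokens (the listing does not state the unit) and that any tokens reserved for output count against the same 131,072-token window. The function and constant names are hypothetical.

```python
# Figures taken from the listing above.
CONTEXT_WINDOW = 131_072   # tokens (128K = 128 * 1024)
INPUT_PRICE_USD = 0.30     # listed input price

# ASSUMPTION: the listing does not state the pricing unit;
# per 1,000,000 input tokens is assumed here.
ASSUMED_TOKENS_PER_PRICE_UNIT = 1_000_000


def fits_context(prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """Return True if the prompt plus reserved output fits in the window.

    ASSUMPTION: output tokens share the same context window as the prompt.
    """
    return prompt_tokens + reserved_output_tokens <= CONTEXT_WINDOW


def estimate_input_cost(prompt_tokens: int) -> float:
    """Estimate the input cost in USD under the assumed pricing unit."""
    return prompt_tokens * INPUT_PRICE_USD / ASSUMED_TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    tokens = 100_000
    print(fits_context(tokens, reserved_output_tokens=4_000))
    print(f"{estimate_input_cost(tokens):.4f} USD")
```

If the real pricing unit differs, only `ASSUMED_TOKENS_PER_PRICE_UNIT` needs to change.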