What this page answers
Gemma 3 introduces multimodality, supporting vision-language input and text output. It handles context windows of up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities.
- Google · google/gemma-3-12b-it
- text+image->text · global model route
- 131,072-token context · US$0.04 input