このページで分かること
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer
- NVIDIA · nvidia/nemotron-nano-12b-v2-vl
- text+image+video->text · グローバルモデルルート
- 131,072 context · $0.20 input