nvidia
NVIDIA: Nemotron Nano 12B 2 VL
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
Quality Score: 91/100 (composite of price, context, capability)
Input Price: $0.20 per 1M tokens
Output Price: $0.60 per 1M tokens
Context Window: 131,072 tokens
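To make the per-token pricing concrete, here is a minimal sketch of the arithmetic for a single request. The prices and context window are copied from the listing; the function names are hypothetical, and two things are assumptions the listing does not state: that image and video inputs are billed as ordinary input tokens, and that prompt and completion tokens share the same context window.

```python
# Figures copied from the listing (USD per 1M tokens, window in tokens).
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 0.60
CONTEXT_WINDOW = 131_072


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: every input token, including any produced from images
    or video, is billed at the listed input price.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the context window (hypothetical helper).

    Assumption: prompt and completion tokens count against the same window.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW


if __name__ == "__main__":
    # 100,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost(100_000, 2_000):.4f}")  # $0.0212
    print(fits_context(100_000, 2_000))             # True
```

Under these assumptions, a request that nearly fills the window with input costs a few cents, and the output price matters mainly for long reasoning-mode completions.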
- Model ID: nvidia/nemotron-nano-12b-v2-vl
- Vendor: nvidia
- Tokenizer: Other
- Input Modalities: image, text, video
- Output Modalities: text
- Max Output: default
- Tool Calling: not supported
- Structured Output: ✓ supported
- Reasoning Mode: ✓ supported
- Vision: ✓ accepts images
- Audio: no
- Moderated: no
Similar models
- NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 (vendor: nvidia), $0.10 in / $0.40 out, 131,072 ctx, quality score 91
- NVIDIA: Nemotron Nano 9B V2 (vendor: nvidia), $0.04 in / $0.16 out, 131,072 ctx, quality score 91
- NVIDIA: Nemotron 3 Super (free) (vendor: nvidia), Free, 262,144 ctx, quality score 93
- NVIDIA: Nemotron Nano 12B 2 VL (free) (vendor: nvidia), Free, 128,000 ctx, quality score 89
- NVIDIA: Nemotron 3 Nano 30B A3B (free) (vendor: nvidia), Free, 256,000 ctx, quality score 88
- NVIDIA: Llama 3.1 Nemotron 70B Instruct (vendor: nvidia), $1.20 in / $1.20 out, 131,072 ctx, quality score 85