NVIDIA: Llama 3.1 Nemotron 70B Instruct
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed to generate precise and useful responses. Built on the [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and tuned with Reinforcement Learning from Human Feedback (RLHF), it excels...
Quality Score: 85/100 (composite of price, context, and capability)
Input Price: $1.20 per 1M tokens
Output Price: $1.20 per 1M tokens
Context Window: 131,072 tokens
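As a rough illustration of how the listed per-token prices translate into a per-request cost, the arithmetic can be sketched as below. The function name `estimate_cost_usd` and the example token counts are hypothetical; only the two $1.20 per 1M token rates come from this listing, and actual billing may differ.

```python
# Prices taken from the listing above (USD per 1M tokens).
INPUT_PRICE_PER_M = 1.20
OUTPUT_PRICE_PER_M = 1.20


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper for illustration; it assumes simple linear
    per-token pricing with no minimums, discounts, or extra fees.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

Because the input and output rates are identical for this model, the cost depends only on the total number of tokens processed.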
- Model ID: nvidia/llama-3.1-nemotron-70b-instruct
- Vendor: nvidia
- Tokenizer: Llama3
- Input Modalities: text
- Output Modalities: text
- Max Output: 16,384 tokens
- Tool Calling: ✓ supported
- Structured Output: ✓ supported
- Reasoning Mode: not supported
- Vision: text only
- Audio: no
- Moderated: no
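The context window and max output figures can be combined into a simple token budget check before sending a request. The sketch below assumes, as is common but not stated in this listing, that the prompt and the generated output share the 131,072-token context window; the function names are hypothetical.

```python
# Limits taken from the listing above.
CONTEXT_WINDOW = 131_072  # total tokens (assumed: prompt + output combined)
MAX_OUTPUT = 16_384       # maximum tokens the model may generate


def max_prompt_tokens(requested_output: int) -> int:
    """Largest prompt that still leaves room for the requested output."""
    if not 0 < requested_output <= MAX_OUTPUT:
        raise ValueError(f"requested_output must be in 1..{MAX_OUTPUT}")
    return CONTEXT_WINDOW - requested_output


def fits_context(prompt_tokens: int, requested_output: int) -> bool:
    """True if the request respects both the output cap and the window."""
    if prompt_tokens < 0 or not 0 < requested_output <= MAX_OUTPUT:
        return False
    return prompt_tokens + requested_output <= CONTEXT_WINDOW


if __name__ == "__main__":
    print(max_prompt_tokens(16_384))
    print(fits_context(120_000, 16_384))
```

Token counts here are assumed to be measured with the model's own tokenizer (Llama3, per the listing); counts from a different tokenizer would only approximate the budget.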
Similar models
- NVIDIA: Nemotron Nano 9B V2 (free); Free; 128,000 ctx; quality score 84
- NVIDIA: Nemotron 3 Nano 30B A3B (free); Free; 256,000 ctx; quality score 88
- NVIDIA: Nemotron Nano 12B 2 VL (free); Free; 128,000 ctx; quality score 89
- NVIDIA: Nemotron Nano 12B 2 VL; $0.20 in / $0.60 out; 131,072 ctx; quality score 91
- NVIDIA: Llama 3.3 Nemotron Super 49B V1.5; $0.10 in / $0.40 out; 131,072 ctx; quality score 91
- NVIDIA: Nemotron Nano 9B V2; $0.04 in / $0.16 out; 131,072 ctx; quality score 91