NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed to generate precise and useful responses. Leveraging the [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

Quality Score: 85/100 (composite of price, context, capability)
Input Price: $1.20 per 1M tokens
Output Price: $1.20 per 1M tokens
Context Window: 131,072 tokens
Model ID: nvidia/llama-3.1-nemotron-70b-instruct
Vendor: nvidia
Tokenizer: Llama3
Input Modalities: text
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no
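
The listed prices and limits lend themselves to a quick back-of-the-envelope check. The sketch below estimates the cost of a single request and tests whether it fits the listed limits. The function names are hypothetical, and it assumes that the 131,072-token context window covers input and output tokens combined, which the listing does not state.

```python
# Figures copied from the listing above.
INPUT_PRICE_PER_M = 1.20    # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 1.20   # USD per 1M output tokens
CONTEXT_WINDOW = 131_072    # tokens
MAX_OUTPUT = 16_384         # tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def fits_limits(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the listed limits.

    Assumption: the context window is shared by input and output tokens.
    """
    within_output_cap = output_tokens <= MAX_OUTPUT
    within_context = input_tokens + output_tokens <= CONTEXT_WINDOW
    return within_output_cap and within_context


if __name__ == "__main__":
    # Example request: 100,000 prompt tokens, 2,000 completion tokens.
    print(f"estimated cost: ${estimate_cost(100_000, 2_000):.4f}")
    print("fits limits:", fits_limits(100_000, 2_000))
```

Because input and output are priced identically here, the cost depends only on the total token count; the two rates are kept separate so the sketch still applies if they diverge.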
