nvidia

NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

Try on OpenRouter → Estimate cost

Quality Score

85/100

composite of price, context, capability

Input Price

$1.20

per 1M tokens

Output Price

$1.20

per 1M tokens

Context Window

131,072

tokens

Model ID: nvidia/llama-3.1-nemotron-70b-instruct
Vendor: nvidia
Tokenizer: Llama3
Input Modalities: text
Output Modalities: text
Max Output: 16,384 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no

Similar models

nvidia

NVIDIA: Llama 3.1 Nemotron 70B Instruct

Similar models

NVIDIA: Nemotron Nano 9B V2 (free)

NVIDIA: Nemotron 3 Nano 30B A3B (free)

NVIDIA: Nemotron Nano 12B 2 VL (free)

NVIDIA: Nemotron Nano 12B 2 VL

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

NVIDIA: Nemotron Nano 9B V2