NVIDIA: Llama 3.1 Nemotron 70B Instruct

nvidia/llama-3.1-nemotron-70b-instruct

Try model on Ridvay Chat Get Ridvay API Keys

Description

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels in automatic alignment benchmarks. This model is tailored for applications requiring high accuracy in helpfulness and response generation, suitable for diverse user queries across multiple domains. Usage of this model is subject to [Meta's Acceptable Use Policy](https://www.llama.com/llama3/use-policy/).

API Usage Examples

OpenAI Compatible Endpoint

Use this endpoint with any OpenAI-compatible library. Model: NVIDIA: Llama 3.1 Nemotron 70B Instruct (nvidia/llama-3.1-nemotron-70b-instruct)

curl https://api.ridvay.com/v1/chat/completions   -H "Content-Type: application/json"   -H "Authorization: Bearer YOUR_API_KEY"   -d '{
    "model": "nvidia/llama-3.1-nemotron-70b-instruct",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the NVIDIA: Llama 3.1 Nemotron 70B Instruct model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'

Supported Modalities

Text

API Pricing

Input: 1.2$ / 1M tokens
Output: 1.2$ / 1M tokens

Token Limits

Max Output: 16,384 tokens
Max Context: 131,072 tokens

Subscription Tiers

free
pro
ultimate

NVIDIA: Llama 3.1 Nemotron 70B Instruct

Description

API Usage Examples

OpenAI Compatible Endpoint

Supported Modalities

API Pricing

Token Limits

Subscription Tiers

More from nvidia

NVIDIA: Nemotron Nano 9B V2

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

NVIDIA: Nemotron Nano 12B 2 VL

NVIDIA: Nemotron 3 Nano 30B A3B