NVIDIA: Nemotron Nano 9B V2

nvidia/nemotron-nano-9b-v2

Try model on Ridvay Chat Get Ridvay API Keys

Description

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so.

API Usage Examples

OpenAI Compatible Endpoint

Use this endpoint with any OpenAI-compatible library. Model: NVIDIA: Nemotron Nano 9B V2 (nvidia/nemotron-nano-9b-v2)

curl https://api.ridvay.com/v1/chat/completions   -H "Content-Type: application/json"   -H "Authorization: Bearer YOUR_API_KEY"   -d '{
    "model": "nvidia/nemotron-nano-9b-v2",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the NVIDIA: Nemotron Nano 9B V2 model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'

Supported Modalities

Text

API Pricing

Input: 0.04$ / 1M tokens
Output: 0.16$ / 1M tokens

Token Limits

Max Output: 131,072 tokens
Max Context: 131,072 tokens

Subscription Tiers

free
pro
ultimate

NVIDIA: Nemotron Nano 9B V2

Description

API Usage Examples

OpenAI Compatible Endpoint

Supported Modalities

API Pricing

Token Limits

Subscription Tiers

More from nvidia

NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

NVIDIA: Nemotron Nano 12B 2 VL

NVIDIA: Nemotron 3 Nano 30B A3B