Description
NVIDIA Nemotron 3 Nano 30B A3B is a small mixture-of-experts (MoE) language model designed for high compute efficiency and accuracy, aimed at developers building specialized agentic AI systems. The model is fully open, with open weights, datasets, and recipes, so developers can customize, optimize, and deploy it on their own infrastructure for maximum privacy and security.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. Model: NVIDIA: Nemotron 3 Nano 30B A3B (nvidia/nemotron-3-nano-30b-a3b)
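For readers who prefer Python over curl, here is a minimal sketch that builds the same request using only the standard library. The URL, model ID, and parameters are taken from this section; the helper names build_request and send are hypothetical, and the assumption that the response follows the usual OpenAI chat-completions shape is not confirmed by this page.

```python
import json
import urllib.request

# Endpoint and model ID as listed in this section.
API_URL = "https://api.ridvay.com/v1/chat/completions"
MODEL = "nvidia/nemotron-3-nano-30b-a3b"


def build_request(api_key, prompt, temperature=0.7, max_tokens=1024):
    """Build (but do not send) a chat-completions POST request."""
    payload = {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return the decoded JSON body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a real network call, so it is left commented out):
# reply = send(build_request("YOUR_API_KEY", "Hello"))
# Assumption: OpenAI-style response layout.
# print(reply["choices"][0]["message"]["content"])
```

Keeping request construction separate from sending makes it easy to inspect the payload before spending tokens.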
curl https://api.ridvay.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "nvidia/nemotron-3-nano-30b-a3b",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the NVIDIA: Nemotron 3 Nano 30B A3B model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'
Supported Modalities
- Text
API Pricing
- Input: $0.05 / 1M tokens
- Output: $0.20 / 1M tokens
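To make the per-million rates concrete, the following sketch estimates the cost of a single request from its token counts. The rates are the ones listed above; the function name estimate_cost_usd is hypothetical, and the sketch assumes billing is strictly linear in tokens with no minimums or tier discounts, which this page does not state.

```python
from decimal import Decimal

# Listed rates, in USD per one million tokens.
INPUT_USD_PER_MILLION = Decimal("0.05")
OUTPUT_USD_PER_MILLION = Decimal("0.2")


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate request cost, assuming purely linear per-token billing."""
    million = Decimal(1_000_000)
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / million


# 200,000 input tokens and 50,000 output tokens:
# 200,000 * 0.05 / 1M = 0.01 and 50,000 * 0.2 / 1M = 0.01, so 0.02 USD.
print(estimate_cost_usd(200_000, 50_000))
```

Decimal is used instead of float so that small per-token amounts do not pick up binary rounding noise.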
Token Limits
- Max Output: 262,144 tokens
- Max Context: 262,144 tokens
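Because the output limit equals the context limit, a long prompt can leave less room for the reply than the listed maximum output suggests. The sketch below caps a requested max_tokens value accordingly. It assumes that prompt and output tokens share the same 262,144-token context window, which is common for chat APIs but not stated on this page; the function name clamp_max_tokens is hypothetical.

```python
# Limits as listed in this section.
MAX_OUTPUT_TOKENS = 262_144
MAX_CONTEXT_TOKENS = 262_144


def clamp_max_tokens(prompt_tokens, requested_max_tokens):
    """Cap max_tokens so prompt plus output fits the context window.

    Assumption: prompt and output tokens count against one shared window.
    """
    remaining = MAX_CONTEXT_TOKENS - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills or exceeds the context window")
    return min(requested_max_tokens, MAX_OUTPUT_TOKENS, remaining)


# A 262,000-token prompt leaves only 144 tokens for the reply.
print(clamp_max_tokens(262_000, 1024))
```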
Subscription Tiers
- free
- pro
- ultimate