Description
NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully open with open-weights, datasets and recipes so developers can easily customize, optimize, and deploy the model on their infrastructure for maximum privacy and security. Note: For the free endpoint, all prompts and output are logged to improve the provider's model and its product and services. Please do not upload any personal, confidential, or otherwise sensitive information. This is a trial use only. Do not use for production or business-critical systems.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. Model: NVIDIA: Nemotron 3 Nano 30B A3B (nvidia/nemotron-3-nano-30b-a3b)
curl https://api.ridvay.com/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer YOUR_API_KEY" -d '{
"model": "nvidia/nemotron-3-nano-30b-a3b",
"messages": [
{
"role": "user",
"content": "Explain the capabilities of the NVIDIA: Nemotron 3 Nano 30B A3B model"
}
],
"temperature": 0.7,
"max_tokens": 1024
}'Supported Modalities
- Text
API Pricing
- Input: 0.06$ / 1M tokens
- Output: 0.24$ / 1M tokens
Token Limits
- Max Output: 262,144 tokens
- Max Context: 262,144 tokens
Subscription Tiers
- free
- pro
- ultimate