Description
For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It’s ideal for tasks like classification or autocompletion.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. Model: OpenAI: GPT-4.1 Nano (openai/gpt-4.1-nano)
curl https://api.ridvay.com/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer YOUR_API_KEY" -d '{
"model": "openai/gpt-4.1-nano",
"messages": [
{
"role": "user",
"content": "Explain the capabilities of the OpenAI: GPT-4.1 Nano model"
}
],
"temperature": 0.7,
"max_tokens": 1024
}'
Supported Modalities
- Text
- Images
API Pricing
- Input: 0.1$ / 1M tokens
- Output: 0.4$ / 1M tokens
Token Limits
- Max Output: 32,768 tokens
- Max Context: 1,047,576 tokens
Subscription Tiers
- free
- pro
- ultimate