Description
gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. Model: OpenAI: gpt-oss-120b (openai/gpt-oss-120b)
curl https://api.ridvay.com/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer YOUR_API_KEY" -d '{
"model": "openai/gpt-oss-120b",
"messages": [
{
"role": "user",
"content": "Explain the capabilities of the OpenAI: gpt-oss-120b model"
}
],
"temperature": 0.7,
"max_tokens": 1024
}'
Supported Modalities
- Text
API Pricing
- Input: 0.072$ / 1M tokens
- Output: 0.28$ / 1M tokens
Token Limits
- Max Output: 131,000 tokens
- Max Context: 131,000 tokens
Subscription Tiers
- free
- pro
- ultimate