Description
The gpt-4o-audio-preview model adds support for audio inputs as prompts. This enhancement allows the model to detect nuances within audio recordings and add depth to generated user experiences. Audio outputs are currently not supported. Audio tokens are priced at $40 per million input audio tokens.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. Model: OpenAI: GPT-4o Audio (openai/gpt-4o-audio-preview)
curl https://api.ridvay.com/v1/chat/completions -H "Content-Type: application/json" -H "Authorization: Bearer YOUR_API_KEY" -d '{
"model": "openai/gpt-4o-audio-preview",
"messages": [
{
"role": "user",
"content": "Explain the capabilities of the OpenAI: GPT-4o Audio model"
}
],
"temperature": 0.7,
"max_tokens": 1024
}'
Supported Modalities
- Text
API Pricing
- Input: 2.5$ / 1M tokens
- Output: 10$ / 1M tokens
Token Limits
- Max Output: 16,384 tokens
- Max Context: 128,000 tokens
Subscription Tiers
- free
- pro
- ultimate