Description
QwQ is the reasoning model of the Qwen series. Unlike conventional instruction-tuned models, QwQ is capable of thinking and reasoning, which can significantly improve performance on downstream tasks, especially hard problems. QwQ-32B is the medium-sized reasoning model in the series and achieves competitive performance against state-of-the-art reasoning models such as DeepSeek-R1 and o1-mini.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. The model ID for Qwen: QwQ 32B is qwen/qwq-32b.
curl https://api.ridvay.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "qwen/qwq-32b",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the Qwen: QwQ 32B model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'
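The same request can be sketched in Python using only the standard library. This is a minimal illustration, not an official client: the URL, headers, model ID, and parameters are copied from the curl example, while the helper names `build_chat_request` and `send_chat_request` are hypothetical, and the response shape is assumed to follow the usual OpenAI chat-completions format.

```python
import json
import urllib.request

# Values taken from the curl example above.
API_URL = "https://api.ridvay.com/v1/chat/completions"
MODEL_ID = "qwen/qwq-32b"


def build_chat_request(
    api_key: str,
    prompt: str,
    temperature: float = 0.7,
    max_tokens: int = 1024,
) -> urllib.request.Request:
    """Build (but do not send) a chat-completions POST request."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send_chat_request(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a real network call, so it is left commented out):
# request = build_chat_request("YOUR_API_KEY", "Explain the capabilities of the Qwen: QwQ 32B model")
# reply = send_chat_request(request)
# Assumption: the reply follows the OpenAI chat-completions shape.
# print(reply["choices"][0]["message"]["content"])
```

Building the request separately from sending it makes the payload easy to inspect or log before any tokens are billed.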
Supported Modalities
- Text
API Pricing
- Input: $0.075 / 1M tokens
- Output: $0.15 / 1M tokens
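As a rough illustration of how the per-million-token rates translate into a per-request cost, the sketch below multiplies token counts by the listed prices. The function name `estimate_cost_usd` is hypothetical, and the calculation assumes billing is strictly linear in tokens with no minimums, rounding, or tier discounts.

```python
from decimal import Decimal

# Rates from the pricing list above, in USD per 1M tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.075")
OUTPUT_PRICE_PER_MILLION = Decimal("0.15")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request, assuming purely linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        INPUT_PRICE_PER_MILLION * input_tokens
        + OUTPUT_PRICE_PER_MILLION * output_tokens
    )
    return total / TOKENS_PER_MILLION
```

For example, a request with 2,000 input tokens and 1,024 output tokens works out to (0.075 × 2,000 + 0.15 × 1,024) / 1,000,000 = $0.0003036 under these assumptions.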
Token Limits
- Max Output: 131,072 tokens
- Max Context: 131,072 tokens
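Because the two limits are equal, a long prompt may leave less room for output than the stated maximum. The sketch below clamps a requested `max_tokens` value accordingly. It assumes the prompt and the generated output share the same 131,072-token context window, which this page does not state explicitly; the function name `clamp_max_tokens` is hypothetical.

```python
# Limits from the list above.
MAX_CONTEXT_TOKENS = 131_072
MAX_OUTPUT_TOKENS = 131_072


def clamp_max_tokens(prompt_tokens: int, requested_max_tokens: int) -> int:
    """Return a max_tokens value that fits the limits.

    Assumption: prompt tokens and output tokens count against the same
    context window.
    """
    if prompt_tokens < 0 or requested_max_tokens < 1:
        raise ValueError("prompt_tokens must be >= 0 and requested_max_tokens >= 1")
    remaining = MAX_CONTEXT_TOKENS - prompt_tokens
    if remaining < 1:
        raise ValueError("prompt leaves no room for output in the context window")
    return min(requested_max_tokens, MAX_OUTPUT_TOKENS, remaining)
```

The prompt token count has to come from a tokenizer compatible with the model; that step is outside the scope of this sketch.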
Subscription Tiers
- free
- pro
- ultimate