Description
Qwen2.5-Coder-7B-Instruct is a 7B-parameter, instruction-tuned language model optimized for code-related tasks such as code generation, code reasoning, and bug fixing. It is based on the Qwen2.5 architecture, which uses RoPE, SwiGLU, RMSNorm, and grouped-query attention (GQA), and it supports contexts of up to 128K tokens through YaRN-based extrapolation. It is trained on a large corpus of source code, synthetic data, and text-code grounding data, and is intended to perform well across programming languages and in agentic coding workflows. The model is part of the Qwen2.5-Coder family, works with serving tools such as vLLM for efficient deployment, and is released under the Apache 2.0 license.
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library.
Model: Qwen: Qwen2.5 Coder 7B Instruct (qwen/qwen2.5-coder-7b-instruct)
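As a rough Python counterpart to the curl request in this section, the sketch below builds the same JSON payload using only the standard library. The helper names build_chat_request and send_chat_request are illustrative and not part of any official client, and the default values simply mirror the curl example.

```python
import json
import urllib.request

# URL and model ID are taken from the curl example in this section.
API_URL = "https://api.ridvay.com/v1/chat/completions"
MODEL_ID = "qwen/qwen2.5-coder-7b-instruct"


def build_chat_request(api_key, prompt, temperature=0.7, max_tokens=1024):
    """Build (but do not send) a chat completion request.

    Hypothetical helper: the name and defaults are illustrative only.
    """
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send_chat_request(request, timeout=60):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

A call would look like send_chat_request(build_chat_request("YOUR_API_KEY", "Write a binary search in Go")). Assuming the response follows the usual OpenAI chat completion shape, the generated text would be found at reply["choices"][0]["message"]["content"]; that shape is an assumption here, not something this page states.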
curl https://api.ridvay.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "qwen/qwen2.5-coder-7b-instruct",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the Qwen: Qwen2.5 Coder 7B Instruct model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'
Supported Modalities
- Text
API Pricing
- Input: $0.03 / 1M tokens
- Output: $0.09 / 1M tokens
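To make the per-token arithmetic concrete, here is a small sketch that estimates the cost of a single request from the listed rates. It assumes billing is strictly linear in token count with no minimum charge or rounding, which this page does not confirm; estimate_cost_usd is a hypothetical helper name.

```python
from decimal import Decimal

# Listed rates in USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.03")
OUTPUT_USD_PER_MILLION = Decimal("0.09")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the request cost in USD, assuming purely linear billing."""
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_MILLION
    return (input_cost + output_cost) / TOKENS_PER_MILLION
```

For example, a request with 2,000 input tokens and 500 output tokens comes to (2,000 x 0.03 + 500 x 0.09) / 1,000,000 = $0.000105 under that assumption.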
Token Limits
- Max Output: 32,768 tokens
- Max Context: 32,768 tokens
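Because the output limit equals the context limit, a long prompt leaves less room for the reply than max_tokens alone suggests. The sketch below caps a requested max_tokens value accordingly. It assumes that prompt and output tokens share the same 32,768-token context window, which is typical for chat endpoints but not stated on this page. clamp_max_tokens is a hypothetical helper, and the prompt token count has to come from a tokenizer of your choice.

```python
# Limits as listed on this page.
MAX_CONTEXT_TOKENS = 32_768
MAX_OUTPUT_TOKENS = 32_768


def clamp_max_tokens(prompt_tokens, requested_max_tokens):
    """Return a max_tokens value that fits the listed limits.

    Assumes prompt and output tokens count against one shared context window.
    """
    if prompt_tokens >= MAX_CONTEXT_TOKENS:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = MAX_CONTEXT_TOKENS - prompt_tokens
    return min(requested_max_tokens, MAX_OUTPUT_TOKENS, remaining)
```

With a 30,000-token prompt, only 2,768 tokens remain, so a request for 4,096 output tokens would be capped at 2,768.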
Subscription Tiers
- free
- pro
- ultimate
