Description
Ministral 8B is an 8B-parameter model that uses an interleaved sliding-window attention pattern for faster, more memory-efficient inference. Designed for edge use cases, it supports a context length of up to 128k tokens and performs well on knowledge and reasoning tasks. It outperforms peers in the sub-10B category, which makes it well suited to low-latency, privacy-first applications.
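To illustrate why sliding-window attention saves memory, here is a minimal sketch of a causal sliding-window mask. The function name and the window size are hypothetical and chosen only for illustration. They are not Ministral 8B's actual configuration, and the interleaving of layers is not modeled here.

```python
def sliding_window_mask(seq_len, window):
    """Return a seq_len x seq_len boolean mask.

    mask[i][j] is True when query position i may attend to key position j:
    j must not be in the future (j <= i) and must lie within the last
    `window` positions (i - j < window).
    """
    return [
        [0 <= i - j < window for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Illustrative values only: 5 tokens, each attending to at most 2 positions.
mask = sliding_window_mask(5, 2)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

Each row has at most `window` allowed positions, so the number of keys each token attends to stays bounded as the sequence grows, rather than growing with the full sequence length.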
API Usage Examples
OpenAI Compatible Endpoint
Use this endpoint with any OpenAI-compatible library. Model name: Mistral: Ministral 8B. Model ID: mistralai/ministral-8b
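As a rough sketch of calling the endpoint from code, the following uses only the Python standard library. The URL and model ID come from this page. The function names `build_chat_request` and `chat` are hypothetical, and the response parsing assumes the service returns the standard OpenAI chat completions shape, which this page does not confirm.

```python
import json
import urllib.request

# Endpoint and model ID are taken from this page; everything else
# (function names, defaults) is illustrative.
API_URL = "https://api.ridvay.com/v1/chat/completions"
MODEL_ID = "mistralai/ministral-8b"


def build_chat_request(api_key, prompt, temperature=0.7, max_tokens=1024):
    """Build (but do not send) a POST request for the chat completions endpoint."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def chat(api_key, prompt):
    """Send the request and return the text of the first reply.

    Assumption: the response follows the OpenAI chat completions shape,
    i.e. choices[0].message.content.
    """
    request = build_chat_request(api_key, prompt)
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

Calling `chat("YOUR_API_KEY", "Explain the capabilities of the Mistral: Ministral 8B model")` would send the same request as the curl command that follows.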
curl https://api.ridvay.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "mistralai/ministral-8b",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the Mistral: Ministral 8B model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'
Supported Modalities
- Text
API Pricing
- Input: $0.10 / 1M tokens
- Output: $0.10 / 1M tokens
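The per-token rates above translate into a request cost as follows. This is a minimal sketch: the function name `estimate_cost_usd` is hypothetical, and it assumes billing is strictly linear in token count with no minimum charge or rounding, which this page does not specify.

```python
from decimal import Decimal

# Rates from the pricing list: USD per 1M tokens.
INPUT_USD_PER_MILLION = Decimal("0.10")
OUTPUT_USD_PER_MILLION = Decimal("0.10")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request in USD.

    Assumption: cost scales linearly with the number of tokens.
    Decimal is used to avoid floating-point rounding in currency math.
    """
    input_cost = INPUT_USD_PER_MILLION * Decimal(input_tokens) / TOKENS_PER_MILLION
    output_cost = OUTPUT_USD_PER_MILLION * Decimal(output_tokens) / TOKENS_PER_MILLION
    return input_cost + output_cost
```

For example, a request with 1,000,000 input tokens and 500,000 output tokens comes to $0.10 + $0.05 = $0.15 under these assumptions.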
Token Limits
- Max Output: 131,072 tokens
- Max Context: 131,072 tokens
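These limits can be checked before sending a request. The sketch below assumes, as is typical for chat completion APIs but not stated on this page, that the context window must hold the prompt and the generated output together. The function name `clamp_max_tokens` is hypothetical.

```python
# Limits from the Token Limits list.
MAX_OUTPUT_TOKENS = 131_072
MAX_CONTEXT_TOKENS = 131_072


def clamp_max_tokens(prompt_tokens, requested_max_tokens):
    """Return a max_tokens value that fits within the listed limits.

    Assumption: prompt tokens and output tokens share the context window,
    so the output budget is whatever the prompt leaves free.
    Returns 0 when the prompt alone fills or exceeds the context window.
    """
    remaining_context = MAX_CONTEXT_TOKENS - prompt_tokens
    allowed = min(requested_max_tokens, MAX_OUTPUT_TOKENS, remaining_context)
    return max(0, allowed)
```

With a 131,000-token prompt, a requested `max_tokens` of 1024 would be reduced to 72, the space left in the window.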
Subscription Tiers
- free
- pro
- ultimate