TNG: DeepSeek R1T2 Chimera

tngtech/deepseek-r1t2-chimera

Description

DeepSeek-TNG-R1T2-Chimera is the second-generation Chimera model from TNG Tech. It is a 671B-parameter mixture-of-experts text-generation model assembled from DeepSeek-AI’s R1-0528, R1, and V3-0324 checkpoints with an Assembly-of-Experts merge. The tri-parent design yields strong reasoning performance while running roughly 20% faster than the original R1 and more than 2× faster than R1-0528 under vLLM, giving a favorable cost-to-intelligence trade-off. The checkpoint supports contexts up to 60k tokens in standard use (tested to ~130k) and maintains consistent <think> token behaviour, making it suitable for long-context analysis, dialogue, and other open-ended generation tasks.
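
Because the model emits its reasoning inside <think> tags, client code may want to separate that reasoning from the final answer. The following is a minimal sketch, assuming the reasoning arrives inline in the response text as <think>...</think>; some gateways strip it or return it in a separate field, in which case this step is unnecessary. The helper name split_reasoning is hypothetical.

```python
import re

# Assumption: reasoning is wrapped in literal <think>...</think> tags
# inside the returned message text.
_THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text):
    """Return a (reasoning, answer) pair from raw model output."""
    # Collect the contents of every <think> block.
    reasoning = "\n".join(block.strip() for block in _THINK_BLOCK.findall(text))
    # Whatever remains after removing the blocks is treated as the answer.
    answer = _THINK_BLOCK.sub("", text).strip()
    return reasoning, answer
```

If the text contains no <think> block, the reasoning part is an empty string and the answer is the stripped input.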

API Usage Examples

OpenAI Compatible Endpoint

Use this endpoint with any OpenAI-compatible library. Model: TNG: DeepSeek R1T2 Chimera (tngtech/deepseek-r1t2-chimera)

curl https://api.ridvay.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "tngtech/deepseek-r1t2-chimera",
    "messages": [
      {
        "role": "user",
        "content": "Explain the capabilities of the TNG: DeepSeek R1T2 Chimera model"
      }
    ],
    "temperature": 0.7,
    "max_tokens": 1024
  }'
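
The same request can be made from Python with only the standard library. This is a sketch rather than an official client: it reuses the URL, headers, and body from the curl example, and it assumes the response follows the usual OpenAI chat-completions shape (choices[0].message.content). The environment variable name RIDVAY_API_KEY and the helper names build_request and send are hypothetical.

```python
import json
import os
import urllib.request

API_URL = "https://api.ridvay.com/v1/chat/completions"  # from the curl example
MODEL_ID = "tngtech/deepseek-r1t2-chimera"


def build_request(prompt, api_key, temperature=0.7, max_tokens=1024):
    """Build (but do not send) a chat-completions request."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": temperature,
        "max_tokens": max_tokens,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request, timeout=120):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # RIDVAY_API_KEY is a hypothetical variable name; nothing is sent without it.
    key = os.environ.get("RIDVAY_API_KEY")
    if key:
        body = send(build_request("Explain the capabilities of this model", key))
        # Assumes the standard OpenAI-style response layout.
        print(body["choices"][0]["message"]["content"])
```

Keeping request construction separate from sending makes the payload easy to inspect or log before any tokens are billed.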

Supported Modalities

  • Text

API Pricing

  • Input: $0.30 / 1M tokens
  • Output: $1.20 / 1M tokens
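
To make the rates concrete, the sketch below estimates the cost of a single request from its token counts. It assumes billing is strictly linear in tokens at the listed rates, with no minimum charge, rounding, or tier discount; the function name estimate_cost_usd is hypothetical.

```python
# Listed rates, in US dollars per one million tokens.
INPUT_USD_PER_MILLION = 0.30
OUTPUT_USD_PER_MILLION = 1.20


def estimate_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request, assuming purely linear pricing."""
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000
```

For example, a request with 2,000 input tokens and 1,024 output tokens comes to about $0.0018 under these assumptions.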

Token Limits

  • Max Output: 163,840 tokens
  • Max Context: 163,840 tokens
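
A client can check these limits before sending a request. The sketch below assumes that prompt and completion tokens share the same context window, which is the common convention but is not stated on this page, and that the prompt token count has already been obtained from a tokenizer. The function name clamp_max_tokens is hypothetical.

```python
MAX_CONTEXT_TOKENS = 163_840  # listed maximum context
MAX_OUTPUT_TOKENS = 163_840   # listed maximum output


def clamp_max_tokens(prompt_tokens, requested_max_tokens):
    """Return the largest max_tokens value that fits the listed limits.

    Assumes prompt and completion share one context window.
    """
    if prompt_tokens >= MAX_CONTEXT_TOKENS:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = MAX_CONTEXT_TOKENS - prompt_tokens
    return min(requested_max_tokens, MAX_OUTPUT_TOKENS, remaining)
```

The returned value can be passed as max_tokens in the request body.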

Subscription Tiers

  • free
  • pro
  • ultimate