MiMo v2.5 Pro

Model Information

Display Name: MiMo v2.5 Pro

API Model ID: xiaomi/mimo-v2.5-pro

Category: Text To Text

Description: MiMo v2.5 Pro is Xiaomi's flagship reasoning model, ranked #10 globally among 371 AI models. Built on a sparse Mixture-of-Experts architecture with 1.02 trillion total parameters and 42 billion active per token, it delivers frontier-level performance on coding, mathematics, and complex agentic tasks.

Architecture:

  • 1.02T total parameters (sparse MoE), 42B active per token
  • 70 layers (1 dense + 69 MoE) with 384 routed experts, 8 selected per token
  • Hybrid attention: 10 full attention + 60 sliding window layers (6:1 ratio, 128-token window)
  • Multi-Token Prediction (MTP): 3 lightweight modules for ~3x output speed
  • FP8 (E4M3) mixed precision

Key Features:

  • 1M token context window for massive document and codebase processing
  • Advanced chain-of-thought reasoning with reasoning_content field
  • Function calling, tool use, and structured JSON outputs
  • SWE-bench Pro: 57.2%, top-tier agentic coding performance
  • ClawEval: 64% Pass³ using 40-60% fewer tokens than competing frontier models
  • OpenAI-compatible API
  • MIT License (fully open-source)

Best For:

  • Complex multi-step reasoning and agentic workflows
  • Long-horizon coding tasks and large codebase analysis
  • Long document summarization and QA (up to 1M tokens)
  • Mathematical and scientific computing
  • Cost-effective frontier-level intelligence
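The architecture figures in the description can be sanity-checked with a little arithmetic. The sketch below only restates the quoted numbers; the constant and function names are illustrative and not part of any API.

```python
# Figures quoted in the description; names are illustrative only.
TOTAL_PARAMS = 1.02e12       # 1.02T total parameters
ACTIVE_PARAMS = 42e9         # 42B active per token
ROUTED_EXPERTS = 384
EXPERTS_PER_TOKEN = 8
FULL_ATTN_LAYERS = 10
SLIDING_LAYERS = 60


def share(part: float, whole: float) -> float:
    """Return part / whole as a fraction."""
    return part / whole


param_share = share(ACTIVE_PARAMS, TOTAL_PARAMS)
expert_share = share(EXPERTS_PER_TOKEN, ROUTED_EXPERTS)
sliding_per_full = SLIDING_LAYERS // FULL_ATTN_LAYERS
total_layers = FULL_ATTN_LAYERS + SLIDING_LAYERS

print(f"active parameter share per token: {param_share:.1%}")    # about 4.1%
print(f"routed experts used per token:    {expert_share:.1%}")   # about 2.1%
print(f"sliding-window layers per full-attention layer: {sliding_per_full}")
print(f"attention layers in total: {total_layers}")
```

The 6:1 ratio and the 70-layer total both follow from the 10 + 60 split, so the quoted figures are internally consistent.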

Context Window: 1,000,000 tokens

Max Output: 32,768 tokens
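The two limits above suggest a simple token budget. The sketch below assumes that generated tokens count against the same 1,000,000-token window as the prompt; the page does not confirm this, so treat the result as a conservative estimate. The helper name is hypothetical.

```python
CONTEXT_WINDOW = 1_000_000   # tokens, from "Context Window"
MAX_OUTPUT = 32_768          # tokens, from "Max Output"


def max_prompt_tokens(max_tokens: int = MAX_OUTPUT) -> int:
    """Prompt budget left after reserving max_tokens for the reply.

    Assumption: prompt and reply share one context window.
    """
    if not 1 <= max_tokens <= MAX_OUTPUT:
        raise ValueError(f"max_tokens must be between 1 and {MAX_OUTPUT}")
    return CONTEXT_WINDOW - max_tokens


print(max_prompt_tokens())        # 967232
print(max_prompt_tokens(4096))    # 995904
```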

How to Use This Model

To use MiMo v2.5 Pro via the HInow.ai API, use the model ID: xiaomi/mimo-v2.5-pro

API Request Example (Chat/Text)


POST https://api.hinow.ai/v1/chat/completions
Authorization: Bearer YOUR_API_KEY
Content-Type: application/json

{
  "model": "xiaomi/mimo-v2.5-pro",
  "messages": [
    {"role": "user", "content": "Your message here"}
  ]
}
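
The same request can be built in Python with only the standard library. This is a minimal sketch: the URL, headers, and body come from the example above, while the function name `build_request` and the `HINOW_API_KEY` environment variable are conventions chosen for illustration.

```python
import json
import os
import urllib.request

API_URL = "https://api.hinow.ai/v1/chat/completions"
MODEL_ID = "xiaomi/mimo-v2.5-pro"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request from the example above."""
    body = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Only contact the API when a key is present in the environment.
    key = os.environ.get("HINOW_API_KEY")
    if key:
        request = build_request("Your message here", key)
        with urllib.request.urlopen(request, timeout=60) as response:
            print(json.load(response))
```

Keeping request construction separate from sending makes the payload easy to inspect before any tokens are billed.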
              

Pricing

  • input: $0.5655 per 1M tokens
  • output: $1.131 per 1M tokens
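
To estimate what a request costs, scale each token count by its price. The sketch below assumes both prices are quoted in US dollars per 1M tokens; the function name is hypothetical.

```python
INPUT_USD_PER_M = 0.5655    # listed input price
OUTPUT_USD_PER_M = 1.131    # listed output price


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one request, assuming per-1M-token pricing."""
    return (
        input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    ) / 1_000_000


# A full 1M-token prompt with a maximum-length 32,768-token reply.
print(f"${estimate_cost_usd(1_000_000, 32_768):.4f}")   # $0.6026
```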

Available Parameters

  • temperature: Controls randomness (0-2). Default: 0.7 (Options: 0, 0.3, 0.5, 0.7, 1.0, 1.5, 2.0)
  • top_p: Nucleus sampling (0-1). Default: 0.9 (Options: 0.1, 0.5, 0.7, 0.9, 0.95, 1.0)
  • max_tokens: Max tokens to generate (1-32768) (Options: 512, 1024, 2048, 4096, 8192, 16384, 32768)
  • frequency_penalty: Reduce token repetition (0-2). Default: 0 (Options: 0, 0.3, 0.5, 0.7, 1.0, 1.5, 2.0)
  • presence_penalty: Penalize repeated topics (0-2). Default: 0 (Options: 0, 0.3, 0.5, 0.7, 1.0, 1.5, 2.0)
  • response_format: Output format (Options: text, json_object)
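
The documented ranges can be checked on the client before a request is sent. This is a sketch only: `validate_params` is a hypothetical helper, and it says nothing about where the API expects these fields to appear in the request body.

```python
from typing import Any, Dict

# (min, max) ranges copied from the parameter list above.
RANGES = {
    "temperature": (0, 2),
    "top_p": (0, 1),
    "max_tokens": (1, 32768),
    "frequency_penalty": (0, 2),
    "presence_penalty": (0, 2),
}
RESPONSE_FORMATS = ("text", "json_object")


def validate_params(params: Dict[str, Any]) -> Dict[str, Any]:
    """Return params unchanged if every value is inside its documented range."""
    for name, value in params.items():
        if name == "response_format":
            if value not in RESPONSE_FORMATS:
                raise ValueError(f"response_format must be one of {RESPONSE_FORMATS}")
        elif name in RANGES:
            low, high = RANGES[name]
            if not low <= value <= high:
                raise ValueError(f"{name}={value} is outside {low}-{high}")
        else:
            raise ValueError(f"unknown parameter: {name}")
    return params


# Documented defaults for temperature and top_p, plus an explicit output cap.
checked = validate_params({"temperature": 0.7, "top_p": 0.9, "max_tokens": 1024})
print(checked)
```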

Quick Reference

To use this model, set: "model": "xiaomi/mimo-v2.5-pro"

Featured: No

Documentation: https://hinow.ai/models/xiaomi/mimo-v2.5-pro

API Endpoint: https://api.hinow.ai/v1

Price (rounded): $0.566 input / $1.13 output per 1M tokens

Code Examples

curl -X POST https://api.hinow.ai/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $HINOW_API_KEY" \
  -d '{
    "model": "xiaomi/mimo-v2.5-pro",
    "messages": [
      {"role": "user", "content": "Hello! How are you?"}
    ],
    "parameters": {
      "temperature": "0",
      "top_p": "0.1",
      "max_tokens": "512",
      "frequency_penalty": "0",
      "presence_penalty": "0",
      "response_format": "text"
    }
  }'
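
Once a response arrives, the answer and the chain-of-thought text can be separated. The sketch below assumes an OpenAI-style response shape (`choices[0].message`) with `reasoning_content` sitting next to `content`; the page names the field but does not show a response, so the sample dictionary is hypothetical and not captured from the API.

```python
from typing import Optional, Tuple


def split_reply(response: dict) -> Tuple[str, Optional[str]]:
    """Return (answer, reasoning) from an assumed OpenAI-style response."""
    message = response["choices"][0]["message"]
    return message.get("content", ""), message.get("reasoning_content")


# Hypothetical response body used only to exercise the helper.
sample = {
    "choices": [
        {
            "message": {
                "role": "assistant",
                "content": "4",
                "reasoning_content": "2 + 2 = 4",
            }
        }
    ]
}

answer, reasoning = split_reply(sample)
print("answer:", answer)
print("reasoning:", reasoning)
```

Using `.get` for `reasoning_content` means the helper also works for replies that carry no reasoning text, in which case it returns `None`.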