MiniMax: MiniMax M2.5 Lightning

Chat
minimax/minimax-m2.5-lightning

MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity. Trained across a diverse range of complex, real-world digital working environments, M2.5 builds on the coding expertise of M2.1 and extends into general office work: it is fluent in generating and operating Word, Excel, and PowerPoint files, switching context between different software environments, and working across different agent and human teams. It scores 80.2% on SWE-Bench Verified, 51.3% on Multi-SWE-Bench, and 76.3% on BrowseComp. M2.5 is also more token-efficient than previous generations, having been trained to optimize its actions and output through planning.

Context Window: 200K
Max Output Tokens: 131K
Released: 2026-02-12
Capabilities: Function Calling, Reasoning, Prompt Caching, Web Search
Available Providers: MiniMax
Supported Protocols: OpenAI (openai), Anthropic (anthropic)

Providers

MiniMax
Input Tokens: $0.3/M
Output Tokens: $2.4/M
Cache Read: $0.03/M
Cache Write: $0.375/M
Protocols:
OpenAI (openai): /v1/chat/completions, /v1/responses
Anthropic (anthropic)
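
The per-million-token prices listed above can be turned into a rough per-request cost estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost` and its parameters are hypothetical, and it assumes the four token counts are disjoint categories, each billed at its own listed rate. How cached tokens are actually counted against input tokens is not stated on this page.

```python
# Listed prices in USD per one million tokens (taken from the pricing above).
PRICE_PER_MILLION_USD = {
    "input": 0.3,
    "output": 2.4,
    "cache_read": 0.03,
    "cache_write": 0.375,
}


def estimate_cost(input_tokens, output_tokens,
                  cache_read_tokens=0, cache_write_tokens=0):
    """Return an estimated cost in USD for one request.

    Assumption: each token count is billed separately at its listed rate.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cache_read": cache_read_tokens,
        "cache_write": cache_write_tokens,
    }
    return sum(
        counts[kind] * PRICE_PER_MILLION_USD[kind] / 1_000_000
        for kind in counts
    )


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens, no caching.
    print(f"${estimate_cost(10_000, 2_000):.4f}")  # prints $0.0078
```

At these rates, output tokens cost eight times as much as input tokens, so long generations are likely to dominate the bill for most workloads.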

Code Examples

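from openai import OpenAI

# Point the OpenAI SDK at the Ofox.ai OpenAI-compatible endpoint.
client = OpenAI(
    base_url="https://api.ofox.ai/v1",
    api_key="YOUR_OFOX_API_KEY",
)

# Send a single-turn chat request to the model.
response = client.chat.completions.create(
    model="minimax/minimax-m2.5-lightning",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
)

# Print the text of the first (and only) choice.
print(response.choices[0].message.content)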
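
Because the model lists Function Calling as a capability and is served over the OpenAI chat completions protocol, a tool-enabled request might look like the sketch below. This is an illustration under stated assumptions: the `get_weather` tool, the `build_request` and `send` helpers, and the example prompt are all hypothetical, and the sketch assumes the endpoint accepts the standard OpenAI `tools` parameter, which this page does not confirm in detail.

```python
import json

MODEL = "minimax/minimax-m2.5-lightning"

# Hypothetical tool definition in the OpenAI chat completions "tools" format.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def build_request(user_text):
    """Build keyword arguments for client.chat.completions.create()."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": user_text}],
        "tools": [WEATHER_TOOL],
    }


def send(request, api_key):
    """Send the request; requires the openai package and a valid key."""
    from openai import OpenAI  # imported lazily so build_request works offline

    client = OpenAI(base_url="https://api.ofox.ai/v1", api_key=api_key)
    return client.chat.completions.create(**request)


if __name__ == "__main__":
    # Inspect the payload without making a network call.
    print(json.dumps(build_request("What is the weather in Paris?"), indent=2))
```

If the model decides to call the tool, the OpenAI SDK exposes the call on `response.choices[0].message.tool_calls`; the application is then expected to run the function itself and return the result in a follow-up message.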

Frequently Asked Questions

MiniMax: MiniMax M2.5 Lightning on Ofox.ai costs $0.3 per million input tokens and $2.4 per million output tokens. Billing is pay-as-you-go, with no monthly fees.