OpenAI

GPT-4.1 Mini

Chat
openai/gpt-4.1-mini

Mid-sized GPT-4.1 variant with performance comparable to GPT-4o at lower latency and cost. It has a 1M-token context window and supports structured outputs and vision understanding.

1M context window
33K max output tokens
Released: 2025-04-14
Supported Protocols: OpenAI
Available Providers: Azure
Capabilities: Vision, Function Calling, Prompt Caching

Pricing

Type            Price
Input Tokens    $0.4/M
Output Tokens   $1.6/M
Cache Read      $0.1/M
Web Search      $0.01/R
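To make the rates concrete, here is a rough cost estimator built from the listed prices. It is only a sketch: the function name `estimate_cost` and its parameters are hypothetical, it assumes "/M" means per million tokens and "/R" means per web search request, and it assumes cached input tokens are billed at the Cache Read rate instead of the Input Tokens rate. Actual billing may differ.

```python
from decimal import Decimal

# USD per million tokens, copied from the pricing list.
PRICE_PER_MILLION = {
    "input": Decimal("0.4"),
    "output": Decimal("1.6"),
    "cache_read": Decimal("0.1"),
}
# USD per web search request (assumed meaning of "/R").
WEB_SEARCH_PRICE = Decimal("0.01")

MILLION = Decimal(1_000_000)


def estimate_cost(uncached_input_tokens, output_tokens,
                  cached_input_tokens=0, web_searches=0):
    """Return an estimated request cost in USD as a Decimal.

    Hypothetical helper: assumes cached input tokens are charged only
    at the cache-read rate, which the listing does not state explicitly.
    """
    cost = Decimal(uncached_input_tokens) * PRICE_PER_MILLION["input"] / MILLION
    cost += Decimal(output_tokens) * PRICE_PER_MILLION["output"] / MILLION
    cost += Decimal(cached_input_tokens) * PRICE_PER_MILLION["cache_read"] / MILLION
    cost += Decimal(web_searches) * WEB_SEARCH_PRICE
    return cost
```

For example, `estimate_cost(10_000, 2_000)` gives the estimated cost of a request with 10,000 uncached input tokens and 2,000 output tokens. Decimal is used rather than float so that small per-token amounts add up without rounding drift.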

Code Examples

from openai import OpenAI

# Point the OpenAI SDK at the Ofox.ai endpoint.
client = OpenAI(
    base_url="https://api.ofox.ai/v1",
    api_key="YOUR_OFOX_API_KEY",
)

response = client.chat.completions.create(
    model="openai/gpt-4.1-mini",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
)

print(response.choices[0].message.content)
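Since the model lists Vision among its capabilities, an image can in principle be sent alongside text. The sketch below only builds the `messages` payload in the OpenAI Chat Completions format (a `text` part plus an `image_url` part carrying a base64 data URL). The helper name `build_vision_messages` is hypothetical, and it is an assumption that the Ofox.ai endpoint accepts this content format unchanged.

```python
import base64


def build_vision_messages(prompt, image_bytes, mime_type="image/png"):
    """Build a Chat Completions `messages` list with one text part and one image part.

    Hypothetical helper; the image is embedded as a base64 data URL.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime_type};base64,{encoded}"
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url", "image_url": {"url": data_url}},
            ],
        }
    ]
```

The returned list can be passed as the `messages` argument to `client.chat.completions.create` with `model="openai/gpt-4.1-mini"`.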

Frequently Asked Questions

GPT-4.1 Mini on Ofox.ai costs $0.4 per million input tokens and $1.6 per million output tokens. Billing is pay-as-you-go, with no monthly fees.
