How much does Google: Gemini 2.5 Flash Lite cost on Ofox?

Google: Gemini 2.5 Flash Lite on Ofox.ai costs $0.1/M per million input tokens and $0.4/M per million output tokens. Pay-as-you-go, no monthly fees.

What is Google: Gemini 2.5 Flash Lite's context window?

Google: Gemini 2.5 Flash Lite supports a context window of 1,048,576 tokens, allowing you to process large documents and maintain long conversations.

How to use Google: Gemini 2.5 Flash Lite API via Ofox?

Simply set your base URL to https://api.ofox.ai/v1 and use your Ofox API key. The API is OpenAI-compatible — just change the base URL and API key in your existing code.

What capabilities does Google: Gemini 2.5 Flash Lite support?

Google: Gemini 2.5 Flash Lite supports the following capabilities: vision, function calling, prompt caching, pdf input. Access all features through the Ofox.ai unified API.

Google: Gemini 2.5 Flash Lite

Name: Google: Gemini 2.5 Flash Lite
Brand: google
Price: 0.09999999999999999 USD
Availability: InStock
Rating: 5 (1 reviews)

Chat

google/gemini-2.5-flash-lite

对比开始使用

Gemini 2.5 Flash Lite 是 Google Gemini 2.5 系列中的轻量成员，发布于 2025-07-22，面向极低延迟与成本效率优化。相比早期 Flash 模型，它提升了吞吐量与生成速度，并默认关闭多轮思考，以保证响应足够快。能力包括图像与 PDF 输入、函数调用和提示缓存。输入 $0.10/M tokens、输出 $0.40/M tokens 的价格，使它成为大批量分类、抽取与请求路由等场景中最经济的选择之一。上下文窗口：1M tokens，输出：64K。可通过 OpenAI 与 Gemini 两种协议访问。

上下文窗口

最大输出 Token

66K

发布日期

2025-07-22

能力

视觉函数调用提示缓存PDF 输入

可用供应商

Vertex

支持的协议

openaigemini

供应商

Vertex

输入 Token

$0.1/M

输出 Token

$0.4/M

缓存读取

$0.025/M

缓存写入

$1/M

音频输入

$0.3/M

缓存音频

$0.3/M

网络搜索

$0.035/R

接入协议

openai/v1/chat/completions

gemini

代码示例

from google import genai

client = genai.Client(
    api_key="YOUR_OFOX_API_KEY",
    http_options={"api_version": "v1beta", "base_url": "https://api.ofox.ai/gemini"},
)

response = client.models.generate_content(
    model="google/gemini-2.5-flash-lite",
    contents="Hello!",
)

print(response.text)

运行状态

第三方评测

LMArena ↗评测条目:gemini-2.5-flash-lite-preview-09-2025-no-thinking

Google: Gemini 2.5 Flash Lite 在 LMArena 文本榜单(风格控制) 综合类别中获得 1380 分,在 374 个模型中排名第 151,基于 47,185 次人类偏好投票(更新于 2026-07-12)。

gemini-2.5-flash-lite-preview-09-2025-no-thinking 在 LMArena 的评测分数
类别	Arena 分数	95% 置信区间	排名	投票数
综合	1380	1376–1383	第 151 / 374 名	47,185
困难任务	1390	1386–1395	第 158 / 374 名	25,046
编程	1397	1391–1404	第 175 / 369 名	9,675
数学	1364	1353–1375	第 164 / 362 名	2,867
创意写作	1361	1353–1369	第 134 / 372 名	6,502
指令遵循	1365	1360–1371	第 156 / 374 名	12,894
中文	1407	1394–1420	第 143 / 344 名	2,167