Gemini

Google: Gemini 2.5 Flash Lite

Chat
google/gemini-2.5-flash-lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, [thinking] (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

1M 上下文視窗
66K 最大輸出 token
發布日期: 2025-07-22
支援的協定:OpenAIopenaiGeminigemini
可用供應商:GoogleCloudVertex
能力:視覺函式呼叫提示快取PDF 輸入

定價

類型價格
輸入 Token$0.1/M
輸出 Token$0.4/M
音訊輸入$0.3/M
快取讀取$0.025/M
快取寫入$1/M
快取音訊$0.3/M
網路搜尋$0.035/R

程式碼範例

from google import genai
client = genai.Client(
api_key="YOUR_OFOX_API_KEY",
http_options={"api_version": "v1beta", "url": "https://api.ofox.ai/gemini"},
)
response = client.models.generate_content(
model="google/gemini-2.5-flash-lite",
contents="Hello!",
)
print(response.text)

常見問題

Google: Gemini 2.5 Flash Lite 在 Ofox.ai 上的價格為輸入 $0.1/M/百萬 Token,輸出 $0.4/M/百萬 Token。按量計費,無月費。

微信社群

掃碼加入開發者交流群

微信社群

Discord

加入 Discord 社群

Discord

技術顧問

掃碼新增技術顧問

技術顧問