Gemini

Google: Gemini 2.5 Flash Lite

Chat
google/gemini-2.5-flash-lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, [thinking] (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

1M コンテキストウィンドウ
66K 最大出力トークン
リリース日: 2025-07-22
対応プロトコル:OpenAIopenaiGeminigemini
利用可能なプロバイダー:GoogleCloudVertex
機能:VisionFunction CallingプロンプトキャッシュPDF入力

Providers

GoogleCloudVertex
入力トークン
$0.1/M
出力トークン
$0.4/M
キャッシュ読込
$0.025/M
キャッシュ書込
$1/M
音声入力
$0.3/M
キャッシュ音声
$0.3/M
Web検索
$0.035/R
Protocols
OpenAIopenai/v1/chat/completions
Geminigemini

コード例

from google import genai
client = genai.Client(
api_key="YOUR_OFOX_API_KEY",
http_options={"api_version": "v1beta", "url": "https://api.ofox.ai/gemini"},
)
response = client.models.generate_content(
model="google/gemini-2.5-flash-lite",
contents="Hello!",
)
print(response.text)

よくある質問

Ofox.aiでのGoogle: Gemini 2.5 Flash Liteの料金は、入力100万トークンあたり$0.1/M、出力100万トークンあたり$0.4/Mです。従量課金制、月額料金なし。