6月 🎉 GPT 85 折 🎉 直降 15% 全系列生效 🔥查看详情
Gemini

Google: Gemini 3.1 Flash Lite

Chat
google/gemini-3.1-flash-lite

Gemini 3.1 Flash Lite (GA) is Google's high-efficiency multimodal model optimized for low-latency, high-volume workloads. GA version of the preview model. Supports full thinking levels (minimal, low, medium, high) for cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash. Released May 7, 2026.

上下文窗口
1M
最大输出 Token
64K
发布日期
2026-05-07
能力
视觉函数调用推理提示缓存网络搜索音频输入视频输入PDF 输入
可用供应商
GoogleCloudVertex
支持的协议
OpenAIopenaiGeminigemini

供应商

GoogleCloudVertex
输入 Token
$0.25/M
输出 Token
$1.5/M
缓存读取
$0.025/M
缓存写入
$1/M
音频输入
$0.5/M
缓存写入(1 小时)
$1/M
缓存音频
$0.05/M
网络搜索
$0.014/R
接入协议
OpenAIopenai/v1/chat/completions
Geminigemini

代码示例

from google import genai
client = genai.Client(
api_key="YOUR_OFOX_API_KEY",
http_options={"api_version": "v1beta", "base_url": "https://api.ofox.ai/gemini"},
)
response = client.models.generate_content(
model="google/gemini-3.1-flash-lite",
contents="Hello!",
)
print(response.text)

常见问题

Google: Gemini 3.1 Flash Lite 在 Ofox.ai 上的价格为输入 $0.25/M/百万 Token,输出 $1.5/M/百万 Token。按量计费,无月费。