Gemini

Google: Gemini 2.5 Flash Lite

Chat
google/gemini-2.5-flash-lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, [thinking] (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the Reasoning API parameter to selectively trade off cost for intelligence.

1M janela de contexto
66K tokens máx de saída
Lançamento: 2025-07-22
Protocolos Suportados:OpenAIopenaiGeminigemini
Provedores Disponíveis:GoogleCloudVertex
Capacidades:VisãoFunction CallingPrompt CachingEntrada PDF

Providers

GoogleCloudVertex
Tokens de Entrada
$0.1/M
Tokens de Saída
$0.4/M
Leitura de Cache
$0.025/M
Escrita de Cache
$1/M
Entrada de Áudio
$0.3/M
Áudio em Cache
$0.3/M
Busca Web
$0.035/R
Protocols
OpenAIopenai/v1/chat/completions
Geminigemini

Exemplos de Código

from google import genai
client = genai.Client(
api_key="YOUR_OFOX_API_KEY",
http_options={"api_version": "v1beta", "url": "https://api.ofox.ai/gemini"},
)
response = client.models.generate_content(
model="google/gemini-2.5-flash-lite",
contents="Hello!",
)
print(response.text)

Perguntas Frequentes

Google: Gemini 2.5 Flash Lite na Ofox.ai custa $0.1/M por milhão de tokens de entrada e $0.4/M por milhão de tokens de saída. Pague por uso, sem mensalidade.