Google: Gemini 3.5 Flash

Chat
google/gemini-3.5-flash

Gemini 3.5 Flash is Google's efficient multimodal model, delivering near-Pro-level coding and reasoning at Flash-tier cost and speed. It is optimized for coding tasks and parallel agent execution, and offers configurable thinking levels. Released May 20, 2026.

1M context window
66K max output tokens
Released: 2026-05-20
Supported Protocols: OpenAI, Gemini
Available Providers: Google Cloud Vertex
Capabilities: Vision, Function Calling, Reasoning, Prompt Caching, Web Search, Audio Input, Video Input, PDF Input

Pricing

Type             Price
Input Tokens     $1.5/M
Output Tokens    $9/M
Audio Input      $3/M
Cache Read       $0.15/M
Cache Write      $0.83/M
Cached Audio     $0.3/M
Web Search       $0.014/R
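
As a rough illustration of how the per-million rates translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the billing model are assumptions for illustration only: it assumes that `input_tokens` counts only non-cached text input, and that cached tokens are billed at the Cache Read rate. Actual invoices may be computed differently.

```python
# Per-million-token prices in USD, copied from the pricing table.
PRICE_PER_MILLION = {
    "input": 1.5,
    "output": 9.0,
    "cache_read": 0.15,
}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes input_tokens excludes cached tokens and that cached tokens
    are billed at the cache-read rate.
    """
    total = (
        input_tokens * PRICE_PER_MILLION["input"]
        + output_tokens * PRICE_PER_MILLION["output"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 200K input tokens and 10K output tokens, nothing cached.
    print(f"${estimate_cost(200_000, 10_000):.4f}")
```

With these rates, output tokens cost six times as much as input tokens, so long generations tend to dominate the bill even when prompts are large.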

Code Examples

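from google import genai

# Point the Google Gen AI SDK at the Ofox.ai Gemini-compatible endpoint.
# Note: the SDK's http_options key for a custom endpoint is "base_url".
client = genai.Client(
    api_key="YOUR_OFOX_API_KEY",
    http_options={"api_version": "v1beta", "base_url": "https://api.ofox.ai/gemini"},
)

# Send a single text prompt and print the model's reply.
response = client.models.generate_content(
    model="google/gemini-3.5-flash",
    contents="Hello!",
)
print(response.text)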

Frequently Asked Questions

Google: Gemini 3.5 Flash on Ofox.ai costs $1.50 per million input tokens and $9 per million output tokens. Billing is pay-as-you-go, with no monthly fees.