Chat Models
Browse all 65 Chat models available on Ofox.ai
Z.ai: GLM-5-Turbo
z-ai/glm-5-turboContext200K
Max Out128K
Input$1.2/M
Output$4/M
Web Search$0.005/R
Cache Read$0.24/M
chatFunctionsReasoningCachingWeb2026-03-16
OpenAI: GPT-5.4
openai/gpt-5.4Context1M
Max Out128K
Input$2.5/M
Output$15/M
Web Search$0.01/R
Cache Read$0.25/M
chatVisionFunctionsReasoningCachingWeb2026-03-05
OpenAI: GPT-5.4 Pro
openai/gpt-5.4-proContext1M
Max Out128K
Input$30/M
Output$180/M
Web Search$0.01/R
chatVisionFunctionsReasoningWeb2026-03-05
Google: Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-previewContext1M
Max Out64K
Input$0.25/M
Output$1.5/M
Audio$0.5/M
Web Search$0.014/R
Cache Read$0.025/M
Cache Write$1/M
Cache 1h$1/M
Cached Audio$0.05/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2026-03-03
OpenAI: GPT-5.3 Codex
openai/gpt-5.3-codexContext512K
Max Out128K
Input$1.75/M
Output$14/M
Web Search$0.01/R
Cache Read$0.18/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2026-02-25
Qwen: Qwen3.5 122B A10B
bailian/qwen3.5-122b-a10bContext256K
Max Out64K
Input$0.29/M
Output$2.29/M
Web Search$0.01/R
Cache Read$0.29/M
chatVisionFunctionsReasoningCachingWebVideo2026-02-23
Qwen: Qwen3.5 27B
bailian/qwen3.5-27bContext256K
Max Out64K
Input$0.29/M
Output$2.05/M
Web Search$0.01/R
Cache Read$0.29/M
chatVisionFunctionsReasoningCachingWebVideo2026-02-23
Qwen: Qwen3.5 35B A3B
bailian/qwen3.5-35b-a3bContext256K
Max Out64K
Input$0.29/M
Output$1.83/M
Web Search$0.01/R
Cache Read$0.29/M
chatVisionFunctionsReasoningCachingWebVideo2026-02-23
Qwen: Qwen3.5 397B A17B
bailian/qwen3.5-397b-a17bContext256K
Max Out64K
Input$0.55/M
Output$3.5/M
Web Search$0.01/R
Cache Read$0.55/M
chatVisionFunctionsReasoningCachingWebVideo2026-02-23
Qwen: Qwen3.5 Flash
bailian/qwen3.5-flashContext1M
Max Out64K
Input$0.1/M
Output$0.4/M
Web Search$0.01/R
Cache Read$0.01/M
Cache Write$0.125/M
chatVisionFunctionsReasoningCachingVideo2026-02-23
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-previewContext1M
Max Out66K
Input$2/M
Output$12/M
Audio$2/M
Web Search$0.014/R
Cache Read$0.2/M
Cache Write$4.5/M
Cached Audio$0.2/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2026-02-19
Qwen3 Coder Next
bailian/qwen3-coder-nextContext256K
Max Out64K
Input$0.2/M
Output$1.5/M
chatReasoning2026-02-19
Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6Context1M
Max Out128K
Input$3/M
Output$15/M
Web Search$0.01/R
Cache Read$0.3/M
Cache 5m$3.75/M
Cache 1h$6/M
chatVisionFunctionsReasoningCachingPDF2026-02-17
Qwen3.5 Plus
bailian/qwen3.5-plusContext1M
Max Out64K
Input$0.4/M
Output$2.4/M
Web Search$0.01/R
Cache Read$0.04/M
Cache Write$0.4/M
chatVisionFunctionsReasoningCachingVideo2026-02-16
Doubao Seed 2.0 Code
volcengine/doubao-seed-2.0-codeContext256K
Max Out128K
Input$0.67/M
Output$3.36/M
Cache Read$0.14/M
Cache Write$0.0024/M
chatVisionFunctionsReasoningCachingVideo2026-02-14
Doubao Seed 2.0 Lite
volcengine/doubao-seed-2.0-liteContext256K
Max Out32K
Input$0.13/M
Output$0.76/M
Cache Read$0.03/M
Cache Write$0.0024/M
chatVisionFunctionsReasoningCachingVideo2026-02-14
Doubao Seed 2.0 Mini
volcengine/doubao-seed-2.0-miniContext256K
Max Out32K
Input$0.06/M
Output$0.56/M
Cache Read$0.02/M
Cache Write$0.0024/M
chatVisionFunctionsReasoningCachingVideo2026-02-14
Doubao Seed 2.0 Pro
volcengine/doubao-seed-2.0-proContext256K
Max Out128K
Input$0.67/M
Output$3.36/M
Cache Read$0.14/M
Cache Write$0.0024/M
chatVisionFunctionsReasoningCachingVideo2026-02-14
MiniMax: MiniMax M2.5
minimax/minimax-m2.5Context200K
Max Out131K
Input$0.3/M
Output$1.2/M
Cache Read$0.03/M
Cache Write$0.375/M
chatFunctionsReasoningCachingWeb2026-02-12
MiniMax: MiniMax M2.5 Lightning
minimax/minimax-m2.5-lightningContext200K
Max Out131K
Input$0.3/M
Output$2.4/M
Cache Read$0.03/M
Cache Write$0.375/M
chatFunctionsReasoningCachingWeb2026-02-12
Z.ai: GLM-5
z-ai/glm-5Context200K
Max Out128K
Input$1/M
Output$3.2/M
Web Search$0.005/R
Cache Read$0.2/M
chatFunctionsReasoningCachingWeb2026-02-11
Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6Context1M
Max Out128K
Input$5/M
Output$25/M
Cache Read$0.5/M
Cache 5m$6.25/M
Cache 1h$10/M
chatVisionFunctionsReasoningCachingPDF2026-02-05
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5Context262K
Max Out262K
Input$0.6/M
Output$3/M
Web Search$0.0043/R
Cache Read$0.1/M
chatVisionFunctionsReasoningCachingWebVideo2026-01-27
MiniMax: MiniMax M2 Her
minimax/m2-herContext200K
Max Out131K
Input$0.3/M
Output$1.2/M
chatFunctionsReasoningCachingWeb2026-01-23
Qwen3 Max
bailian/qwen3-maxContext256K
Max Out64K
Input$0.36/M
Output$1.43/M
Web Search$0.01/R
Cache Read$0.072/M
chatFunctionsReasoningCaching2026-01-23
Z.ai: GLM-4.7 FlashX
z-ai/glm-4.7-flashxContext200K
Max Out128K
Input$0.072/M
Output$0.43/M
Web Search$0.005/R
Cache Read$0.015/M
chatFunctionsReasoningCachingWeb2026-01-19
Z.ai: GLM-4.7-Flash (Free)
z-ai/glm-4.7-flash:freeContext200K
Max Out128K
Input$0/M
Output$0/M
Web Search$0.005/R
chatFunctionsCachingWeb2026-01-19
OpenAI: GPT-5.2 Codex
openai/gpt-5.2-codexContext512K
Max Out128K
Input$1.75/M
Output$14/M
Web Search$0.01/R
Cache Read$0.18/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2026-01-14
Doubao Seed 1.8
volcengine/doubao-seed-1-8Context256K
Max Out64K
Input$0.12/M
Output$0.29/M
Cache Read$0.023/M
chatFunctionsReasoningCaching2025-12-28
MiniMax: MiniMax M2.1
minimax/minimax-m2.1Context205K
Max Out131K
Input$0.3/M
Output$1.2/M
Cache Read$0.03/M
Cache Write$0.375/M
chatFunctionsReasoningCachingWeb2025-12-23
MiniMax: MiniMax M2.1 Lightning
minimax/minimax-m2.1-lightningContext205K
Max Out131K
Input$0.3/M
Output$2.4/M
Cache Read$0.03/M
Cache Write$0.375/M
chatFunctionsReasoningCachingWeb2025-12-23
Z.ai: GLM 4.7
z-ai/glm-4.7Context200K
Max Out128K
Input$0.4/M
Output$2/M
Web Search$0.005/R
Cache Read$0.08/M
chatFunctionsReasoningCachingWeb2025-12-23
Google: Gemini 3 Flash Preview
google/gemini-3-flash-previewContext1M
Max Out66K
Input$0.5/M
Output$3/M
Audio$1/M
Web Search$0.014/R
Cache Read$0.05/M
Cache Write$1/M
Cached Audio$0.1/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-12-17
OpenAI: GPT-5.2
openai/gpt-5.2Context512K
Max Out128K
Input$1.75/M
Output$14/M
Web Search$0.01/R
Cache Read$0.18/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-12-11
OpenAI: GPT-5.1 Codex Max
openai/gpt-5.1-codex-maxContext256K
Max Out128K
Input$1.25/M
Output$10/M
Web Search$0.01/R
Cache Read$0.13/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-12-04
DeepSeek V3.2
deepseek/deepseek-v3.2Context128K
Max Out32K
Input$0.29/M
Output$0.43/M
Cache Read$0.06/M
chatFunctionsCaching2025-12-01
Qwen Plus
bailian/qwen-plusContext1M
Max Out32K
Input$0.12/M
Output$0.29/M
Cache Read$0.023/M
chatFunctions2025-12-01
Anthropic: Claude Opus 4.5
anthropic/claude-opus-4.5Context200K
Max Out64K
Input$5/M
Output$25/M
Cache Read$0.5/M
Cache 5m$6.25/M
Cache 1h$10/M
chatVisionFunctionsReasoningCachingPDF2025-11-24
Google: Gemini 3 Pro Preview
google/gemini-3-pro-previewContext1M
Max Out66K
Input$2/M
Output$12/M
Audio$2/M
Web Search$0.014/R
Cache Read$0.2/M
Cache Write$4.5/M
Cached Audio$0.2/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-11-18
GPT-5.1 Codex Mini
openai/gpt-5.1-codex-miniContext256K
Max Out66K
Input$0.25/M
Output$2/M
Web Search$0.01/R
Cache Read$0.03/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-11-13
OpenAI: GPT-5.1
openai/gpt-5.1Context256K
Max Out128K
Input$1.25/M
Output$10/M
Web Search$0.01/R
Cache Read$0.13/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-11-13
MiniMax: MiniMax M2
minimax/minimax-m2Context205K
Max Out131K
Input$0.3/M
Output$1.2/M
Cache Read$0.03/M
Cache Write$0.375/M
chatFunctionsReasoningCachingWeb2025-10-23
Anthropic: Claude Haiku 4.5
anthropic/claude-haiku-4.5Context200K
Max Out64K
Input$1/M
Output$5/M
Cache Read$0.1/M
Cache 5m$1.25/M
Cache 1h$2/M
chatVisionFunctionsCachingPDF2025-10-15
Doubao Seed 1.6
volcengine/doubao-seed-1-6Context256K
Max Out64K
Input$0.12/M
Output$0.29/M
Cache Read$0.023/M
chatFunctionsReasoningCaching2025-10-15
Z.ai: GLM-4.6
z-ai/glm-4.6Context200K
Max Out128K
Input$0.4/M
Output$1.9/M
Web Search$0.005/R
Cache Read$0.11/M
chatFunctionsReasoningCachingWeb2025-09-30
Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5Context1M
Max Out64K
Input$3/M
Output$15/M
Cache Read$0.3/M
Cache 5m$3.75/M
Cache 1h$6/M
chatVisionFunctionsReasoningCachingPDF2025-09-29
Qwen3 235B A22B (free)
bailian/qwen3-235b-a22b:freeContext128K
Max Out16K
Input$0/M
Output$0/M
chatFunctionsReasoning2025-09-23
Qwen3 Coder Plus
bailian/qwen3-coder-plusContext1M
Max Out64K
Input$1.8/M
Output$9/M
Cache Read$0.2/M
Cache Write$1/M
chatFunctionsReasoningCaching2025-09-23
Doubao Seed 1.6 Flash
volcengine/doubao-seed-1-6-flashContext256K
Max Out32K
Input$0.03/M
Output$0.22/M
Cache Read$0.0043/M
chatFunctionsCaching2025-08-28
Doubao Seed 1.6 Vision
volcengine/doubao-seed-1-6-visionContext256K
Max Out32K
Input$0.12/M
Output$1.15/M
Cache Read$0.023/M
chatVisionFunctionsCaching2025-08-15
Qwen VL Max
bailian/qwen-vl-maxContext128K
Max Out8K
Input$0.23/M
Output$0.58/M
Cache Read$0.046/M
chatVisionFunctions2025-08-13
GPT-5
openai/gpt-5Context256K
Max Out66K
Input$1.25/M
Output$10/M
Web Search$0.01/R
Cache Read$0.13/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-08-07
GPT-5 Mini
openai/gpt-5-miniContext256K
Max Out33K
Input$0.25/M
Output$2/M
Web Search$0.01/R
Cache Read$0.03/M
chatVisionFunctionsCachingAudioPDF2025-08-07
GPT-5 Nano
openai/gpt-5-nanoContext128K
Max Out16K
Input$0.05/M
Output$0.4/M
Web Search$0.01/R
Cache Read$0.01/M
chatFunctionsCaching2025-08-07
Qwen3 Coder Flash
bailian/qwen3-coder-flashContext1M
Max Out64K
Input$0.5/M
Output$2.5/M
Cache Read$0.06/M
Cache Write$0.27/M
chatFunctionsReasoningCaching2025-08-05
Qwen Flash
bailian/qwen-flashContext1M
Max Out32K
Input$0.022/M
Output$0.22/M
Web Search$0.01/R
Cache Read$0.0043/M
Cache Write$0.027/M
chatFunctionsCachingWeb2025-07-28
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-liteContext1M
Max Out66K
Input$0.1/M
Output$0.4/M
Audio$0.3/M
Web Search$0.035/R
Cache Read$0.025/M
Cache Write$1/M
Cached Audio$0.3/M
chatVisionFunctionsCachingPDF2025-07-22
Qwen Turbo
bailian/qwen-turboContext128K
Max Out16K
Input$0.05/M
Output$0.09/M
Cache Read$0.0086/M
chatFunctions2025-07-15
Google: Gemini 2.5 Flash
google/gemini-2.5-flashContext1M
Max Out66K
Input$0.3/M
Output$2.5/M
Audio$1/M
Web Search$0.035/R
Cache Read$0.03/M
Cache Write$1/M
Cached Audio$0.1/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-06-17
Google: Gemini 2.5 Pro
google/gemini-2.5-proContext1M
Max Out66K
Input$1.25/M
Output$10/M
Audio$1.25/M
Web Search$0.035/R
Cache Read$0.125/M
Cache Write$4.5/M
Cached Audio$0.125/M
chatVisionFunctionsReasoningCachingWebAudioVideoPDF2025-06-17
GPT-4.1
openai/gpt-4.1Context1M
Max Out33K
Input$2/M
Output$8/M
Web Search$0.01/R
Cache Read$0.5/M
chatVisionFunctionsCachingAudioPDF2025-04-14
GPT-4.1 Mini
openai/gpt-4.1-miniContext1M
Max Out33K
Input$0.4/M
Output$1.6/M
Web Search$0.01/R
Cache Read$0.1/M
chatVisionFunctionsCaching2025-04-14
Qwen Max
bailian/qwen-maxContext32K
Max Out8K
Input$0.35/M
Output$1.38/M
Cache Read$0.069/M
chatFunctionsCaching2025-01-25
GPT-4o Mini
openai/gpt-4o-miniContext128K
Max Out16K
Input$0.15/M
Output$0.6/M
Web Search$0.01/R
Cache Read$0.075/M
chatVisionFunctionsCaching2024-07-18
GPT-4o
openai/gpt-4oContext128K
Max Out16K
Input$2.5/M
Output$10/M
Web Search$0.01/R
Cache Read$1.25/M
chatVisionFunctionsCachingAudio2024-05-13