How much does Qwen Flash cost on Ofox?

Qwen Flash on Ofox.ai costs $0.022/M per million input tokens and $0.22/M per million output tokens. Pay-as-you-go, no monthly fees.

What is Qwen Flash's context window?

Qwen Flash supports a context window of 1,000,000 tokens, allowing you to process large documents and maintain long conversations.

How to use Qwen Flash API via Ofox?

Simply set your base URL to https://api.ofox.ai/v1 and use your Ofox API key. The API is OpenAI-compatible — just change the base URL and API key in your existing code.

What capabilities does Qwen Flash support?

Qwen Flash supports the following capabilities: function calling, prompt caching, web search. Access all features through the Ofox.ai unified API.

Qwen Flash

Name: Qwen Flash
Brand: bailian
Price: 0.022 USD
Availability: InStock
Rating: 5 (1 reviews)

Chat

bailian/qwen-flash

비교시작하기

Qwen Flash는 Alibaba Qwen 계열에서 가장 빠르고 가장 저렴한 모델로, Dashscope를 통해 제공되며 지연 시간에 민감한 작업을 위해 설계되었습니다. 초고속 추론을 제공하면서 도구 호출(function calling), 프롬프트 캐싱, 웹 검색을 지원하며, 가격은 입력 $0.022/M tokens, 출력 $0.22/M tokens입니다. 컨텍스트 윈도우: 1M tokens, 출력: 32K. OpenAI 및 Anthropic 프로토콜로 사용할 수 있습니다.

컨텍스트 윈도우

최대 출력 토큰

32K

출시일

2025-07-28

기능

Function Calling프롬프트 캐싱웹 검색

제공업체

Aliyun

지원 프로토콜

openaianthropic

Providers

Aliyun

입력 토큰

$0.022/M

출력 토큰

$0.22/M

캐시 읽기

$0.0043/M

캐시 쓰기

$0.027/M

웹 검색

$0.01/R

Protocols

openai/v1/chat/completions/v1/responses

anthropic

코드 예제

from openai import OpenAI

client = OpenAI(
    base_url="https://api.ofox.ai/v1",
    api_key="YOUR_OFOX_API_KEY",
)

response = client.chat.completions.create(
    model="bailian/qwen-flash",
    messages=[
        {"role": "user", "content": "Hello!"}
    ],
)

print(response.choices[0].message.content)