LIVE
BODYBUILDER$-1000000.00 5.4%
GLM-4-LONG$0.00 14.1%
GEMINI-2.0-FLA$0.30 20.0%
GEMINI-2.0-FLA$0.40 6.6%
-CF$0.00 9.7%
DEEPSEEK-V4-FL$0.28 26.3%
LLAMA-4-MAVERI$0.60 11.9%
GROK-4-FAST$0.50 6.0%
QWEN-PLUS$0.78 0.6%
MINIMAX-M2.1$0.95 2.6%
MINIMAX-M2$1.00 2.9%
MINIMAX-01$1.10 8.0%
GPT-4.1-MINI$1.60 13.7%
GROK-4-1-FAST-$0.50 8.9%
GPT-4.1-MINI-2$1.60 31.8%
XAI$0.00 4.5%
MINIMAX-M2.7$0.00 9.9%
QWEN3.5-FLASH-$0.26 31.0%
GPT-4.1-NANO$0.40 13.0%
GEMINI-2.5-FLA$0.40 0.8%
DEEPSEEK-AI$0.00 27.0%
GROK-4.1-FAST$0.50 16.4%
MINIMAX-M3$1.20 6.1%
GOOGLE$1.50 3.4%
AUTO$0.00 9.7%
GOOGLE$0.00 27.8%
MINIMAX-M2.7$1.20 7.2%
GOOGLE$0.00 0.6%
BODYBUILDER$-1000000.00 5.4%
GLM-4-LONG$0.00 14.1%
GEMINI-2.0-FLA$0.30 20.0%
GEMINI-2.0-FLA$0.40 6.6%
-CF$0.00 9.7%
DEEPSEEK-V4-FL$0.28 26.3%
LLAMA-4-MAVERI$0.60 11.9%
GROK-4-FAST$0.50 6.0%
QWEN-PLUS$0.78 0.6%
MINIMAX-M2.1$0.95 2.6%
MINIMAX-M2$1.00 2.9%
MINIMAX-01$1.10 8.0%
GPT-4.1-MINI$1.60 13.7%
GROK-4-1-FAST-$0.50 8.9%
GPT-4.1-MINI-2$1.60 31.8%
XAI$0.00 4.5%
MINIMAX-M2.7$0.00 9.9%
QWEN3.5-FLASH-$0.26 31.0%
GPT-4.1-NANO$0.40 13.0%
GEMINI-2.5-FLA$0.40 0.8%
DEEPSEEK-AI$0.00 27.0%
GROK-4.1-FAST$0.50 16.4%
MINIMAX-M3$1.20 6.1%
GOOGLE$1.50 3.4%
AUTO$0.00 9.7%
GOOGLE$0.00 27.8%
MINIMAX-M2.7$1.20 7.2%
GOOGLE$0.00 0.6%

qwen3-coder-flash vs qwen3-coder-plus

Input price
$0.30
$0.65
Output price
$1.50
$3.25
Context window
1000K
1000K
Throughput
127 tok/s
152 tok/s
Availability
97.7%
100.0%
Cost / task
$0.001
$0.003
Efficiency score
95
95

Estimated monthly cost by workload

Metric
QWEN3-CODER-FL
QWEN3-CODER-PL
Chat assistant
$270.00
$585.00
RAG / long context
$495.00
$1,073
Agent / tool use
$756.00
$1,638

Efficiency score: qwen3-coder-flash

Across price, speed and reliability, qwen3-coder-flash offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.

Figures are illustrative demo data, not financial advice.

Frequently asked questions

Is qwen3-coder-flash or qwen3-coder-plus cheaper?+

qwen3-coder-flash has the lower input price — $0.30 vs $0.65 per 1M tokens — so for most blended workloads it is the more cost-effective of the two. Figures are illustrative demo data.

Which should I choose, qwen3-coder-flash or qwen3-coder-plus?+

Across price, speed and reliability, qwen3-coder-flash offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.