LIVE

BODYBUILDER$-1000000.00▼ 5.4%

MINIMAX-M2.7$1.20▼ 7.2%

QWEN-PLUS-2025$0.78▼ 28.3%

DEEPSEEK-AI$0.00▼ 27.0%

GPT-4.1-NANO$0.40▼ 13.0%

GLM-4-LONG$0.00▼ 14.1%

GROK-4.1-FAST$0.50▼ 16.4%

GROK-4-FAST-RE$0.50▼ 17.1%

GOOGLE$0.00▼ 13.0%

LLAMA-4-MAVERI$0.60▼ 11.9%

GROK-4-FAST-NO$0.50▼ 14.3%

GROK-4-FAST$0.50▼ 6.0%

GROK-4-1-FAST-$0.50▼ 31.2%

MINIMAX-M2.5$0.99▲ 10.7%

GOOGLE$1.50▼ 3.4%

GOOGLE$0.00▼ 27.8%

XAI$0.00▲ 4.5%

-CF$0.00▼ 9.7%

GEMINI-3.1-FLA$1.50▲ 9.4%

MINIMAX-M2.7$0.00▼ 9.9%

GPT-4.1-MINI-2$1.60▼ 31.8%

DEEPSEEK$0.28▼ 2.2%

LYRIA-3-PRO-PR$0.00▼ 29.4%

LYRIA-3-CLIP-P$0.00▼ 22.9%

MINIMAX-M2.1$0.95▲ 2.6%

MINIMAX-01$1.10▲ 8.0%

MINIMAX-M3$1.20▼ 6.1%

GEMINI-2.5-FLA$0.40▼ 28.8%

BODYBUILDER$-1000000.00▼ 5.4%

MINIMAX-M2.7$1.20▼ 7.2%

QWEN-PLUS-2025$0.78▼ 28.3%

DEEPSEEK-AI$0.00▼ 27.0%

GPT-4.1-NANO$0.40▼ 13.0%

GLM-4-LONG$0.00▼ 14.1%

GROK-4.1-FAST$0.50▼ 16.4%

GROK-4-FAST-RE$0.50▼ 17.1%

GOOGLE$0.00▼ 13.0%

LLAMA-4-MAVERI$0.60▼ 11.9%

GROK-4-FAST-NO$0.50▼ 14.3%

GROK-4-FAST$0.50▼ 6.0%

GROK-4-1-FAST-$0.50▼ 31.2%

MINIMAX-M2.5$0.99▲ 10.7%

GOOGLE$1.50▼ 3.4%

GOOGLE$0.00▼ 27.8%

XAI$0.00▲ 4.5%

-CF$0.00▼ 9.7%

GEMINI-3.1-FLA$1.50▲ 9.4%

MINIMAX-M2.7$0.00▼ 9.9%

GPT-4.1-MINI-2$1.60▼ 31.8%

DEEPSEEK$0.28▼ 2.2%

LYRIA-3-PRO-PR$0.00▼ 29.4%

LYRIA-3-CLIP-P$0.00▼ 22.9%

MINIMAX-M2.1$0.95▲ 2.6%

MINIMAX-01$1.10▲ 8.0%

MINIMAX-M3$1.20▼ 6.1%

GEMINI-2.5-FLA$0.40▼ 28.8%

devstral-medium vs qwen/qwen2.5-coder-32b-instruct

qwen/qwen2.5-coder-32b-instruct

qwen/qwen2.5-coder-32b-instruct

Higher efficiency

qwen/qwen2.5-coder-32b-instruct

Metric

devstral-medium qwen/qwen2.5-coder-32b-instruct

Input price

$0.40

$0.00

Output price

$2.00

$0.00

Context window

131K

1K

Throughput

147 tok/s

169 tok/s

Availability

98.3%

98.1%

Cost / task

$0.002

$0.000

Efficiency score

88

88

Estimated monthly cost by workload

Metric

DEVSTRAL-MEDIU

QWEN

Chat assistant

$360.00

$0.00

RAG / long context

$660.00

$0.00

Agent / tool use

$1,008

$0.00

Efficiency score: qwen/qwen2.5-coder-32b-instruct

Across price, speed and reliability, qwen/qwen2.5-coder-32b-instruct offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.

Figures are illustrative demo data, not financial advice.

Frequently asked questions

Is devstral-medium or qwen/qwen2.5-coder-32b-instruct cheaper?+

qwen/qwen2.5-coder-32b-instruct has the lower input price — $0.00 vs $0.40 per 1M tokens — so for most blended workloads it is the more cost-effective of the two. Figures are illustrative demo data.

Which should I choose, devstral-medium or qwen/qwen2.5-coder-32b-instruct?+

Across price, speed and reliability, qwen/qwen2.5-coder-32b-instruct offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.