LIVE
BODYBUILDER$-1000000.00 5.4%
MINIMAX-M2.7$1.20 7.2%
QWEN-PLUS-2025$0.78 28.3%
DEEPSEEK-AI$0.00 27.0%
GPT-4.1-NANO$0.40 13.0%
GLM-4-LONG$0.00 14.1%
GROK-4.1-FAST$0.50 16.4%
GROK-4-FAST-RE$0.50 17.1%
GOOGLE$0.00 13.0%
LLAMA-4-MAVERI$0.60 11.9%
GROK-4-FAST-NO$0.50 14.3%
GROK-4-FAST$0.50 6.0%
GROK-4-1-FAST-$0.50 31.2%
MINIMAX-M2.5$0.99 10.7%
GOOGLE$1.50 3.4%
GOOGLE$0.00 27.8%
XAI$0.00 4.5%
-CF$0.00 9.7%
GEMINI-3.1-FLA$1.50 9.4%
MINIMAX-M2.7$0.00 9.9%
GPT-4.1-MINI-2$1.60 31.8%
DEEPSEEK$0.28 2.2%
LYRIA-3-PRO-PR$0.00 29.4%
LYRIA-3-CLIP-P$0.00 22.9%
MINIMAX-M2.1$0.95 2.6%
MINIMAX-01$1.10 8.0%
MINIMAX-M3$1.20 6.1%
GEMINI-2.5-FLA$0.40 28.8%
BODYBUILDER$-1000000.00 5.4%
MINIMAX-M2.7$1.20 7.2%
QWEN-PLUS-2025$0.78 28.3%
DEEPSEEK-AI$0.00 27.0%
GPT-4.1-NANO$0.40 13.0%
GLM-4-LONG$0.00 14.1%
GROK-4.1-FAST$0.50 16.4%
GROK-4-FAST-RE$0.50 17.1%
GOOGLE$0.00 13.0%
LLAMA-4-MAVERI$0.60 11.9%
GROK-4-FAST-NO$0.50 14.3%
GROK-4-FAST$0.50 6.0%
GROK-4-1-FAST-$0.50 31.2%
MINIMAX-M2.5$0.99 10.7%
GOOGLE$1.50 3.4%
GOOGLE$0.00 27.8%
XAI$0.00 4.5%
-CF$0.00 9.7%
GEMINI-3.1-FLA$1.50 9.4%
MINIMAX-M2.7$0.00 9.9%
GPT-4.1-MINI-2$1.60 31.8%
DEEPSEEK$0.28 2.2%
LYRIA-3-PRO-PR$0.00 29.4%
LYRIA-3-CLIP-P$0.00 22.9%
MINIMAX-M2.1$0.95 2.6%
MINIMAX-01$1.10 8.0%
MINIMAX-M3$1.20 6.1%
GEMINI-2.5-FLA$0.40 28.8%

accounts/fireworks/models/kimi-k2p5 vs qwen3-vl-32b-instruct

Input price
$0.60
$0.10
Output price
$3.00
$0.42
Context window
262K
131K
Throughput
132 tok/s
147 tok/s
Availability
96.4%
100.0%
Cost / task
$0.003
$0.000
Efficiency score
89
89

Estimated monthly cost by workload

Metric
ACCOUNTS
QWEN3-VL-32B-I
Chat assistant
$540.00
$80.40
RAG / long context
$990.00
$157.80
Agent / tool use
$1,512
$223.20

Efficiency score: qwen3-vl-32b-instruct

Across price, speed and reliability, qwen3-vl-32b-instruct offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.

Figures are illustrative demo data, not financial advice.

Frequently asked questions

Is accounts/fireworks/models/kimi-k2p5 or qwen3-vl-32b-instruct cheaper?+

qwen3-vl-32b-instruct has the lower input price — $0.10 vs $0.60 per 1M tokens — so for most blended workloads it is the more cost-effective of the two. Figures are illustrative demo data.

Which should I choose, accounts/fireworks/models/kimi-k2p5 or qwen3-vl-32b-instruct?+

Across price, speed and reliability, qwen3-vl-32b-instruct offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.