Voxtral Small 24B 2507 vs GPT-5.4 Mini
Lower price
Voxtral Small 24B 2507
Faster
Voxtral Small 24B 2507
Higher efficiency
Voxtral Small 24B 2507
Input price
$0.10
$0.75
Output price
$0.30
$4.50
Context window
32K
400K
Throughput
154 tok/s
130 tok/s
Availability
99.6%
100.0%
Cost / task
$0.000
$0.004
Efficiency score
89
89
Estimated monthly cost by workload
Metric
VOXTRAL-SMALL-
GPT-5.4-MINI
Chat assistant
$66.00
$765.00
RAG / long context
$147.00
$1,305
Agent / tool use
$180.00
$2,160
Efficiency score: Voxtral Small 24B 2507
Across price, speed and reliability, Voxtral Small 24B 2507 offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.
Figures are illustrative demo data, not financial advice.
Frequently asked questions
Is Voxtral Small 24B 2507 or GPT-5.4 Mini cheaper?+
Voxtral Small 24B 2507 has the lower input price — $0.10 vs $0.75 per 1M tokens — so for most blended workloads it is the more cost-effective of the two. Figures are illustrative demo data.
Which should I choose, Voxtral Small 24B 2507 or GPT-5.4 Mini?+
Across price, speed and reliability, Voxtral Small 24B 2507 offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.