Claude Sonnet 4.6 vs Voxtral Small 24B 2507
Input price
$3.00
$0.10
Output price
$15.00
$0.30
Context window
1000K
32K
Throughput
172 tok/s
154 tok/s
Availability
100.0%
99.6%
Cost / task
$0.014
$0.000
Efficiency score
89
89
Estimated monthly cost by workload
Metric
CLAUDE-SONNET-
VOXTRAL-SMALL-
Chat assistant
$2,700
$66.00
RAG / long context
$4,950
$147.00
Agent / tool use
$7,560
$180.00
Efficiency score: Voxtral Small 24B 2507
Across price, speed and reliability, Voxtral Small 24B 2507 offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.
Figures are illustrative demo data, not financial advice.
Frequently asked questions
Is Claude Sonnet 4.6 or Voxtral Small 24B 2507 cheaper?+
Voxtral Small 24B 2507 has the lower input price — $0.10 vs $3.00 per 1M tokens — so for most blended workloads it is the more cost-effective of the two. Figures are illustrative demo data.
Which should I choose, Claude Sonnet 4.6 or Voxtral Small 24B 2507?+
Across price, speed and reliability, Voxtral Small 24B 2507 offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.