Granite 4.0 Micro vs LFM2-24B-A2B
Input price
$0.02
$0.03
Output price
$0.11
$0.12
Context window
131K
128K
Throughput
155 tok/s
159 tok/s
Availability
100.0%
100.0%
Cost / task
$0.000
$0.000
Efficiency score
89
89
Estimated monthly cost by workload
Metric
GRANITE-4.0-H-
LFM-2-24B-A2B
Chat assistant
$19.20
$23.40
RAG / long context
$33.90
$46.80
Agent / tool use
$54.00
$64.80
Efficiency score: Granite 4.0 Micro
Across price, speed and reliability, Granite 4.0 Micro offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.
Figures are illustrative demo data, not financial advice.
Frequently asked questions
Is Granite 4.0 Micro or LFM2-24B-A2B cheaper?+
Granite 4.0 Micro has the lower input price — $0.02 vs $0.03 per 1M tokens — so for most blended workloads it is the more cost-effective of the two. Figures are illustrative demo data.
Which should I choose, Granite 4.0 Micro or LFM2-24B-A2B?+
Across price, speed and reliability, Granite 4.0 Micro offers the stronger overall balance for most workloads — but the right pick depends on your exact mix of input, output and latency needs.