
granite-4.0-h-micro vs gpt-oss-safeguard-20b

Metric                          granite-4.0-h-micro   gpt-oss-safeguard-20b
Input price (per 1M tokens)     $0.02                 $0.08
Output price (per 1M tokens)    $0.11                 $0.30
Context window                  131K                  131K
Throughput                      155 tok/s             121 tok/s
Availability                    100.0%                97.3%
Cost / task                     $0.000                $0.000
Efficiency score                89                    89

Estimated monthly cost by workload

Workload              granite-4.0-h-micro   gpt-oss-safeguard-20b
Chat assistant        $19.20                $60.00
RAG / long context    $33.90                $123.00
Agent / tool use      $54.00                $165.60
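
The monthly estimates above are consistent with a simple linear formula: input price times input volume plus output price times output volume. The sketch below illustrates that arithmetic. The prices are taken from the comparison table; the monthly token volumes are not stated on this page and are an assumption, back-solved so that the formula reproduces the listed figures. The names `Pricing`, `WORKLOADS`, `monthly_cost` and `cost_table` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


# Prices as listed in the comparison table (illustrative demo data).
PRICES = {
    "granite-4.0-h-micro": Pricing(0.02, 0.11),
    "gpt-oss-safeguard-20b": Pricing(0.08, 0.30),
}

# ASSUMPTION: monthly volumes in millions of tokens (input, output).
# These are not given on the page; they were back-solved from the
# monthly cost figures and may not match the page's real methodology.
WORKLOADS = {
    "Chat assistant": (300, 120),
    "RAG / long context": (1200, 90),
    "Agent / tool use": (720, 360),
}


def monthly_cost(pricing: Pricing, input_m: float, output_m: float) -> float:
    """Linear cost estimate in USD, rounded to cents."""
    cost = pricing.input_per_m * input_m + pricing.output_per_m * output_m
    return round(cost, 2)


def cost_table() -> dict:
    """Monthly cost per workload and model under the assumed volumes."""
    return {
        workload: {
            model: monthly_cost(pricing, *volumes)
            for model, pricing in PRICES.items()
        }
        for workload, volumes in WORKLOADS.items()
    }


if __name__ == "__main__":
    for workload, costs in cost_table().items():
        line = ", ".join(f"{model}: ${cost:.2f}" for model, cost in costs.items())
        print(f"{workload}: {line}")
```

To estimate your own workload, replace the volumes in `WORKLOADS` with your expected monthly input and output token counts. Output-heavy workloads narrow or widen the gap depending on each model's output price, which is why the input price alone does not settle the comparison.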

Efficiency score: granite-4.0-h-micro

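Across price, speed and reliability, granite-4.0-h-micro offers the stronger overall balance for most workloads: it is cheaper on both input ($0.02 vs $0.08) and output ($0.11 vs $0.30), faster (155 vs 121 tok/s) and more available (100.0% vs 97.3%), although both models share the same efficiency score of 89. The right pick still depends on your exact mix of input, output and latency needs.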

Figures are illustrative demo data, not financial advice.

Frequently asked questions

Is granite-4.0-h-micro or gpt-oss-safeguard-20b cheaper?

granite-4.0-h-micro has the lower input price ($0.02 vs $0.08 per 1M tokens) and the lower output price ($0.11 vs $0.30), so for most blended workloads it is the more cost-effective of the two. Figures are illustrative demo data.

Which should I choose, granite-4.0-h-micro or gpt-oss-safeguard-20b?

For most workloads, granite-4.0-h-micro: it has lower prices, higher throughput (155 vs 121 tok/s) and higher availability (100.0% vs 97.3%) at the same 131K context window. The right pick still depends on your exact mix of input, output and latency needs.