
Nemotron 3 Nano 30B A3B

NVIDIA
Efficiency score: 90/100, +0.6%
Input price: $0.05 / 1M tokens
Output price: $0.20 / 1M tokens
Context window: 262K
Throughput: 151 tok/s
Availability: 96.9%
Cost / task: $0.000

Capabilities

Accepts (input)
Text
Produces (output)
Text

7-day heat trend

+0.6%

Pricing breakdown

Input price: $0.05 / 1M tokens ($0.00005 / 1K)
Output price: $0.20 / 1M tokens ($0.0002 / 1K)
Blended price: $0.16 / 1M tokens ($0.00016 / 1K)

Typical 3:1 output-to-input mix, per 1M tokens
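
The blended figure can be reproduced as a weighted average of the two listed prices. The sketch below assumes that "3:1 output-to-input" means three output tokens for every input token; the function names are illustrative and not part of any HotON.ai or OpenRouter API.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  output_to_input: float = 3.0) -> float:
    """Weighted average price per 1M tokens for a given output:input ratio.

    Assumption: the mix is counted in tokens, so a 3:1 ratio means
    1 part input tokens and 3 parts output tokens.
    """
    total_parts = 1.0 + output_to_input
    return (input_per_m + output_to_input * output_per_m) / total_parts


def per_1k(price_per_m: float) -> float:
    """Convert a price per 1M tokens into a price per 1K tokens."""
    return price_per_m / 1000.0


# Listed prices for this model, in USD per 1M tokens.
INPUT_PER_M = 0.05
OUTPUT_PER_M = 0.20

blended = blended_price(INPUT_PER_M, OUTPUT_PER_M)
print(f"blended: ${blended:.4f} / 1M tokens")
print(f"input:   ${per_1k(INPUT_PER_M):.5f} / 1K tokens")
print(f"blended: ${per_1k(blended):.5f} / 1K tokens")
```

Under these assumptions the blend works out to (0.05 + 3 × 0.20) / 4 = $0.1625 per 1M tokens, which the page rounds to $0.16.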

Estimated monthly cost by workload

Chat assistant: $39.00 / mo (1K in · 400 out · 10K req/day)
RAG / long context: $78.00 / mo (8K in · 600 out · 5K req/day)
Agent / tool use: $108.00 / mo (3K in · 1.5K out · 8K req/day)
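
The three workload estimates follow from the listed per-token prices. The sketch below is one way to reproduce them, assuming a 30-day month and 1K = 1,000 tokens; neither assumption is stated on the page, and the `Workload` type and `monthly_cost` function are hypothetical names used only for illustration.

```python
from dataclasses import dataclass

INPUT_PER_M = 0.05    # USD per 1M input tokens (listed price)
OUTPUT_PER_M = 0.20   # USD per 1M output tokens (listed price)
DAYS_PER_MONTH = 30   # assumption: the page does not state the month length


@dataclass(frozen=True)
class Workload:
    name: str
    input_tokens: int       # tokens sent per request
    output_tokens: int      # tokens generated per request
    requests_per_day: int


def monthly_cost(w: Workload,
                 input_per_m: float = INPUT_PER_M,
                 output_per_m: float = OUTPUT_PER_M,
                 days: int = DAYS_PER_MONTH) -> float:
    """Estimated monthly spend in USD for one workload profile."""
    daily_input = w.input_tokens * w.requests_per_day
    daily_output = w.output_tokens * w.requests_per_day
    daily_cost = (daily_input / 1e6) * input_per_m \
               + (daily_output / 1e6) * output_per_m
    return daily_cost * days


WORKLOADS = [
    Workload("Chat assistant", 1_000, 400, 10_000),
    Workload("RAG / long context", 8_000, 600, 5_000),
    Workload("Agent / tool use", 3_000, 1_500, 8_000),
]

for w in WORKLOADS:
    print(f"{w.name}: ${monthly_cost(w):.2f} / mo")
```

With these inputs the chat profile costs $1.30 per day (10M input tokens and 4M output tokens), which gives the $39.00 monthly figure; the other two profiles work out the same way to $78.00 and $108.00.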

Market position

  • Cheaper than 91% of tracked models
  • Faster than 50% of tracked models
  • Efficiency rank: #42 of 120

Best suited for

General-purpose text generation, chat, summarization and content workloads where broad capability and low cost matter most.

About Nemotron 3 Nano 30B A3B

Nemotron 3 Nano 30B A3B is a text model from NVIDIA (US). HotON.ai tracks it at $0.05 per 1M input tokens and $0.20 per 1M output tokens, with a 262K-token context window, an estimated throughput of about 151 tokens/sec, and 96.9% availability. Its composite efficiency score is 90/100 at an estimated $0.000 per successful task.

Frequently asked questions

How much does Nemotron 3 Nano 30B A3B cost per 1M tokens?

Nemotron 3 Nano 30B A3B is tracked at $0.05 per 1M input tokens and $0.20 per 1M output tokens. A typical 3:1 output-to-input workload blends to roughly $0.16 per 1M tokens. Figures are illustrative demo data.

What is Nemotron 3 Nano 30B A3B best for?

General-purpose text generation, chat, summarization and content workloads where broad capability and low cost matter most.

How fast is Nemotron 3 Nano 30B A3B?

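Nemotron 3 Nano 30B A3B is estimated at about 151 tokens/sec, faster than 50% of tracked models, with 96.9% tracked availability. That may suit latency-sensitive, real-time applications, though the speed figure is modeled rather than measured.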

Is Nemotron 3 Nano 30B A3B cheaper than other AI models?

Within the HotON.ai tracked set, Nemotron 3 Nano 30B A3B is cheaper than 91% of models on input price and ranks #42 of 120 by overall efficiency.

Pricing and availability are real, sourced via OpenRouter and updated daily; availability is the best-serving provider's 24h uptime. Efficiency is a HotON composite of real price and context. Speed is a modeled estimate.