USTextNEMOTRON-NANO- Live · updated daily

nemotron-nano-9b-v2:free

NVIDIA

Efficiency score

89/100

−16.5%

Input price

$0.00/ 1M tokens

Output price

$0.00/ 1M tokens

Context window

128K

Throughput

138tok/s

Availability

97.5%

Cost / task

$0.000

Capabilities

Accepts (input)

Text

Produces (output)

Text

7-day heat trend

−16.5%

Pricing breakdown

Input price

$0.00/ 1M tokens

$0.0000 / 1K

Output price

$0.00/ 1M tokens

$0.0000 / 1K

Blended price

$0.00/ 1M tokens

$0.0000 / 1K

Typical 3:1 output-to-input mix, per 1M tokens

Estimated monthly cost by workload

Chat assistant

$0.00/ mo

1K in · 400 out · 10K req/day

RAG / long context

$0.00/ mo

8K in · 600 out · 5K req/day

Agent / tool use

$0.00/ mo

3K in · 1.5K out · 8K req/day

Estimate your cost →

Market position

Cheaper than 43% of tracked models
Faster than 30% of tracked models
Efficiency rank: #399 of 1105

Best suited for

General-purpose text generation, chat, summarization and content workloads where broad capability and low cost matter most.

About nemotron-nano-9b-v2:free

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

nemotron-nano-9b-v2:free is a Text model from NVIDIA (US). HotON.ai tracks it at $0.00 per 1M input tokens and $0.00 per 1M output tokens, with a 128K-token context window, ~138 tokens/sec throughput and 97.5% availability. Its composite efficiency score is 89/100 at an estimated $0.000 per successful task.

Compare nemotron-nano-9b-v2:free

NEMOTRON-NANO- vs ZAMBA2-7B-INST NEMOTRON-NANO- vs AION-1.0-MINI NEMOTRON-NANO- vs TONGYI-DEEPRES NEMOTRON-NANO- vs MAGNUM-V4-9B

Related market news

Google’s New Colab CLI Lets Developers and AI Agents Run Python on Remote Colab GPUs and TPUs From the Terminal5 hours ago OpenAI unveils Lockdown Mode to protect sensitive data from prompt injection attacks6 hours ago Five labs, five minds: building a multi-model finance drama on small models8 hours ago

Frequently asked questions

How much does nemotron-nano-9b-v2:free cost per 1M tokens?+

nemotron-nano-9b-v2:free is tracked at $0.00 per 1M input tokens and $0.00 per 1M output tokens. A typical 3:1 output-to-input workload blends to roughly $0.00 per 1M tokens. Figures are illustrative demo data.

What is nemotron-nano-9b-v2:free best for?+

General-purpose text generation, chat, summarization and content workloads where broad capability and low cost matter most.

How fast is nemotron-nano-9b-v2:free?+

nemotron-nano-9b-v2:free delivers about 138 tokens/sec with 97.5% tracked availability, suitable for latency-sensitive, real-time applications.

Is nemotron-nano-9b-v2:free cheaper than other AI models?+

Within the HotON.ai tracked set, nemotron-nano-9b-v2:free is cheaper than 43% of models on input price and ranks #399 of 1105 by overall efficiency.

Related models

tongyi-deepresearch-30b-a3b

Pricing is real (via the TestKey catalog, updated daily). Quality (Arena Elo) is real where the model is ranked on LMArena. Speed, availability and efficiency are modeled estimates.