LIVE
DEEPSEEK-V4-FL$0.20 26.3%
DEEPSEEK-V4-PR$0.87 4.6%
QWEN3.6-FLASH$1.13 24.9%
NEMOTRON-3-SUP$0.45 23.9%
LLAMA-4-MAVERI$0.60 11.9%
LLAMA-4-SCOUT$0.30 5.8%
GEMINI-3.1-FLA$1.50 10.0%
GEMINI-2.5-FLA$0.40 0.8%
MINIMAX-01$1.10 8.0%
MIMO-V2.5$0.28 5.8%
MIMO-V2.5-PRO$0.87 2.6%
MINIMAX-M3$1.20 6.1%
QWEN3.5-PLUS-2$1.80 6.0%
NOVA-2-LITE-V1$2.50 31.4%
GEMINI-2.5-FLA$2.50 9.5%
GROK-4.3$2.50 8.7%
QWEN3.6-PLUS$1.95 31.6%
NEMOTRON-3-ULT$2.50 25.9%
QWEN3.7-PLUS$1.60 21.4%
MINIMAX-M1$2.20 11.9%
PALMYRA-X5$6.00 26.9%
QWEN3.7-MAX$3.75 3.1%
GEMINI-3.5-FLA$9.00 8.3%
GEMINI-2.5-PRO$10.00 7.2%
GPT-5.4-NANO$1.25 10.0%
NOVA-LITE-V1$0.24 28.9%
KIMI-K2.5$1.90 9.8%
MINISTRAL-14B-$0.20 17.7%
DEEPSEEK-V4-FL$0.20 26.3%
DEEPSEEK-V4-PR$0.87 4.6%
QWEN3.6-FLASH$1.13 24.9%
NEMOTRON-3-SUP$0.45 23.9%
LLAMA-4-MAVERI$0.60 11.9%
LLAMA-4-SCOUT$0.30 5.8%
GEMINI-3.1-FLA$1.50 10.0%
GEMINI-2.5-FLA$0.40 0.8%
MINIMAX-01$1.10 8.0%
MIMO-V2.5$0.28 5.8%
MIMO-V2.5-PRO$0.87 2.6%
MINIMAX-M3$1.20 6.1%
QWEN3.5-PLUS-2$1.80 6.0%
NOVA-2-LITE-V1$2.50 31.4%
GEMINI-2.5-FLA$2.50 9.5%
GROK-4.3$2.50 8.7%
QWEN3.6-PLUS$1.95 31.6%
NEMOTRON-3-ULT$2.50 25.9%
QWEN3.7-PLUS$1.60 21.4%
MINIMAX-M1$2.20 11.9%
PALMYRA-X5$6.00 26.9%
QWEN3.7-MAX$3.75 3.1%
GEMINI-3.5-FLA$9.00 8.3%
GEMINI-2.5-PRO$10.00 7.2%
GPT-5.4-NANO$1.25 10.0%
NOVA-LITE-V1$0.24 28.9%
KIMI-K2.5$1.90 9.8%
MINISTRAL-14B-$0.20 17.7%
The Global AI Market Intelligence Platform

See what's hot.
Know what's changing.
Act before the market moves.

Track AI model prices, token costs, inference trends, market heat and performance signals across the global AI economy.

hoton://terminal
LIVE
HOTN-GTI
Global AI Token Price Index
4.8$/1M
+0.0%
30D Low
4.75
30D High
9.00
Models
120
Top Mover
MINIMAX-M1+11.9%
Models Tracked
120
Providers Covered
32
Global Token Price Index
4.75+0.0%
Model Efficiency Index
89.82.0%
Live marketModels120Providers32Avg price$4.75/1MCheapest$0.01/1MMultimodal61Token Price Index4.75+0.0%
00Why HotON

AI is becoming a global market. HotON.ai helps you read it.

Models are changing. Prices are moving. Compute costs are shifting. HotON.ai turns fragmented AI signals into structured market intelligence.

Model Price Tracking

Monitor API prices, token costs, context limits and pricing changes across leading AI models.

AI Market Heat

Discover which models, providers, regions and categories are gaining momentum in real time.

Inference Cost Intelligence

Compare the real cost of running AI tasks across models, regions and infrastructure conditions.

Model Efficiency Index

Measure models not only by price, but by speed, stability, output quality and task success cost.

01The AI Market Dashboard

A real-time view of the global AI market

From model prices and usage trends to availability, latency and market heat — the whole AI economy on one screen.

Trending Models

Live
ModelOut / 1MHeat
MINIMAX-M1
MiniMax
$2.20
+11.9%
$25.00
+11.8%
$1.15
+10.7%
$1.25
+10.0%
KIMI-K2.5
Moonshot
$1.90
+9.8%

Price Movers

24H
QWEN3.6-PLUS31.6%
NOVA-2-LITE-V131.4%
MISTRAL-LARGE-30.5%
MISTRAL-SMALL-30.3%
HERMES-4-70B29.1%

Latency Watch

tok/s
GEMINI-3.1-FLA
172
CLAUDE-SONNET-
172
ERNIE-4.5-VL-4
172
INFLECTION-3-P
171
DEVSTRAL-2512
170

Availability Signals

Live
DEEPSEEK-V4-FL
100.0%
DEEPSEEK-V4-PR
100.0%
QWEN3.6-FLASH
100.0%
LLAMA-4-MAVERI
100.0%
LLAMA-4-SCOUT
100.0%

New Model Launches

Recent
NewGemini 3 Pro
2d
UpdatedClaude Opus 4.8
4d
NewDeepSeek V4
6d
PreviewQwen3 Max
1w
UpdatedKimi k2
1w

Category Heat

Momentum by category
Reasoning
+6.4%
Agentic
+9.1%
Multimodal
+4.2%
Code
+2.7%
Open Source
+5.5%
Audio / Video
+3.3%
02Model Prices

Track the price of intelligence

AI pricing is no longer simple. HotON.ai helps builders, enterprises and investors understand how model costs move across providers and categories.

Input Token PriceOutput Token PriceContext Window CostMultimodal PricingEnterprise API Pricing

Model Price Board

$ / 1M tokens
ModelRegionInputOutputCtxΔ
DEEPSEEK-V4-FL
DeepSeek
CN$0.10$0.201049K26.3%
DEEPSEEK-V4-PR
DeepSeek
CN$0.44$0.871049K+4.6%
QWEN3.6-FLASH
Alibaba
CN$0.19$1.131000K24.9%
NEMOTRON-3-SUP
NVIDIA
US$0.09$0.451000K23.9%
LLAMA-4-MAVERI
Meta
US$0.15$0.601049K11.9%
LLAMA-4-SCOUT
Meta
US$0.08$0.3010000K+5.8%
GEMINI-3.1-FLA
Google
US$0.25$1.501049K10.0%
GEMINI-2.5-FLA
Google
US$0.10$0.401049K0.8%
MINIMAX-01
MiniMax
CN$0.20$1.101000K+8.0%
MIMO-V2.5
Xiaomi
CN$0.14$0.281049K5.8%
MIMO-V2.5-PRO
Xiaomi
CN$0.44$0.871049K+2.6%
MINIMAX-M3
MiniMax
CN$0.30$1.201049K6.1%
QWEN3.5-PLUS-2
Alibaba
CN$0.30$1.801000K6.0%
NOVA-2-LITE-V1
Amazon
US$0.30$2.501000K31.4%
GEMINI-2.5-FLA
Google
US$0.30$2.501049K9.5%
GROK-4.3
xAI
US$1.25$2.501000K8.7%
QWEN3.6-PLUS
Alibaba
CN$0.33$1.951000K31.6%
NEMOTRON-3-ULT
NVIDIA
US$0.50$2.501000K25.9%
QWEN3.7-PLUS
Alibaba
CN$0.40$1.601000K21.4%
MINIMAX-M1
MiniMax
CN$0.40$2.201000K+11.9%
PALMYRA-X5
Writer
US$0.60$6.001040K26.9%
QWEN3.7-MAX
Alibaba
CN$1.25$3.751000K3.1%
GEMINI-3.5-FLA
Google
US$1.50$9.001049K8.3%
GEMINI-2.5-PRO
Google
US$1.25$10.001049K+7.2%
GPT-5.4-NANO
OpenAI
US$0.20$1.25400K+10.0%
NOVA-LITE-V1
Amazon
US$0.06$0.24300K28.9%
KIMI-K2.5
Moonshot
CN$0.40$1.90262K+9.8%
MINISTRAL-14B-
Mistral
EU$0.20$0.20262K17.7%
MINIMAX-M2
MiniMax
CN$0.26$1.00205K+2.9%
RING-2.6-1T
InclusionAI
CN$0.08$0.63262K28.2%
MISTRAL-SMALL-
Mistral
EU$0.15$0.60262K30.3%
MIMO-V2-FLASH
Xiaomi
CN$0.10$0.30262K18.3%
MINIMAX-M2.1
MiniMax
CN$0.29$0.95205K+2.6%
SEED-2.0-LITE
ByteDance
CN$0.25$2.00262K3.3%
QWEN3.5-9B
Alibaba
CN$0.04$0.15262K+6.0%
LING-2.6-FLASH
InclusionAI
CN$0.01$0.03262K15.6%
SEED-2.0-MINI
ByteDance
CN$0.10$0.40262K+5.2%
CODESTRAL-2508
Mistral
EU$0.30$0.90256K24.2%
SEED-1.6
ByteDance
CN$0.25$2.00262K27.8%
QWEN3.6-35B-A3
Alibaba
CN$0.14$1.00262K11.6%
STEP-3.5-FLASH
StepFun
CN$0.09$0.30262K21.8%
NEMOTRON-3-NAN
NVIDIA
US$0.05$0.20262K+0.6%
SEED-1.6-FLASH
ByteDance
CN$0.08$0.30262K2.0%
GLM-4.7-FLASH
Zhipu
CN$0.06$0.40203K20.7%
NOVA-PREMIER-V
Amazon
US$2.50$12.501000K28.8%
LING-2.6-1T
InclusionAI
CN$0.08$0.63262K16.0%
STEP-3.7-FLASH
StepFun
CN$0.20$1.15256K7.4%
MISTRAL-LARGE-
Mistral
EU$0.50$1.50262K30.5%
GEMMA-4-26B-A4
Google
US$0.06$0.33262K+8.8%
GEMMA-4-31B-IT
Google
US$0.12$0.36262K7.9%
LLAMA-3.3-70B-
Meta
US$0.10$0.32131K8.9%
GRANITE-4.1-8B
IBM
US$0.05$0.10131K+7.4%
QWEN3.6-27B
Alibaba
CN$0.29$3.20262K10.4%
KIMI-K2.6
Moonshot
CN$0.68$3.42262K7.3%
MINIMAX-M2.7
MiniMax
CN$0.28$1.20205K7.2%
GPT-5.4-MINI
OpenAI
US$0.75$4.50400K+9.6%
LFM-2-24B-A2B
Liquid
US$0.03$0.12128K+6.3%
CLAUDE-SONNET-
Anthropic
US$3.00$15.001000K12.9%
MINIMAX-M2.5
MiniMax
CN$0.15$1.15205K+10.7%
SOLAR-PRO-3
Upstage
US$0.15$0.60128K9.4%
GLM-4.6V
Zhipu
CN$0.30$0.90131K4.2%
TRINITY-MINI
Arcee
US$0.05$0.15131K8.2%
DEEPSEEK-V3.2
DeepSeek
CN$0.23$0.34131K3.9%
OLMO-3-32B-THI
AllenAI
US$0.15$0.5066K3.1%
PHI-4-MINI-INS
Microsoft
US$0.08$0.35131K+8.7%
LLAMA-3.3-NEMO
NVIDIA
US$0.10$0.40131K+2.4%
DEVSTRAL-2512
Mistral
EU$0.40$2.00262K13.4%
KIMI-K2-0905
Moonshot
CN$0.60$2.50262K14.0%
HERMES-4-70B
Nous
US$0.13$0.40131K29.1%
DEEPSEEK-CHAT-
DeepSeek
CN$0.21$0.79164K23.1%
ERNIE-4.5-VL-2
Baidu
CN$0.14$0.56131K12.4%
GLM-4.5-AIR
Zhipu
CN$0.13$0.85131K21.0%
GLM-4-32B
Zhipu
CN$0.10$0.10128K5.7%
UI-TARS-1.5-7B
ByteDance
CN$0.10$0.20128K+1.1%
HUNYUAN-A13B-I
Tencent
CN$0.14$0.57131K+0.9%
ERNIE-4.5-VL-4
Baidu
CN$0.42$1.25131K4.1%
MISTRAL-SMALL-
Mistral
EU$0.08$0.20128K+4.2%
NEMOTRON-NANO-
NVIDIA
US$0.04$0.16131K9.0%
GEMMA-3N-E4B-I
Google
US$0.06$0.1233K22.3%
SPOTLIGHT
Arcee
US$0.18$0.18131K21.8%
VIRTUOSO-LARGE
Arcee
US$0.75$1.20131K10.1%
REKA-FLASH-3
Reka
US$0.10$0.2066K10.7%
DEEPSEEK-R1-DI
DeepSeek
CN$0.29$0.29128K4.8%
SONAR
Perplexity
US$1.00$1.00127K27.3%
COMMAND-R7B-12
Cohere
US$0.04$0.15128K+0.4%
DEEPSEEK-R1-05
DeepSeek
CN$0.50$2.15164K+7.9%
NOVA-MICRO-V1
Amazon
US$0.04$0.14128K13.5%
NOVA-PRO-V1
Amazon
US$0.80$3.20300K7.0%
LLAMA-3.2-3B-I
Meta
US$0.05$0.34131K13.2%
DEEPSEEK-V3.1-
DeepSeek
CN$0.27$0.95164K17.2%
DEEPSEEK-R1-DI
DeepSeek
CN$0.70$0.80131K+7.0%
VOXTRAL-SMALL-
Mistral
EU$0.10$0.3032K18.3%
GRANITE-4.0-H-
IBM
US$0.02$0.11131K9.1%
GLM-5.1
Zhipu
CN$0.98$3.08203K+6.7%
GLM-4.5V
Zhipu
CN$0.60$1.8066K1.7%
CLAUDE-3.5-HAI
Anthropic
US$0.80$4.00200K2.5%
CODER-LARGE
Arcee
US$0.50$0.8033K6.6%
GLM-5-TURBO
Zhipu
CN$1.20$4.00203K21.3%
CLAUDE-HAIKU-4
Anthropic
US$1.00$5.00200K13.4%
MAESTRO-REASON
Arcee
US$0.90$3.30131K14.9%
GLM-5V-TURBO
Zhipu
CN$1.20$4.00203K13.6%
MINIMAX-M2-HER
MiniMax
CN$0.30$1.2066K12.9%
PHI-4
Microsoft
US$0.07$0.1416K+7.9%
REKA-EDGE
Reka
US$0.10$0.1016K24.2%
JAMBA-LARGE-1.
AI21
US$2.00$8.00256K4.2%
MISTRAL-MEDIUM
Mistral
EU$1.50$7.50262K2.6%
SONAR-REASONIN
Perplexity
US$2.00$8.00128K27.5%
SONAR-DEEP-RES
Perplexity
US$2.00$8.00128K26.9%
COMMAND-A
Cohere
US$2.50$10.00256K+5.5%
GPT-5.3-CODEX
OpenAI
US$1.75$14.00400K19.8%
GPT-5.2-CODEX
OpenAI
US$1.75$14.00400K+0.7%
INFLECTION-3-P
Inflection
US$2.50$10.008K16.1%
CLAUDE-OPUS-4.
Anthropic
US$5.00$25.001000K+11.8%
INFLECTION-3-P
Inflection
US$2.50$10.008K22.9%
SONAR-PRO
Perplexity
US$3.00$15.00200K7.3%
GPT-5.3-CHAT
OpenAI
US$1.75$14.00128K+9.0%
SONAR-PRO-SEAR
Perplexity
US$3.00$15.00200K13.3%
GPT-5.5-PRO
OpenAI
US$30.00$180.001050K13.1%
GPT-5.4-PRO
OpenAI
US$30.00$180.001050K8.8%
GPT-5.5
OpenAI
US$5.00$30.001050K24.3%

Token Cost Calculator

DEMO
Cost / request
$0.000
Est. cost / month
$192.00
Cheapest for this workload: LING-2.6-FLASH $22.80/mo (save $169.20)
04Efficiency Ranking

Price is only one part of the story. Efficiency tells the truth.

The cheapest model is not always the most efficient. HotON.ai compares models by total task cost, success rate, speed, stability and output quality.

#ModelCost / TaskSpeedStabilityQualityEfficiency
01
DEEPSEEK-V4-FL
DeepSeek
$0.000120 t/s100.0%A+
96
02
DEEPSEEK-V4-PR
DeepSeek
$0.001121 t/s100.0%A+
96
03
QWEN3.6-FLASH
Alibaba
$0.001145 t/s100.0%A+
96
04
NEMOTRON-3-SUP
NVIDIA
$0.000152 t/s96.5%A+
96
05
LLAMA-4-MAVERI
Meta
$0.001158 t/s100.0%A+
96
06
LLAMA-4-SCOUT
Meta
$0.000140 t/s100.0%A+
96
07
GEMINI-3.1-FLA
Google
$0.001172 t/s99.5%A+
96

Find the model that actually delivers the best result for the cost.

05Inference Cost Map

Where is AI cheaper to run?

AI costs are shaped by more than model prices. Region, compute supply, energy cost, latency and availability all matter.

Regional Inference Cost

Index · lower = cheaper
China · NorthCheap
417.2%
India · WestCheap
465.1%
NordicsCheap
523.8%
US · WestStable
680.6%
SingaporeRising
71+2.4%
US · EastStable
74+0.9%
EU · WestRising
79+3.6%
Middle EastTight
83+5.9%
View all

Global Compute Network

Live
US-W 68
US-E 74
EU-N 52
EU-W 79
ME 83
IN 46
CN-N 41
Cheap Rising Tight

Regional Cost Signals

Compare inference cost patterns across global regions.

Compute Availability

Understand where AI infrastructure capacity is becoming more attractive.

Energy-Aware AI

Track how energy conditions may influence compute and inference pricing.

Time-Based Cost Windows

Discover when certain regions may become more cost-efficient for AI workloads.

HotON.ai helps the market understand the geography of AI cost.

07Market Media

Video, visuals and briefings — every format, one feed

HotON.ai delivers market intelligence as video, visuals and text. Choose how you consume the AI economy — the platform handles every format.

hoton://media
01:32
Video

Weekly AI Market Pulse

A 90-second video recap of the week's biggest moves in AI prices, models and infrastructure.

Watch recap
Global token price heatmapVisual
Infographic

Global token price heatmap

Where input and output costs are rising and falling, at a glance.

Text Brief

Inference costs fall as new capacity comes online

Regional compute supply loosened this week, pushing the Inference Cost Index to a new monthly low across three major regions…

Video reports

Showreels, recaps and explainers with adaptive playback.

Visual & images

Charts, infographics and covers, responsive and crisp.

Text & briefings

Structured articles, summaries and data notes.

08HotON Reports

Reports built for AI decision makers

Structured intelligence on model pricing, AI infrastructure, inference cost, market heat and global AI supply-chain trends.

BriefWeekly

Weekly AI Market Brief

A concise summary of the most important AI market changes.

Read report
PricingMonthly

Monthly AI Pricing Report

A deeper look at model pricing, token cost and efficiency trends.

Read report
CostMonthly

Global Inference Cost Report

How AI task costs are changing across models and regions.

Read report
EfficiencyMonthly

Model Efficiency Report

Leading models compared by real task performance and total cost.

Read report
InfraQuarterly

AI Infrastructure Intelligence

Compute, energy, cloud and data-center signals behind the economy.

Read report
Briefings

Subscribe to HotON Reports

AI market briefings, pricing reports and index updates — in your inbox.

No spam. Unsubscribe anytime.

09HotON Data API

AI market data for builders and institutions

Access structured AI market data through HotON.ai APIs, feeds and custom intelligence products.

Model pricing feedsToken cost dataModel metadataIndex dataMarket heat rankingsLatency & availabilityReport feedsCustom intelligence
  • Developers building AI tools
  • Enterprises optimizing AI costs
  • Model providers tracking position
  • Investors following AI infrastructure
  • Analysts researching the AI economy
api.hoton.ai
> GET /v1/models/OPUS-4.8/price

{
  "symbol": "OPUS-4.8",
  "provider": "Anthropic",
  "input_per_1m": 6.00,
  "output_per_1m": 22.50,
  "context_k": 500,
  "efficiency": 96,
  "availability": "99.9%",
  "change_24h": "+6.2%"
}
HotON.ai

Understand the AI market before everyone else.

HotON.ai gives you the data, indexes and intelligence to understand where the AI economy is moving next.