Model	Region	Input	Output	Ctx
GROK-4.1-FAST xAI	US	$0.20	$0.50	2000K
GROK-4-FAST xAI	US	$0.20	$0.50	2000K
GEMINI-2.0-FLA Google	US	$0.10	$0.40	1049K
GROK-4-1-FAST- xAI	US	$0.20	$0.50	2000K
DEEPSEEK Novita AI	US	$0.14	$0.28	1049K
LLAMA-4-MAVERI Meta	US	$0.15	$0.60	1049K
GROK-4-FAST-RE xAI	US	$0.20	$0.50	2000K
GEMINI-2.5-FLA Google	US	$0.10	$0.40	1049K
MINIMAX-M3 MiniMax	CN	$0.30	$1.20	1000K
GEMINI-2.5-FLA Google	US	$0.10	$0.40	1049K
GEMINI-2.0-FLA Google	US	$0.08	$0.30	1049K
QWEN3.5-FLASH- Alibaba Cloud · Qwen	CN	$0.07	$0.26	1000K
QWEN-PLUS-2025 Alibaba Cloud · Qwen	CN	$0.26	$0.78	1000K
GEMINI-3.1-FLA Google	US	$0.25	$1.50	1049K

Token Cost Calculator

DEMO

Model

Input tokens / request

Output tokens / request

Requests / day

Cost / request

$0.001

Est. cost / month

$420.00

Cheapest for this workload: BAAI — $15.60/mo (save $404.40)

View all →

03AI Market Indexes

AI indexes for a new computing economy

Structured benchmarks for AI model prices, efficiency, inference cost and market momentum. News gets copied — indexes don't.

View all→

HOTN-CN

China AI Model Price Index

1.5$/1M

Pricing trend across China-based model providers.

HOTN-US

US AI Model Price Index

8.4$/1M

Pricing changes across leading US AI companies.

HOTN-ICX

AI Inference Cost Index

8.1pts

Cost of completing real AI tasks end to end.

HOTN-EFF

Model Efficiency Index

88.2/100

Price, quality, speed and reliability combined.

HOTN-MMX

Multimodal AI Cost Index

11.4$/1M

Cost of image, video, audio and multimodal output.

HOTN-FPI

Frontier Intelligence Price Index

15.9$/1M

What it costs to use frontier intelligence — the average blended price of the ten highest-ranked models by real LMArena human-preference Elo. Tracks how fast top-tier capability is getting cheaper.

HOTN-GTI

Global AI Token Price Index

6.4$/1M

Benchmark for model usage cost across major providers.

View all →

04Efficiency Ranking

Price is only one part of the story.
Efficiency tells the truth.

The cheapest model is not always the most efficient. HotON.ai compares models by total task cost, success rate, speed, stability and output quality.

View all→

#	Model	Cost / Task	Context	Arena Elo	Efficiency
01	GROK-4.1-FAST xAI	$0.001	2000K	—	96
02	GROK-4-FAST xAI	$0.001	2000K	—	96
03	GEMINI-2.0-FLA Google	$0.000	1049K	—	96
04	GROK-4-1-FAST- xAI	$0.001	2000K	—	96
05	DEEPSEEK Novita AI	$0.000	1049K	—	96
06	LLAMA-4-MAVERI Meta	$0.001	1049K	—	96
07	GROK-4-FAST-RE xAI	$0.001	2000K	—	96

Find the model that actually delivers the best result for the cost. See the full price-vs-intelligence frontier →

05Inference Cost Map

Where is AI cheaper to run?

AI costs are shaped by more than model prices. Region, compute supply, energy cost, latency and availability all matter.

View all→

Pricing by Region

Avg blended $/1M

US385 models

$6.46

CN150 models

$1.15

View all →

Global Compute Network

Provider regions

Illustrative map of provider regions — not a live feed.

Regional Cost Signals

Compare inference cost patterns across global regions.

Compute Availability

Understand where AI infrastructure capacity is becoming more attractive.

Energy-Aware AI

Track how energy conditions may influence compute and inference pricing.

Time-Based Cost Windows

Discover when certain regions may become more cost-efficient for AI workloads.

HotON.ai helps the market understand the geography of AI cost.

06AI Market Radar

The signals that actually matter

Model launches, pricing changes, infrastructure shifts, policy updates, funding events and market movements — filtered from the noise.

View all→

■

Model Launches· Hacker News· 4 hours ago

Zitron: "Everyone Has Been Sold a Lie" on AI

Aug 2, 2026 ▲

Model Launches· Simon Willison· 5 hours ago

Quoting Greg Brockman

Aug 2, 2026 ■

Model Launches· Hacker News· 5 hours ago

AI financial advice is surprisingly good, especially if you ask right questions

Aug 2, 2026 ▲

Model Launches· Simon Willison· 6 hours ago

datasette-apps 0.2a0

Aug 2, 2026 ▼

Model Launches· Hacker News· 6 hours ago

AI's real threat to jobs isn't job loss, it's lower paychecks, new research says

Aug 2, 2026 ■

Model Launches· Simon Willison· 7 hours ago

Ten advances in mathematics and theoretical computer science

Aug 2, 2026 ■

Policy· TechCrunch· 7 hours ago

Judge denies xAI’s request to block Minnesota ban on ‘nudify’ apps

Aug 2, 2026 ■

Model Launches· TechCrunch· 7 hours ago

YouTuber Hank Green says his AI usage is ‘not healthy’

Aug 2, 2026 ▲

Pricing· MarkTechPost· 8 hours ago

AMD Releases Instella-MoE-16B-A3B: A Fully Open Mixture-of-Experts LLM With 2.8B Active Parameters Trained On Instinct GPUs

Aug 2, 2026 ■

Model Launches· Hacker News· 8 hours ago

Google cancels AI Studio app after 800k preorders

Aug 2, 2026 ■

Infrastructure· MarkTechPost· 9 hours ago

Accelerating Transformer Training with NVIDIA Transformer Engine, Fused Kernels, BF16, FP8, and GPU Benchmarking

Aug 2, 2026 ■

Model Launches· The Verge· 9 hours ago

Is this Billboard Hot 100 hit AI slop?

Aug 2, 2026 ■

Model Launches· Hacker News· 9 hours ago

Reddit Stock Collapses 23% as AI Eats Away at User Growth

Aug 2, 2026 ■

Model Launches· TechCrunch· 10 hours ago

Sam Altman is still making the case for parenting via ChatGPT

Aug 2, 2026 ■

Model Launches· Hacker News· 10 hours ago

I Fired My AI Assitant

Aug 2, 2026 ■

Model Launches· The Decoder· 11 hours ago

AI keeps cracking unsolved math problems, and mathematicians have mixed feelings

Aug 2, 2026

HotON.ai Radar filters noise from the AI market and highlights the changes that may affect cost, access, capability and competition.

View all →

07Market Media

Video, visuals and briefings — every format, one feed

HotON.ai delivers market intelligence as video, visuals and text. Choose how you consume the AI economy — the platform handles every format.

View all→

hoton://media

01:32

Video

Weekly AI Market Pulse

A 90-second video recap of the week's biggest moves in AI prices, models and infrastructure.

Watch recap

Visual

Infographic

Global token price heatmap

Where input and output costs are rising and falling, at a glance.

Text Brief

Inference costs fall as new capacity comes online

Regional compute supply loosened this week, pushing the Inference Cost Index to a new monthly low across three major regions…

Video reports

Showreels, recaps and explainers with adaptive playback.

Visual & images

Charts, infographics and covers, responsive and crisp.

Text & briefings

Structured articles, summaries and data notes.

08HotON Reports

Reports built for AI decision makers

Structured intelligence on model pricing, AI infrastructure, inference cost, market heat and global AI supply-chain trends.

View all→

BriefWeekly

Weekly AI Market Brief

A concise summary of the most important AI market changes.

Read report→

PricingMonthly

Monthly AI Pricing Report

A deeper look at model pricing, token cost and efficiency trends.

Read report→

CostMonthly

Global Inference Cost Report

How AI task costs are changing across models and regions.

Read report→

EfficiencyMonthly

Model Efficiency Report

Leading models compared by real task performance and total cost.

Read report→

InfraQuarterly

AI Infrastructure Intelligence

Compute, energy, cloud and data-center signals behind the economy.

Read report→

Briefings

Subscribe to HotON Reports

AI market briefings, pricing reports and index updates — in your inbox.

09HotON Data API

AI market data for builders and institutions

Access structured AI market data through HotON.ai APIs, feeds and custom intelligence products.

Model pricing feedsToken cost dataModel metadataIndex dataMarket heat rankingsLatency & availabilityReport feedsCustom intelligence

Developers building AI tools
Enterprises optimizing AI costs
Model providers tracking position
Investors following AI infrastructure
Analysts researching the AI economy

Request API Access Contact Data Team

api.hoton.ai

> GET /v1/models/OPUS-4.8/price

{
  "symbol": "OPUS-4.8",
  "provider": "Anthropic",
  "input_per_1m": 6.00,
  "output_per_1m": 22.50,
  "context_k": 500,
  "efficiency": 96,
  "arena_elo": 1432,
  "modalities": ["text", "image"]
}

Frequently asked questions

What is HotON.ai?+

HotON.ai is an AI market-intelligence platform tracking current prices (updated daily), token costs, quality (Arena Elo) and version price trends for 535 AI models across 79 providers worldwide.

Where does the pricing data come from, and how often is it updated?+

Prices come from each provider's official pricing (via the TestKey catalog), cross-checked against OpenRouter, and refreshed daily. Every model page shows the price's source and 'as of' date.

Is the data real?+

Yes. Model prices, context windows, modalities and Arena Elo scores are real and sourced; efficiency and cost-per-task are computed from those real inputs and labeled as such.

What is the cheapest AI model right now?+

As of Aug 2, 2026, the lowest blended price we track is $0.01 per 1M tokens. See the full 'cheapest models' ranking for the current list.

Which AI model is rated highest?+

By LMArena human-preference Elo, the top-rated model we track is claude-opus-4.6 (1505). See the quality ranking for the full leaderboard.

How is the efficiency / value score calculated?+

It is a composite of a model's real price and context window, normalized to 0-100 so cheaper, larger-context models score higher. Full details are on our Methodology page.

HotON.ai

Understand the AI market before everyone else.

HotON.ai gives you the data, indexes and intelligence to understand where the AI economy is moving next.

Explore the Dashboard Subscribe to HotON Reports

See what's hot. Know what's changing. Act before the market moves.

AI is becoming a global market. HotON.ai helps you read it.

Model Price Tracking

AI Market Heat

Inference Cost Intelligence

Model Efficiency Index

Today's view of the global AI market

Top Rated

Cheapest Models

Largest Context

Most Multimodal

Top Providers

Category Distribution

Track the price of intelligence

Model Price Board

Token Cost Calculator

AI indexes for a new computing economy

China AI Model Price Index

US AI Model Price Index

AI Inference Cost Index

Model Efficiency Index

Multimodal AI Cost Index

Frontier Intelligence Price Index

Global AI Token Price Index

Price is only one part of the story. Efficiency tells the truth.

Where is AI cheaper to run?

Pricing by Region

Global Compute Network

Regional Cost Signals

Compute Availability

Energy-Aware AI

Time-Based Cost Windows

The signals that actually matter

Video, visuals and briefings — every format, one feed

Weekly AI Market Pulse

Global token price heatmap

Inference costs fall as new capacity comes online

Video reports

Visual & images

Text & briefings

Reports built for AI decision makers

Weekly AI Market Brief

Monthly AI Pricing Report

Global Inference Cost Report

Model Efficiency Report

AI Infrastructure Intelligence

Subscribe to HotON Reports

AI market data for builders and institutions

Frequently asked questions

Understand the AI market before everyone else.

See what's hot. Know what's changing. Act before the market moves.

AI is becoming a global market. HotON.ai helps you read it.

Model Price Tracking

AI Market Heat

Inference Cost Intelligence

Model Efficiency Index

Today's view of the global AI market

Top Rated

Cheapest Models

Largest Context

Most Multimodal

Top Providers

Category Distribution

Track the price of intelligence

Model Price Board

Token Cost Calculator

AI indexes for a new computing economy

China AI Model Price Index

US AI Model Price Index

AI Inference Cost Index

Model Efficiency Index

Multimodal AI Cost Index

Frontier Intelligence Price Index

Global AI Token Price Index

Price is only one part of the story. Efficiency tells the truth.

Where is AI cheaper to run?

Pricing by Region

Global Compute Network

Regional Cost Signals

Compute Availability

See what's hot.
Know what's changing.
Act before the market moves.

AI is becoming a global market.
HotON.ai helps you read it.

Price is only one part of the story.
Efficiency tells the truth.

See what's hot.
Know what's changing.
Act before the market moves.

AI is becoming a global market.
HotON.ai helps you read it.

Price is only one part of the story.
Efficiency tells the truth.