LIVE
DEEPSEEK-V4-FL $0.20 26.3%
DEEPSEEK-V4-PR $0.87 4.6%
QWEN3.6-FLASH $1.13 24.9%
NEMOTRON-3-SUP $0.45 23.9%
LLAMA-4-MAVERI $0.60 11.9%
LLAMA-4-SCOUT $0.30 5.8%
GEMINI-3.1-FLA $1.50 10.0%
GEMINI-2.5-FLA $0.40 0.8%
MINIMAX-01 $1.10 8.0%
MIMO-V2.5 $0.28 5.8%
MIMO-V2.5-PRO $0.87 2.6%
MINIMAX-M3 $1.20 6.1%
QWEN3.5-PLUS-2 $1.80 6.0%
NOVA-2-LITE-V1 $2.50 31.4%
GEMINI-2.5-FLA $2.50 9.5%
GROK-4.3 $2.50 8.7%
QWEN3.6-PLUS $1.95 31.6%
NEMOTRON-3-ULT $2.50 25.9%
QWEN3.7-PLUS $1.60 21.4%
MINIMAX-M1 $2.20 11.9%
PALMYRA-X5 $6.00 26.9%
QWEN3.7-MAX $3.75 3.1%
GEMINI-3.5-FLA $9.00 8.3%
GEMINI-2.5-PRO $10.00 7.2%
GPT-5.4-NANO $1.25 10.0%
NOVA-LITE-V1 $0.24 28.9%
KIMI-K2.5 $1.90 9.8%
MINISTRAL-14B- $0.20 17.7%
AI Infrastructure

Compute supply, GPUs, data centers and energy — the physical layer that decides how cheaply AI runs.

10 stories

Why it matters

Compute supply, energy and data-center capacity decide how cheaply AI can run. Infrastructure shifts show up in inference costs weeks later.

Infrastructure · Ars Technica · Jun 6, 2026 · 11 hours ago

"We pissed off a lot of people": Giant data center plan cut 50% amid protests

Developer felt "beaten up," with "no choice" but to shrink data center.

Infrastructure · MarkTechPost · Jun 5, 2026 · yesterday

NVIDIA AI Releases Nemotron 3 Ultra: An Open 550B Mixture-of-Experts Hybrid Mamba-Transformer for Long-Running Agents

NVIDIA has released Nemotron 3 Ultra, a 550B total (55B active) open Mixture-of-Experts hybrid Mamba-Transformer for long-running agents. It pairs a 1M-token context with up to ~6x higher inference throughput than compa…

Infrastructure · TechCrunch · Jun 5, 2026 · yesterday

Meta steals a tactic from Tesla and builds data centers in tents

Meta may have found one way to slash its massive data center bill: tents.

Infrastructure · The Verge · Jun 5, 2026 · yesterday

Kevin O’Leary agrees to downsize massive Utah data center

Kevin O'Leary agreed to halve the size of his planned 40,000-acre data center in Utah amid mounting pressure from residents and activists, as reported earlier by local affiliate ABC4. The Shark Tank star sent a letter t…

Infrastructure · TechCrunch · Jun 4, 2026 · 2 days ago

Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.

The California startup released the fourth generation of its home assistance robot, Stretch.

Infrastructure · Ars Technica · Jun 4, 2026 · 2 days ago

How some data center operators are tackling their water use problems

Hyperscalers have come under scrutiny for their impact on water quality and availability.

Infrastructure · HotON Desk · Jun 3, 2026 · 3 days ago

Regional GPU capacity loosens as new clusters come online

Fresh data-center capacity in several regions eased GPU availability, helping push regional inference cost indices lower this week.

Infrastructure · HotON Desk · Jun 1, 2026 · 5 days ago

Energy-linked compute pricing rolls out in two more regions

More providers are tying compute prices to local energy conditions, adding a time-of-day dimension to where and when AI workloads run cheapest.

Infrastructure · NVIDIA · Jun 1, 2026 · 5 days ago

NVIDIA AI Cloud Ecosystem Expands Worldwide to Meet Global AI Compute Demand

The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet growing demand from enterprises, startups, nations, AI labs and developers scaling…

Infrastructure · VentureBeat · Jan 12, 2026 · 5 months ago

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Anthropic released Cowork on Monday, a new AI agent capability that extends the power of its wildly successful Claude Code tool to non-technical users — and according to company insiders, the team built the entire featu…

Summaries are aggregated for information only — follow the source link for the full story. Demo entries are illustrative.