Pasokan komputasi, GPU, pusat data, dan energi — lapisan fisik yang menentukan seberapa murah AI dijalankan.
cerita 28
Menghitung pasokan, energi, dan kapasitas pusat data menentukan seberapa murah AI dapat dijalankan. Pergeseran infrastruktur muncul dalam biaya inferensi beberapa minggu kemudian.
Bahkan pusat data berukuran sedang pun dapat menimbulkan dampak lokal yang sangat besar.
OpenAI sedang negosiasi untuk menyewa data center yang direncanakan 10 gigawatt di Ohio yang dapat didukung secara keuangan oleh Nvidia, menurut The Information.
Dengan kapasitas awal 24 megawatt, data center inovatif ini menggunakan air laut sebagai sistem penyejuk alam.
Laporan baru dari OpenAI menggambarkan operasi pengaruh yang terhubung dengan PRC menggunakan AI untuk menargetkan debat teknologi di Amerika Serikat, naskah data center, tarif, dan klaim palsu tentang ChatGPT.
The 168-megawatt facility will support Meta's global AI computing needs and can be expanded over time.
NVIDIA GPUs with Confidential Computing are now used for confidential inference in Apple’s Private Cloud Compute (PCC), as it expands beyond Apple’s data centers to Google Cloud. Unveiled during Apple’s annual WWDC gath…
At an event in San Francisco today, General Motors made a series of announcements around EV batteries, energy storage, and grid resiliency in the face of growing electricity demand from AI data centers. The automaker an…
In this post, we show how to train robot policies for the Unitree H1 humanoid with NVIDIA Isaac Lab on Amazon SageMaker AI across two compute options: Amazon SageMaker HyperPod and Amazon SageMaker Training Jobs.
On Tuesday, the Seattle City Council will vote on whether to enact a one-year moratorium on new data centers - just two months after several companies proposed building five large-scale centers in the city. Among the mo…
How Notion uses Codex to one-shot specs, build AI Voice Input for the web, and multiply engineering power across small teams.
In this tutorial, we implement a hands-on workflow for NVIDIA cuTile Python, a tile-based GPU programming interface for CUDA-style kernels in Python. We prepare a Colab-friendly environment and check GPU, driver, CUDA,…
Google has ordered more than three million AI chips from Intel for 2028. Nvidia is testing Intel's manufacturing tech for its upcoming Feynman architecture. Both moves come as TSMC can't keep up with AI chip demand. Int…
Xiaomi's MiMo team, with TileRT, released MiMo-V2.5-Pro-UltraSpeed, a serving mode for the MiMo-V2.5-Pro model. It decodes over 1000 tokens per second on a 1-trillion-parameter model using a single 8-GPU commodity node.…
With access to the latest generative AI models and high-performance accelerated compute in high global demand, AWS customers need tools to take advantage of model availability and capacity across multiple AWS Regions, w…
NVIDIA and LG Group are building an AI factory to accelerate LG Group’s next wave of AI-driven businesses, spanning robotics, autonomous driving, data center technologies and GPU cloud services. The AI factory will prov…
Clive Chan, by his own account the second hardware employee in OpenAI's custom chip program, is moving to Anthropic. He brings experience from Tesla's Autopilot ASIC and the OpenAI-Broadcom partnership. The move comes a…
Google released the Colab CLI, letting developers and AI agents run local code on remote Colab GPU and TPU runtime The post Google’s New Colab CLI Lets Developers and AI Agents Run Python on Remote Colab GPUs and TPUs F…
Sakana AI has launched a dedicated research lab for recursive self-improvement: AI that iteratively improves itself. The Japanese startup, co-founded by Transformer co-author Llion Jones, sees RSI as an alternative to t…
Developer felt "beaten up," with "no choice" but to shrink data center.
NVIDIA has released Nemotron 3 Ultra, a 550B total (55B active) open Mixture-of-Experts hybrid Mamba-Transformer for long-running agents. It pairs a 1M-token context with up to ~6x higher inference throughput than compa…
Meta may have found one way to slash its massive data center bill: tents.
Kevin O'Leary agreed to halve the size of his planned 40,000-acre data center in Utah amid mounting pressure from residents and activists, as reported earlier by local affiliate ABC4. The Shark Tank star sent a letter t…
The California startup released the fourth-generation of its home assistance robot, Stretch.
Hyperscalers have come under scrutiny for their impact on water quality and availability.
Fresh data-center capacity in several regions eased GPU availability, helping push regional inference cost indices lower this week.
More providers are tying compute prices to local energy conditions, adding a time-of-day dimension to where and when AI workloads run cheapest.
The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet growing demand from enterprises, startups, nations, AI labs and developers scaling…
Anthropic released Cowork on Monday, a new AI agent capability that extends the power of its wildly successful Claude Code tool to non-technical users — and according to company insiders, the team built the entire featu…
Ringkasan dikumpulkan untuk informasi saja — ikuti tautan sumber untuk cerita selengkapnya. Entri demo bersifat ilustratif.