Infrastructure · MarkTechPost · June 5, 2026 · Yesterday · 1 min read
NVIDIA AI Releases Nemotron 3 Ultra: An Open 550B Mixture-of-Experts Hybrid Mamba-Transformer for Long-Running Agents
NVIDIA has released Nemotron 3 Ultra, a 550B total (55B active) open Mixture-of-Experts hybrid Mamba-Transformer for long-running agents. It pairs a 1M-token context with up to ~6x higher inference throughput than compa…
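Why it matters
Compute supply, energy, and data center capacity determine what it costs to run AI. Changes in infrastructure show up in inference costs a few weeks later.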
Summaries are for reference only; follow the source link to read the full article. Demo entries are illustrative.
More news
Model releases · 10 hours ago
Google DeepMind Releases Gemma 4 QAT Checkpoints: Q4_0 and a New Mobile Format Cut On-Device Memory
Pricing · 10 hours ago
Google will pay SpaceX $920M per month for compute
Funding and M&A · 10 hours ago
S&P 500 rejects SpaceX, also blocking entry for OpenAI and Anthropic
Infrastructure · 11 hours ago
"We pissed off a lot of people": Giant data center plan cut 50% amid protests