LIVE
DEEPSEEK-V4-FL$0.20 26.3%
DEEPSEEK-V4-PR$0.87 4.6%
QWEN3.6-FLASH$1.13 24.9%
NEMOTRON-3-SUP$0.45 23.9%
LLAMA-4-MAVERI$0.60 11.9%
LLAMA-4-SCOUT$0.30 5.8%
GEMINI-3.1-FLA$1.50 10.0%
GEMINI-2.5-FLA$0.40 0.8%
MINIMAX-01$1.10 8.0%
MIMO-V2.5$0.28 5.8%
MIMO-V2.5-PRO$0.87 2.6%
MINIMAX-M3$1.20 6.1%
QWEN3.5-PLUS-2$1.80 6.0%
NOVA-2-LITE-V1$2.50 31.4%
GEMINI-2.5-FLA$2.50 9.5%
GROK-4.3$2.50 8.7%
QWEN3.6-PLUS$1.95 31.6%
NEMOTRON-3-ULT$2.50 25.9%
QWEN3.7-PLUS$1.60 21.4%
MINIMAX-M1$2.20 11.9%
PALMYRA-X5$6.00 26.9%
QWEN3.7-MAX$3.75 3.1%
GEMINI-3.5-FLA$9.00 8.3%
GEMINI-2.5-PRO$10.00 7.2%
GPT-5.4-NANO$1.25 10.0%
NOVA-LITE-V1$0.24 28.9%
KIMI-K2.5$1.90 9.8%
MINISTRAL-14B-$0.20 17.7%
DEEPSEEK-V4-FL$0.20 26.3%
DEEPSEEK-V4-PR$0.87 4.6%
QWEN3.6-FLASH$1.13 24.9%
NEMOTRON-3-SUP$0.45 23.9%
LLAMA-4-MAVERI$0.60 11.9%
LLAMA-4-SCOUT$0.30 5.8%
GEMINI-3.1-FLA$1.50 10.0%
GEMINI-2.5-FLA$0.40 0.8%
MINIMAX-01$1.10 8.0%
MIMO-V2.5$0.28 5.8%
MIMO-V2.5-PRO$0.87 2.6%
MINIMAX-M3$1.20 6.1%
QWEN3.5-PLUS-2$1.80 6.0%
NOVA-2-LITE-V1$2.50 31.4%
GEMINI-2.5-FLA$2.50 9.5%
GROK-4.3$2.50 8.7%
QWEN3.6-PLUS$1.95 31.6%
NEMOTRON-3-ULT$2.50 25.9%
QWEN3.7-PLUS$1.60 21.4%
MINIMAX-M1$2.20 11.9%
PALMYRA-X5$6.00 26.9%
QWEN3.7-MAX$3.75 3.1%
GEMINI-3.5-FLA$9.00 8.3%
GEMINI-2.5-PRO$10.00 7.2%
GPT-5.4-NANO$1.25 10.0%
NOVA-LITE-V1$0.24 28.9%
KIMI-K2.5$1.90 9.8%
MINISTRAL-14B-$0.20 17.7%
Latest

AI Market News

The latest signals across the AI economy — model launches, pricing moves, infrastructure shifts, policy and funding — summarized with sources.

Model Launches· MarkTechPost· Jun 6, 2026· 12 hours ago

Google DeepMind Releases Gemma 4 QAT Checkpoints: Q4_0 and a New Mobile Format Cut On-Device Memory

Compare Gemma 4 edge formats: BF16, Q4_0 QAT, and mobile QAT, on published memory numbers and design tradeoffs. The post Google DeepMind Releases Gemma 4 QAT Checkpoints: Q4_0 and a New Mobile Format Cut On-Device Memor…

Pricing· TechCrunch· Jun 6, 2026· 12 hours ago

Google will pay SpaceX $920M per month for compute

The companies announced the deal on Friday, just one week ahead of SpaceX's historic IPO.

Funding & M&A· Ars Technica· Jun 6, 2026· 12 hours ago

S&P 500 rejects SpaceX, also blocking entry for OpenAI and Anthropic

SpaceX won’t get easy access to billions of dollars from passive investors.

Infrastructure· Ars Technica· Jun 6, 2026· 13 hours ago

"We pissed off a lot of people": Giant data center plan cut 50% amid protests

Developer felt "beaten up," with "no choice" but to shrink data center.

Funding & M&A· The Decoder· Jun 6, 2026· 13 hours ago

Florida's lawsuit against OpenAI and CEO Altman treats ChatGPT as a defective product and public nuisance

Florida is the first US state to sue OpenAI and CEO Sam Altman personally over risks to minors, missing age checks, and inadequate safety investment. The 83-page complaint treats ChatGPT as a product subject to liabilit…

Funding & M&A· TechCrunch· Jun 6, 2026· 14 hours ago

The most interesting startups right now want to get you off your phone

While the AI fundraising machine keeps breaking its own records, some founders are building in the other direction. Mirror founder Brynn Putnam just raised money for Board, a startup focused on bringing people together…

Model Launches· The Verge· Jun 6, 2026· 14 hours ago

This is your laptop… on AI

We're now deep into developer conference season, and one of the themes so far is the relentless conviction from Big Tech companies that AI is going to change everything about how we do everything. Nvidia's Jensen Huang…

Model Launches· Ars Technica· Jun 5, 2026· 15 hours ago

The Fitbit Air is a good wearable weighed down by a chatty AI "coach"

The Air succeeds as a minimalist, reliable fitness tracker, but Google's AI Health Coach feels unnecessary.

Model Launches· The Decoder· Jun 5, 2026· 16 hours ago

Satya Nadella publicly torches a VP's plan to make Microsoft's AI agent deliberately addictive

Microsoft CEO Satya Nadella has sharply criticized an internal memo proposing to make users "addicted" to the company's new AI agent Scout. "Not sure who is writing and leaking this nonsense," Nadella wrote to about 50…

Policy· The Verge· Jun 5, 2026· 16 hours ago

New York lawmakers pass one-year ban on new data centers

The New York State legislature passed a one-year moratorium on new large data centers, the first statewide ban of its kind if Democratic Governor Kathy Hochul signs it into law. Lawmakers behind the bill say it's meant…

Model Launches· WIRED· Jun 5, 2026· 16 hours ago

Has Microsoft Lost Its Mojo (Again)?

Microsoft’s AI products aren’t selling, and Github’s been plagued with troubles. WIRED spoke with VP Scott Hanselman about whether the company is in catch-up mode.

Model Launches· TechCrunch· Jun 5, 2026· 16 hours ago

The token bill comes due: Inside the industry scramble to manage AI’s runaway costs

"The whole conversation shifted from tokenmaxxing and 'go fast' to 'we need guardrails, how do we control this?'"

Model Launches· Google AI· Jun 5, 2026· 16 hours ago

The latest AI news we announced in May 2026

Here are Google’s latest AI updates from May 2026

Funding & M&A· TechCrunch· Jun 5, 2026· 17 hours ago

The ‘together tech’ wave might be the most intriguing startup bet of 2026

While the AI fundraising machine keeps breaking its own records, some founders are building in the other direction. Mirror founder Brynn Putnam just raised money for Board, a startup focused on bringing people together…

Model Launches· The Verge· Jun 5, 2026· 17 hours ago

This AI startup says it can tell if a script will make a hit film

When Quilty hit the industry trades earlier this year, the AI startup promised that its tool could accurately predict a film's success just by reading the script. When people actually got a chance to experiment with Qui…

Pricing· TechCrunch· Jun 5, 2026· 18 hours ago

AirTrunk commits $30B to build 5GW of AI data centers in India

The Australian data center operator plans to set up 5GW of capacity in India.

Model Launches· The Decoder· Jun 5, 2026· 19 hours ago

Microsoft trained its MAI models on unlicensed web data despite promising "enterprise grade, clean and commercially licensed data"

Microsoft sells its LLM training approach as different from other AI companies. It isn't. The company trained its new MAI models partly on unlicensed web data like Common Crawl, despite claiming they used only "clean an…

Model Launches· The Decoder· Jun 5, 2026· 20 hours ago

Anthropic's Mythos model is reportedly powering NSA offensive cyber ops against China and Iran

Anthropic has reportedly stationed about half a dozen engineers directly at the NSA to adapt its Mythos AI model for offensive cyber operations. The model could be used to break into networks in China or Iran. That fits…

Model Launches· Simon Willison· Jun 5, 2026· 20 hours ago

Quoting Andreas Kling

<blockquote cite="https://ladybird.org/posts/changing-how-we-develop-ladybird/"><p>We will no longer accept public pull requests. [...]</p> <p>A substantial patch used to imply substantial effort, and that effort was a…

Funding & M&A· WIRED· Jun 5, 2026· 21 hours ago

OpenAI and Anthropic May Be Rivals, but Investors Aren’t Picking Sides

“Why wouldn’t you want to be in both Pepsi and Coke?” says one venture capitalist. “It’s the same here.”

Model Launches· MarkTechPost· Jun 5, 2026· 21 hours ago

NVIDIA AI Releases Dynamo Snapshot: A CRIU-Based Fast Startup System for AI Inference on Kubernetes

NVIDIA Dynamo Snapshot checkpoints and restores vLLM inference workers on Kubernetes using CRIU and cuda-checkpoint tools. The post NVIDIA AI Releases Dynamo Snapshot: A CRIU-Based Fast Startup System for AI Inference o…

Policy· WIRED· Jun 5, 2026· 21 hours ago

Why Apple Might Put Cameras Into Its Next AirPods

From battery life to privacy, there are many hurdles to the idea taking off.

Model Launches· MarkTechPost· Jun 5, 2026· 21 hours ago

Perplexity AI Introduces Hybrid Local-Server Inference Orchestrator for Personal Computer: Automatic On-Device and Cloud Task Routing

Perplexity AI announces a hybrid local-server inference orchestrator for Personal Computer, automatically routing AI tasks between on-device and cloud models. The post Perplexity AI Introduces Hybrid Local-Server Infere…

Model Launches· MarkTechPost· Jun 5, 2026· 22 hours ago

Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatible Endpoint

A hands-on guide to running Microsoft Fara in Colab, testing the browser agent loop with a mock endpoint. The post Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatible Endpoint…

Model Launches· MIT Tech Review· Jun 5, 2026· 22 hours ago

The Meta hack shows there’s more to AI security than Mythos

On June 5, 404 Media reported that attackers had been using Meta’s AI customer support agent to steal Instagram accounts. Their approach was simple: They asked the agent to link the accounts to email addresses that they…

Model Launches· WIRED· Jun 5, 2026· 22 hours ago

AI Has Come for Serif Fonts

AI companies are using serif to project humanity. Critics are calling it “tasteslop.”

Model Launches· The Decoder· Jun 5, 2026· 22 hours ago

Anthropic says Claude now writes over 90% of its code and wants the world to have an AI pause button

Anthropic is sharing internal data showing how much Claude is speeding up its own AI development: more than 80 percent of production code now comes from Claude, and engineers are shipping eight times as much code per da…

Pricing· MarkTechPost· Jun 5, 2026· 23 hours ago

15 Best Vibe Coding Tools in 2026 Compared: Pricing, Features, and Best Fit

Vibe coding turns plain language into working software. Explore 15 tools shaping how developers build apps in 2026. The post 15 Best Vibe Coding Tools in 2026 Compared: Pricing, Features, and Best Fit appeared first on…

Model Launches· NVIDIA· Jun 5, 2026· yesterday

Seoul Purpose: How NVIDIA and South Korea Are Building the Future of AI

Home to cutting-edge sovereign AI infrastructure and robotics innovators, as well as one of the world’s most passionate gaming communities, South Korea is one of the world’s centers of AI. NVIDIA founder and CEO Jensen…

Model Launches· TechCrunch· Jun 5, 2026· yesterday

Mira Murati steps back into the spotlight, carefully

In the current environment, remaining heads down has diminishing returns; at some point, you have to make some noise just to remind the market you exist.

Model Launches· Simon Willison· Jun 5, 2026· yesterday

AI enthusiasts are in a race against time, AI skeptics are in a race against entropy

<p><strong><a href="https://charitydotwtf.substack.com/p/ai-enthusiasts-are-in-a-race-against">AI enthusiasts are in a race against time, AI skeptics are in a race against entropy</a></strong></p> Charity Majors neatly…

Pricing· TechCrunch· Jun 5, 2026· yesterday

Ahead of its IPO, Anthropic’s Daniela Amodei shrugs off doubts about AI’s returns

Anthropic has been growing at a breakneck pace. The company announced that annualized revenue crossed $47 billion in May, up dramatically from roughly $9 billion at the end of 2025. That trajectory faces a real test, th…

Model Launches· TechCrunch· Jun 5, 2026· yesterday

Airbnb’s Brian Chesky plans to launch a new AI lab

The Airbnb CEO said last year it hasn't struck an LLM partnership because existing products weren't quite ready.

Model Launches· MarkTechPost· Jun 5, 2026· yesterday

Building a Semantic Search Engine and Open-Status Classifier over the ResearchMath-14k Dataset

This tutorial walks through a complete NLP pipeline for research-level mathematics. Using the ResearchMath-14k dataset, we extract field-specific keywords with TF-IDF, generate sentence embeddings, visualize the problem…

Model Launches· Ars Technica· Jun 5, 2026· yesterday

The skeptic’s guide to humanoid robots going viral on the Internet

Robot demonstrations can distort public perceptions of robotic capabilities.

Infrastructure· MarkTechPost· Jun 5, 2026· yesterday

NVIDIA AI Releases Nemotron 3 Ultra: An Open 550B Mixture-of-Experts Hybrid Mamba-Transformer for Long-Running Agents

NVIDIA has released Nemotron 3 Ultra, a 550B total (55B active) open Mixture-of-Experts hybrid Mamba-Transformer for long-running agents. It pairs a 1M-token context with up to ~6x higher inference throughput than compa…

Funding & M&A· TechCrunch· Jun 5, 2026· yesterday

Defense tech, AI, and fundraising take center stage at StrictlyVC Los Angeles on June 18

On Thursday, June 18, at The Aerospace Corporation Campus, investors, founders, and tech leaders will gather for an evening of conversation exploring some of the most consequential shifts taking place across venture cap…

Policy· Ars Technica· Jun 5, 2026· yesterday

These LLMs are the best at resisting Russian propaganda

Estonian government benchmark shows how dozens of models combat Russia's "strategic narratives."

Policy· Ars Technica· Jun 5, 2026· yesterday

Elon Musk tries again to escape FTC audits of X data handling

Musk can't be trusted to protect X user privacy, public commenters warn FTC.

Infrastructure· TechCrunch· Jun 5, 2026· 2 days ago

Meta steals a tactic from Tesla and builds data centers in tents

Meta may have found one way to slash its massive data center bill: tents.

Model Launches· TechCrunch· Jun 5, 2026· 2 days ago

Apple approves Poke as the first AI agent on its Messages for Business platform

Poke, the startup that lets people use AI agents through simple text messages, has become the first AI agent approved for Apple’s Messages for Business platform.

Model Launches· Hugging Face· Jun 5, 2026· 2 days ago

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Model Launches· The Decoder· Jun 5, 2026· 2 days ago

Cloudflare CEO says the web's future is "pay to crawl" as bots overtake human traffic

Bot traffic now outpaces human traffic on the internet, Cloudflare CEO Matthew Prince says, years ahead of his late 2027 forecast. He blames AI agents for the surge. His conclusion for the future of the web: "Clearly it…

Funding & M&A· WIRED· Jun 5, 2026· 2 days ago

The AI IPO Race Heats Up, DOGE Whistleblower Sues Elon Musk, and Instagram Gets Hacked

On Uncanny Valley, we dive into the IPO bonanza that the top AI companies are embarking on to the point where some real estate listings are looking for not just regular old cash, but Anthropic stock.

Infrastructure· The Verge· Jun 5, 2026· 2 days ago

Kevin O’Leary agrees to downsize massive Utah data center

Kevin O'Leary agreed to halve the size of his planned 40,000-acre data center in Utah amid mounting pressure from residents and activists, as reported earlier by local affiliate ABC4. The Shark Tank star sent a letter t…

Model Launches· AWS ML· Jun 5, 2026· 2 days ago

NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart

Deploy NVIDIA Nemotron 3 Ultra on Amazon SageMaker JumpStart. Get 5x faster inference and 30% lower cost for agentic AI workloads with this frontier reasoning model.

Model Launches· The Decoder· Jun 5, 2026· 2 days ago

ChatGPT now saves narrative dossiers about you sorted by work, hobbies, and travel preferences

ChatGPT's updated "Dreaming" memory system now builds coherent user profiles from conversations instead of saving scattered bullet points. OpenAI says the success rate for keeping information current jumped from 52.2 pe…

Model Launches· Simon Willison· Jun 5, 2026· 2 days ago

Quoting Emanuel Maiberg, 404 Media

<blockquote cite="https://www.404media.co/google-employees-internally-share-memes-about-how-its-ai-sucks/"><p>After this story was published Google's spokesperson reached out and asked us to publish a slightly different…

Model Launches· TechCrunch· Jun 5, 2026· 2 days ago

Meta rolls out a new AI creator assistant on Facebook

Creators often have to parse through charts and dashboards to understand their performance, but with the new AI assistant, they can get quick answers to questions like "When should I post?" and "What are people saying i…

Model Launches· TechCrunch· Jun 5, 2026· 2 days ago

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

Apple's WWDC nears: Here's what you can look forward to.

Model Launches· The Decoder· Jun 5, 2026· 2 days ago

Bain study finds companies miss AI savings targets because humans keep getting in the way

According to a Bain survey of 951 companies, almost 40 percent achieved less than 10 percent in AI cost savings, even though most had targeted 11 to 20 percent. One alleged reason is that only 7 percent actually run ful…

Infrastructure· TechCrunch· Jun 4, 2026· 2 days ago

Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.

The California startup released the fourth-generation of its home assistance robot, Stretch.

Model Launches· The Verge· Jun 4, 2026· 2 days ago

TSMC struggles to keep up with AI demand: &#8216;We can only support so much&#8217;

Taiwan Semiconductor Manufacturing Co. - the world's biggest semiconductor-maker - is struggling to meet demands from American customers even with its factory buildout in the US, according to reports from Reuters and Bl…

Infrastructure· Ars Technica· Jun 4, 2026· 2 days ago

How some data center operators are tackling their water use problems

Hyperscalers have come under scrutiny for their impact on water quality and availability.

Pricing· TechCrunch· Jun 4, 2026· 2 days ago

Apple touts $1.4 trillion in App Store billings and sales, 90% without a commission

Apple's App Store generated $1.4 trillion in sales, up from $1.3 trillion last year, with $149 billion in sales for digital goods.

Model Launches· The Verge· Jun 4, 2026· 2 days ago

Elon Musk is steamrolling Wall Street to become a trillionaire

Today on Decoder, I’m talking to Ryan Mac, a technology reporter at The New York Times and coauthor of the excellent book Character Limit: How Elon Musk Destroyed Twitter, which came out in 2024. I can’t recommend it en…

Model Launches· The Decoder· Jun 4, 2026· 2 days ago

OpenAI CEO Sam Altman sees "proactive AI" as the next big phase after chatbots and agents

OpenAI CEO Sam Altman outlines the next phase of AI products: a "proactive AI" that runs constantly in the background and acts on its own instead of waiting for user prompts. Companies are also wrestling with spiraling…

Model Launches· NVIDIA· Jun 4, 2026· 2 days ago

Forecast: Fun Ahead — 18 Games Join in June to Stream on GeForce NOW

June’s forecast with GeForce NOW: 100% chance of gaming. GeForce NOW is lining up new adventures for the month, from big-name blockbusters to quirky indies ready for the spotlight. Members can dive into fresh worlds, sq…

Model Launches· Hugging Face· Jun 4, 2026· 2 days ago

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Model Launches· The Verge· Jun 4, 2026· 2 days ago

Let us filter AI slop, you cowards

Nobody should be subjected to seeing shrimp Jesus all over their social feeds. | Image: Cath Virginia / The Verge, Getty Images It's almost impossible to avoid seeing AI-generated content online, but it doesn't have to…

Model Launches· Hugging Face· Jun 4, 2026· 2 days ago

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

Model Launches· The Verge· Jun 4, 2026· 2 days ago

AI leaders call for tougher protections against AI-aided bioweapons

Some of the AI industry's biggest rivals have put their many, many grievances aside for a common cause: making it harder for people to use their technology to develop biological weapons. In an open letter to US lawmaker…

Model Launches· OpenAI· Jun 4, 2026· 2 days ago

How Endava is redesigning software delivery around AI agents

Learn how Endava is using AI agents, ChatGPT Enterprise, and Codex to accelerate software delivery, automate workflows, and build an AI-native culture across the enterprise.

Policy· MIT Tech Review· Jun 4, 2026· 2 days ago

How courts are coping with a flood of AI-generated lawsuits

Most days in her chambers, Judge Maritza Braswell, a federal magistrate judge in Colorado, sifts through stacks of documents written by people without a lawyer. Many of them can’t afford to hire a lawyer, and others hav…

Pricing· WIRED· Jun 4, 2026· 2 days ago

Jeff Bezos Is Funding a Wild Hunt for the Brain’s ‘Core Algorithm’

With $500 million in funding and a reported $2.5 billion valuation, Flourish wants to reinvent AI by putting real neurons under the microscope.

Policy· The Decoder· Jun 4, 2026· 2 days ago

AI can now coach amateur virologists, and top tech leaders want Congress to act on DNA security

Sam Altman, Dario Amodei, Demis Hassabis, and other tech leaders are urging the US government to make screening of synthetic DNA orders a legal requirement. AI systems already outperform PhD-level virologists on lab pro…

Pricing· WIRED· Jun 4, 2026· 2 days ago

Alpha School’s Ritzy New York City Campus Costs $65,000 a Year—but Isn’t Actually a School

A homeschooling center in Manhattan is part of the company’s nationwide expansion. Internal documents reveal its strategy: “Opening date > safety.”

Funding & M&A· WIRED· Jun 4, 2026· 2 days ago

Quantum Computing Is Having Its Public Market Moment

Quantinuum, a quantum computing startup, is losing millions. Investors want in anyway.

Model Launches· The Verge· Jun 4, 2026· 2 days ago

Amazon develops a warehouse robot that workers can speak to

The design hasn’t changed much from the original Proteus, which was announced in 2022. | Image: Amazon Amazon has announced a new version of its fully autonomous warehouse robot, Proteus, that will interact using langua…

Model Launches· OpenAI· Jun 4, 2026· 2 days ago

Dreaming: Better memory for a more helpful ChatGPT

ChatGPT introduces a new memory system to better remember preferences, keeping context fresh and relevant across conversations.

Pricing· HotON Desk· Jun 4, 2026· 2 days ago

Major provider cuts flagship output token price by 28%

A leading provider lowered output token pricing on its flagship tier, narrowing the gap to mid-tier models and pressuring competitors on cost.

Open Source· MarkTechPost· Jun 4, 2026· 2 days ago

Miso Labs Releases MisoTTS: An 8B Emotive Text-to-Speech Model with Open Weights

Miso Labs has released MisoTTS, an open-weights 8B text-to-speech model. It uses residual vector quantization (RVQ) to scale its sonic range without scaling parameters, and conditions on both text and audio context to r…

Model Launches· The Decoder· Jun 4, 2026· 2 days ago

xAI updates Grok Imagine to 1.5 with image-to-video generation at 720p resolution

xAI has released "grok-imagine-video-1.5-preview," an image-to-video model that turns still images into cinematic videos at up to 720p based on text prompts. Multiple clips can be stitched together into longer scenes. T…

Open Source· MarkTechPost· Jun 4, 2026· 2 days ago

Meet OpenJarvis: A Local-First Framework for On-Device Personal AI Agents with Tools, Memory, and Learning

Stanford researchers released OpenJarvis, an open-source framework that runs inference, agents, memory, and learning entirely on-device. It decomposes a personal AI system into five composable primitives — Intelligence,…

Model Launches· HotON Desk· Jun 4, 2026· 2 days ago

New 1M-context multimodal model enters public preview

The model handles long documents, images and audio in a single context window, expanding the design space for agentic and retrieval-heavy workloads.

Model Launches· WIRED· Jun 4, 2026· 2 days ago

OpenAI and Anthropic Sign Letter to Prevent AI-Developed Biological Weapons

Leading AI labs, executives, and scientists are sending a letter to lawmakers urging them to improve tracking of synthetic DNA sequences that could be used for bioweapons.

Model Launches· OpenAI· Jun 4, 2026· 2 days ago

Biodefense in the Intelligence Age

An action plan for AI-powered biological resilience

Model Launches· Hugging Face· Jun 4, 2026· 2 days ago

Designing the hf CLI as an agent-optimized way to work with the Hub

Designing the hf CLI as an agent-optimized way to work with the Hub

Model Launches· TechCrunch· Jun 4, 2026· 2 days ago

Lovable signs multiyear deal with Google Cloud to up usage 5x, source says

Lovable and Google signed an expanded multiyear deal that involves a 5x expansion of Lovable's footprint on Google Cloud, and expanded access to Anthropic Claude.

Model Launches· Ars Technica· Jun 4, 2026· 2 days ago

Google ordered to put clearer links in AI search and let UK publishers opt out

Google must change AI Overviews after claiming users don't want "lots of sources."

Model Launches· AWS ML· Jun 4, 2026· 2 days ago

How to build self-driving AI operations on Amazon Bedrock at scale

In this post, we introduce Amazon Bedrock Ops Alert, a three-layer automated monitoring solution that proactively detects operational issues, dynamically adjusts alarm thresholds, classifies alarms by category, automati…

Open Source· The Decoder· Jun 4, 2026· 2 days ago

Google Deepmind's Gemma 4 12B squeezes multimodal AI onto a laptop with just 16 GB of RAM

Google Deepmind's Gemma 4 12B is an open-source model that processes text, images, and audio natively and runs on laptops with just 16 GB of RAM. It nearly matches the twice-as-large 26B model in benchmarks and ships un…

Pricing· TechCrunch· Jun 4, 2026· 2 days ago

Alphabet’s record-breaking $85B raise for Google’s AI business is a helluva good signal

If Alphabet's record-breaking $85 billion stock sale signals investor appetite for AI-related offerings, we can see that investors are ready to chow.

Funding & M&A· The Decoder· Jun 4, 2026· 3 days ago

Google lets sites opt out of AI search results, knowing most have nowhere else to go

For the first time, Google is giving website operators an opt-out toggle in Search Console for AI search features like AI Overviews and AI Mode, which together already reach more than 3.5 billion monthly users. New perf…

Model Launches· MarkTechPost· Jun 4, 2026· 3 days ago

How to Build a Document Intelligence Backend with iii Using Workers, Functions, and Cron Triggers

We build a document intelligence backend with iii by registering modular functions and reusing them across multiple triggers. The post How to Build a Document Intelligence Backend with iii Using Workers, Functions, and…

Model Launches· Ars Technica· Jun 4, 2026· 3 days ago

Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

Gemma 4 12B uses a new encoding scheme and token prediction to punch above its weight.

Model Launches· TechCrunch· Jun 4, 2026· 3 days ago

Google’s Dreambeans, its weirdest-named AI tool to date, will turn your life into a cartoon

Dreambeans is a curated list of AI-illustrated "stories" culled from the personal data in your Google account.

Policy· WIRED· Jun 4, 2026· 3 days ago

xAI Asks Court to Strip Alleged Grok Deepfake Nudes Victims of Anonymity

Four people suing Elon Musk's AI firm under pseudonyms due to the risks of being identified may face a difficult choice: Reveal your real names, or drop the lawsuit.

Open Source· MarkTechPost· Jun 4, 2026· 3 days ago

Google DeepMind Releases Gemma 4 12B: An Encoder-Free Multimodal Model with Native audio that runs on a 16 GB laptop

Gemma 4 12B feeds vision and audio straight into the LLM backbone, running locally under an Apache 2.0 license. The post Google DeepMind Releases Gemma 4 12B: An Encoder-Free Multimodal Model with Native audio that runs…

Model Launches· Ars Technica· Jun 4, 2026· 3 days ago

Trump plan to test AI models has a problem—US security teams were gutted by DOGE

Critics say Trump plan to test AI models is short-sighted, performative.

Model Launches· WIRED· Jun 4, 2026· 3 days ago

The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

Spencer Huang, Nvidia’s robotics lead, tells WIRED that the new bot combines the best of both worlds.

Model Launches· AWS ML· Jun 4, 2026· 3 days ago

Fundamental’s Large Tabular Model NEXUS is now available on Amazon SageMaker JumpStart

In this post, we show you how to get started with NEXUS on Amazon SageMaker JumpStart, walk through the deployment process, and demonstrate how to run predictions against your enterprise datasets.

Model Launches· The Verge· Jun 4, 2026· 3 days ago

As AI gets better, it reveals an empty promise

This week we've got tandem hands-ons with Google's new Gemini AI agent - Spark - from my colleagues David Pierce and Jay Peters. Their takeaways are similar: It's so effective that it's scary. Spark knew that David's do…

Model Launches· AWS ML· Jun 4, 2026· 3 days ago

Reducing container cold start times using SOCI index on DLAMI and DLC

In this post, we look at how to use SOCI on publicly available Deep Learning AMIs and Containers, when to use the various SOCI modes provided by the tool, and how to quickly and efficiently use this tool in your workloa…

Model Launches· The Verge· Jun 4, 2026· 3 days ago

Amazon&#8217;s search bar will invent AI-generated products you can&#8217;t buy

Amazon's updated search bar will now show you AI-generated images of products as you describe them. For now, the in-app feature only surfaces AI images of clothing and home goods, allowing you to tap on the image that b…

Model Launches· AWS ML· Jun 3, 2026· 3 days ago

Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI

In this post, you learn how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) together to improve the tool-calling accuracy of a small language model (SLM). The example uses Amazon SageMaker A…

Model Launches· TechCrunch· Jun 3, 2026· 3 days ago

Amazon will show AI product images when you search for some reason

Amazon will use visual search and AI to show AI-generated product images that match your search queries. The retailer says it will help guide users to products.

Policy· WIRED· Jun 3, 2026· 3 days ago

This Is How Trump Finally Signed the AI Executive Order

After shelving the original executive order last month, Donald Trump finally got on board Monday night.

Infrastructure· HotON Desk· Jun 3, 2026· 3 days ago

Regional GPU capacity loosens as new clusters come online

Fresh data-center capacity in several regions eased GPU availability, helping push regional inference cost indices lower this week.

Model Launches· NVIDIA· Jun 3, 2026· 3 days ago

NVIDIA Research Unlocks Advanced Grasping, Smarter Autonomous Driving and Agent Training at Scale

What makes a robot gripper useful isn’t that it can pick up one object — it’s that it can pick up the next one, and the one after that, with a tool it’s never held before. What makes an autonomous vehicle system safe is…

Model Launches· NVIDIA· Jun 3, 2026· 3 days ago

NVIDIA Enables the Next Era Of Physical AI Research With Agent Skills For Autonomous Vehicles, Robotics And Vision AI

At CVPR, NVIDIA is unveiling new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems. The core challenge in physical AI research isn’t…

Policy· The Verge· Jun 3, 2026· 3 days ago

Microsoft and OpenAI broke up — now they’re ready to fight

At Microsoft's annual Build conference on Tuesday, the company announced a slew of new or expanded AI initiatives, including a super app, in-house reasoning models, a cybersecurity tool, and OpenClaw-esque AI agents. Al…

Model Launches· Ars Technica· Jun 3, 2026· 3 days ago

Inside Meta's attempts to play catch-up with AI

Doubts linger over whether Meta can close the gap with rivals.

Model Launches· OpenAI· Jun 3, 2026· 3 days ago

Introducing new capabilities to GPT-Rosalind

GPT-Rosalind advances life sciences research with enhanced biological reasoning, medicinal chemistry expertise, genomics analysis, and experimental workflow capabilities.

Model Launches· Google AI· Jun 3, 2026· 3 days ago

5 ways Google Search can level up your thrift and vintage shopping

Uncover second-hand scores with AI tools in Google Search and Shopping.

Model Launches· Hugging Face· Jun 3, 2026· 3 days ago

Direct Preference Optimization Beyond Chatbots

Direct Preference Optimization Beyond Chatbots

Model Launches· Simon Willison· Jun 3, 2026· 3 days ago

Uber Caps Usage of AI Tools Like Claude Code to Manage Costs

<p><strong><a href="https://www.bloomberg.com/news/articles/2026-06-02/uber-caps-usage-of-ai-tools-like-claude-code-to-cut-costs">Uber Caps Usage of AI Tools Like Claude Code to Manage Costs</a></strong></p> I wrote <a…

Model Launches· OpenAI· Jun 3, 2026· 3 days ago

How Wasmer used Codex to build a Node.js runtime for the edge

See how Wasmer used Codex with GPT-5.5 to build a Node.js runtime for the edge, accelerating development 10x to 20x and shipping in weeks instead of months.

Funding & M&A· HotON Desk· Jun 3, 2026· 3 days ago

Inference-optimization startup raises Series B for edge serving

The round funds expansion of low-latency edge inference, a segment drawing investor attention as deployment costs become a competitive lever.

Policy· OpenAI· Jun 3, 2026· 3 days ago

OpenAI public policy agenda

OpenAI outlines its public policy agenda for AI, including safety, youth protection, workforce transition, and global standards to ensure AI benefits society.

Policy· OpenAI· Jun 3, 2026· 3 days ago

A blueprint for democratic governance of frontier AI

OpenAI outlines a blueprint for U.S. governance of frontier AI, proposing a federal framework for safety, resilience, and national security.

Model Launches· MarkTechPost· Jun 3, 2026· 3 days ago

Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with Streaming Tool Output

Hermes Desktop is a no-terminal GUI sharing one agent core, skills, and memory with the Hermes Agent CLI. The post Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with S…

Model Launches· MarkTechPost· Jun 3, 2026· 3 days ago

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

NVIDIA released Cosmos 3, open omnimodal world models pairing an autoregressive VLM reasoner with a diffusion generator for physical AI. The post NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation…

Model Launches· Hugging Face· Jun 3, 2026· 3 days ago

Adding MCP Tools to Reachy Mini

Adding MCP Tools to Reachy Mini

Model Launches· Simon Willison· Jun 3, 2026· 3 days ago

Microsoft's new MAI models

<p>Microsoft <a href="https://microsoft.ai/news/building-a-hillclimbing-machine-launching-seven-new-mai-models/">announced two new text LLMs</a> this morning - <strong><a href="https://microsoft.ai/news/introducing-mai-…

Model Launches· NVIDIA· Jun 3, 2026· 3 days ago

Industrial Software Leaders Build Secure, Autonomous AI Engineers With NVIDIA NemoClaw

Accelerated computing has revolutionized industrial engineering, compressing simulation times from weeks to hours. Today’s remaining challenges sit in the end-to-end workflow surrounding the simulations: computer-aided…

Model Launches· Ars Technica· Jun 3, 2026· 3 days ago

Microsoft's Project Solara is an Android OS designed for agents instead of apps

Microsoft missed the boat on apps, so get ready for agents.

Model Launches· Simon Willison· Jun 3, 2026· 4 days ago

datasette-agent-micropython 0.1a0

<p><strong>Release:</strong> <a href="https://github.com/datasette/datasette-agent-micropython/releases/tag/0.1a0">datasette-agent-micropython 0.1a0</a></p> <p>I want <a href="https://agent.datasette.io">Datasette Agent…

Model Launches· Simon Willison· Jun 3, 2026· 4 days ago

micropython-wasm 0.1a1

<p><strong>Release:</strong> <a href="https://github.com/simonw/micropython-wasm/releases/tag/0.1a1">micropython-wasm 0.1a1</a></p> <p>Fixes for some limitations that emerged while I was trying to use this to build <cod…

Open Source· HotON Desk· Jun 3, 2026· 4 days ago

Open-weights reasoning model matches closed peers on key benchmarks

A new open-weights release reports parity with leading closed models on several reasoning benchmarks, strengthening the open ecosystem's momentum.

Model Launches· NVIDIA· Jun 3, 2026· 4 days ago

NVIDIA Partners With Microsoft on Unified Stack for Agentic AI Deployment, From Windows Devices to Cloud to Local

The agentic AI moment has arrived, but delivering on its promise requires more than good models. It also takes fast hardware, secure runtimes, a responsive data layer and models tuned for long-running reasoning. NVIDIA…

Model Launches· Ars Technica· Jun 3, 2026· 4 days ago

Mathematicians warn of AI threats to profession as industry encroaches

International Mathematical Union endorses warning about tech industry influence.

Model Launches· Simon Willison· Jun 3, 2026· 4 days ago

California Brown Pelican

<p><img src="https://static.inaturalist.org/photos/671786719/large.jpg" alt="California Brown Pelican"></p><p>California Brown Pelican, in Fort Mason, CA, US</p><p>I'm at the <a href="https://build.microsoft.com/">Micro…

Model Launches· Ars Technica· Jun 3, 2026· 4 days ago

Android phones will soon be able to detect spoofed calls and impersonation scams

Google's June Android feature drop includes more scam detection, more AirDrop, and yes, more AI.

Model Launches· AWS ML· Jun 3, 2026· 4 days ago

The art and science of hyperparameter optimization on Amazon Nova Forge

Fine-tuning for domain-specific tasks means improving performance in one area without degrading the model’s general capabilities, and getting that balance right is harder than it looks. This post walks through how to na…

Model Launches· AWS ML· Jun 3, 2026· 4 days ago

Object detection with Amazon Nova 2 Lite

In this post, we'll walk through implementing object detection with Amazon Nova 2 Lite. You'll learn how to deploy an object detection application using Amazon Bedrock, AWS Lambda, and Amazon API Gateway. You'll also le…

Model Launches· AWS ML· Jun 2, 2026· 4 days ago

How Baz improved its AI Agent Code Review accuracy using Amazon Bedrock AgentCore

This post walks through how Baz built their Spec Review agent using Amazon Bedrock and Amazon Bedrock AgentCore. We'll cover the architecture decisions, implementation details, and the business outcomes they achieved by…

Model Launches· Hugging Face· Jun 2, 2026· 4 days ago

Holo3.1: Fast & Local Computer Use Agents

Holo3.1: Fast & Local Computer Use Agents

Model Launches· OpenAI· Jun 2, 2026· 4 days ago

Travelers deploys AI-powered claims countrywide with OpenAI

Travelers built an AI-powered Claim Assistant with OpenAI to guide customers through filing claims, provide 24/7 support, and scale operations during peak demand.

Model Launches· MIT Tech Review· Jun 2, 2026· 4 days ago

Rehumanizing global health care with agentic AI

The global health care sector is under increasing strain. Decades of chronic underinvestment and constraints in recruitment have coincided with a surge in demand for services for aging populations. Gaps in provision are…

Funding & M&A· OpenAI· Jun 2, 2026· 4 days ago

Codex for every role, tool, and workflow

Discover new Codex plugins, sites, and annotations that help analysts, marketers, designers, investors, and other teams get more done with AI.

Policy· HotON Desk· Jun 2, 2026· 4 days ago

New regional rules clarify cross-border inference and data residency

Updated guidance sets clearer expectations for where inference may run and how data is stored, with implications for multi-region deployments.

Model Launches· MIT Tech Review· Jun 2, 2026· 4 days ago

How small businesses can leverage AI

This article is from Making AI Work, MIT Technology Review’s limited-run newsletter examining how to apply LLMs across industries. To receive it in your inbox,sign up here. From accounting to design to market research a…

Model Launches· NVIDIA· Jun 2, 2026· 4 days ago

Why Financial Institutions Are Converging on Transaction Foundation Models to Build Their Own Intelligence

Financial institutions have spent years building AI: fraud models, credit models, recommendation engines and risk systems. While this sprawl of task-specific models has been effective, it’s also constrained by siloed sy…

Model Launches· Simon Willison· Jun 2, 2026· 4 days ago

Pasted File Editor

<p><strong>Tool:</strong> <a href="https://tools.simonwillison.net/pasted-file-editor">Pasted File Editor</a></p> <p>I really like how you can paste a large volume of text into <a href="https://claude.ail">claude.ai</a>…

Model Launches· NVIDIA· Jun 2, 2026· 4 days ago

NVIDIA Jetson Brings Agentic AI to the Physical World

Agentic AI is getting physical. At COMPUTEX on Tuesday, NVIDIA announced NVIDIA JetPack 7.2 and NVIDIA NemoClaw support on NVIDIA Jetson. JetPack 7.2 brings agentic AI skills, Yocto project support, NVIDIA CUDA 13 on NV…

Model Launches· Google AI· Jun 2, 2026· 5 days ago

How we used Gemini to build Google I/O 2026

Learn how Googlers used AI to produce Google I/O 2026.

Model Launches· Hugging Face· Jun 1, 2026· 5 days ago

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Pricing· HotON Desk· Jun 1, 2026· 5 days ago

Batch and cached-prompt discounts widen the gap to real-time pricing

Deeper discounts for batched and cached workloads are reshaping cost planning, rewarding teams that can tolerate latency or reuse context.

Model Launches· Hugging Face· Jun 1, 2026· 5 days ago

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Infrastructure· HotON Desk· Jun 1, 2026· 5 days ago

Energy-linked compute pricing rolls out in two more regions

More providers are tying compute prices to local energy conditions, adding a time-of-day dimension to where and when AI workloads run cheapest.

Infrastructure· NVIDIA· Jun 1, 2026· 5 days ago

NVIDIA AI Cloud Ecosystem Expands Worldwide to Meet Global AI Compute Demand

The NVIDIA AI Cloud ecosystem is accelerating the global buildout of AI factory infrastructure. Partners are expanding capacity to meet growing demand from enterprises, startups, nations, AI labs and developers scaling…

Model Launches· HotON Desk· Jun 1, 2026· 6 days ago

Agentic model adds native tool-use and longer action horizons

An updated agentic model improves multi-step tool use and reliability on long tasks, a focus area as agent workloads move toward production.

Pricing· HotON Desk· May 31, 2026· 6 days ago

Price competition intensifies among China-based model providers

Several China-based providers adjusted token pricing downward, pushing the China AI Model Price Index to a new monthly low.

Model Launches· Google AI· May 30, 2026· 8 days ago

Take our I/O 2026 quiz, vibe coded in Google AI Studio.

We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements.

Model Launches· Google AI· May 30, 2026· 8 days ago

9 demos of Gemini Omni and Gemini 3.5 in action

Watch 9 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.

Model Launches· Google AI· May 29, 2026· 8 days ago

Check out real-life AI prototypes from the Futures Lab.

University of Waterloo students develop AI prototypes like sign language tutors to reshape the future of education and work.

Policy· MIT Tech Review· May 29, 2026· 8 days ago

How the Pope’s Magnifica Humanitas offers a template for individuals to meet the AI moment

Pope Leo XIV’s new encyclical on artificial intelligence includes a statement that warrants serious attention from technologists and policymakers: “Technology is never neutral.” Magnifica Humanitas (“Magnificent Humanit…

Model Launches· Google AI· May 28, 2026· 9 days ago

Catch up on 12 major I/O 2026 moments

Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.

Model Launches· MIT Tech Review· May 28, 2026· 9 days ago

The AI Hype Index: AI gets booed in graduation season

It is one thing to say AI will change the world. It is another to expect the class of 2026 to applaud it. In fact, when former Google CEO Eric Schmidt told University of Arizona graduates that their task is to help shap…

Model Launches· MIT Tech Review· May 26, 2026· 11 days ago

Rethinking organizational design in the age of agentic AI

Amid rapidly growing adoption of enterprise-level AI agents, there’s a disconnect emerging between ambition and execution. Although 85% of organizations say they want to be agentic within the next three years, 76% say t…

Model Launches· MIT Tech Review· May 26, 2026· 11 days ago

It’s time to address the looming crisis in entry-level work

Artificial intelligence has not so far produced a clean story of mass unemployment. Aggregate employment in developed countries remains broadly stable, and recent assessments have found limited evidence that AI has shif…

Model Launches· MIT Tech Review· May 26, 2026· 11 days ago

A reality check on the AI jobs hysteria

Haven’t you heard? White-collar jobs are going away, decimated by AI. Waves of layoffs in the tech sector (most recently at Coinbase and Meta and Cisco) are said to presage what will soon come for all of us knowledge wo…

Model Launches· Google AI· May 23, 2026· 15 days ago

Catch up on the Dialogues stage at Google I/O 2026.

A recap of the 2026 I/O Dialogues, where leaders discuss the future of AI, quantum computing, robotics and creativity.

Model Launches· Google DeepMind· May 22, 2026· 15 days ago

We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks

Funding & M&A· Google AI· May 21, 2026· 16 days ago

We’re announcing new community investments in Missouri.

We’re helping build the state’s next-generation workforce and investing in energy programs.

Model Launches· VentureBeat· May 20, 2026· 18 days ago

Google just redesigned the search box for the first time in 25 years — here’s why it matters more than you think.

For a quarter century, the Google search box has been one of the most recognizable interfaces in computing: a thin white rectangle, a blinking cursor, a few typed words, and a list of blue links. On Tuesday, Google will…

Model Launches· Google DeepMind· May 19, 2026· 19 days ago

Fast-tracking genetic leads to reverse cellular aging

Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.

Model Launches· Google DeepMind· May 18, 2026· 19 days ago

Simulate real-world places with Project Genie and Street View

We’re expanding access to Google AI Ultra subscribers globally and introducing a new capability powered by Street View.

Model Launches· Google DeepMind· May 18, 2026· 19 days ago

Introducing Gemini Omni

Model Launches· Google DeepMind· May 18, 2026· 19 days ago

Introducing Google Antigravity 2.0

Model Launches· Google DeepMind· May 17, 2026· 20 days ago

Gemini for Science: AI experiments and tools for a new era of discovery

A collection of science tools and experiments to expand the scale and precision of scientific exploration.

Model Launches· Google DeepMind· May 17, 2026· 20 days ago

Making it easier to understand how content was created and edited

We're expanding our tools to help you understand how content was created and edited across the web.

Model Launches· Google DeepMind· May 16, 2026· 21 days ago

Strengthening Singapore’s AI Future: A New National Partnership

Google DeepMind and Singapore partner to apply frontier AI to address complex challenges across health, education, and sustainability and more.

Model Launches· Berkeley AI (BAIR)· May 8, 2026· 29 days ago

Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling

.apr-fig { text-align: center; margin: 1.35em 0; line-height: 1.4; } .apr-fig--wide img { display: inline-block; width: 100%; max-width: 100%; height: auto; vertical-align: middle; } .apr-fig--wide-0-8 { max-width: 80%;…

Model Launches· Berkeley AI (BAIR)· Apr 20, 2026· 2 months ago

Gradient-based Planning for World Models at Longer Horizons

.grasp-results-table table { font-size: 0.875rem; line-height: 1.35; width: 100%; } .grasp-results-table th, .grasp-results-table td { padding: 0.35rem 0.5rem; } /* Consistent whitespace between major sections (this pos…

Model Launches· Berkeley AI (BAIR)· Mar 13, 2026· 3 months ago

Identifying Interactions at Scale for LLMs

--> Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to make the decisi…

Pricing· VentureBeat· Jan 22, 2026· 5 months ago

Railway secures $100 million to challenge AWS with AI-native cloud infrastructure

Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million in a Series B funding round, as surgin…

Pricing· VentureBeat· Jan 19, 2026· 5 months ago

Claude Code costs up to $200 a month. Goose does the same thing for free.

The artificial intelligence coding revolution comes with a catch: it's expensive. Claude Code, Anthropic's terminal-based AI agent that can write, debug, and deploy code autonomously, has captured the imagination of sof…

Pricing· VentureBeat· Jan 16, 2026· 5 months ago

Listen Labs raises $69M after viral billboard hiring stunt to scale AI customer interviews

Alfred Wahlforss was running out of options. His startup, Listen Labs, needed to hire over 100 engineers, but competing against Mark Zuckerberg's $100 million offers seemed impossible. So he spent $5,000 — a fifth of hi…

Model Launches· VentureBeat· Jan 13, 2026· 5 months ago

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce on Tuesday launched an entirely rebuilt version of Slackbot, the company's workplace assistant, transforming it from a simple notification tool into what executives describe as a fully powered AI agent capabl…

Infrastructure· VentureBeat· Jan 12, 2026· 5 months ago

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Anthropic released Cowork on Monday, a new AI agent capability that extends the power of its wildly successful Claude Code tool to non-technical users — and according to company insiders, the team built the entire featu…

Model Launches· Berkeley AI (BAIR)· Jan 10, 2026· 5 months ago

Information-Driven Design of Imaging Systems

An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how well measurements dist…

Open Source· VentureBeat· Jan 8, 2026· 5 months ago

Nous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code moment

Nous Research, the open-source artificial intelligence startup backed by crypto venture firm Paradigm, released a new competitive programming model on Monday that it says matches or exceeds several larger proprietary sy…

Model Launches· Berkeley AI (BAIR)· Nov 1, 2025· 7 months ago

RL without TD learning

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer. Unlike traditional methods, this algorithm is not based on temporal difference (TD) learning (…

Model Launches· Berkeley AI (BAIR)· Sep 1, 2025· 9 months ago

What exactly does word2vec learn?

What exactly does word2vec learn, and how? Answering this question amounts to understanding representation learning in a minimal yet interesting language modeling task. Despite the fact that word2vec is a well-known pre…

Model Launches· Berkeley AI (BAIR)· Jul 1, 2025· 11 months ago

Whole-Body Conditioned Egocentric Video Prediction

.modal { display: none; position: fixed; z-index: 9999; padding-top: 50px; left: 0; top: 0; width: 100%; height: 100%; overflow: auto; background-color: rgba(0,0,0,0.9); } .modal-content { margin: auto; display: block;…

Model Launches· Berkeley AI (BAIR)· Apr 11, 2025· last year

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1 threat by OWASP…

Model Launches· Meta Research· May 17, 2023· 3 years ago

How generational differences affect consumer attitudes towards ads

Our research study, in collaboration with CrowdDNA, aims to understand people's relationship with social media ads across different social media platforms.

Model Launches· Meta Research· Apr 17, 2023· 3 years ago

Every tree counts

Meta set a goal to reach net zero emissions by 2030. We are developing technology to mitigate our carbon footprint and making these openly available.

Model Launches· Meta Research· Apr 14, 2023· 3 years ago

How a non-traditional background led to cutting-edge XR tech

Model Launches· Meta Research· Apr 13, 2023· 3 years ago

A new, unique AI dataset for animating amateur drawings

Model Launches· Meta Research· Apr 12, 2023· 3 years ago

How the metaverse can transform education

Open Source· Meta Research· Apr 6, 2023· 3 years ago

Build faster with Buck2: Our open source build system

Model Launches· Meta Research· Apr 5, 2023· 3 years ago

Announcing the 2023 Meta Research PhD Fellowship award winners

...

Model Launches· Meta Research· Apr 5, 2023· 3 years ago

Introducing Segment Anything: Working toward the first foundation model for image segmentation

Model Launches· Microsoft AI· Dec 7, 2022· 4 years ago

A conversation with Kevin Scott: What’s next in AI

The post A conversation with Kevin Scott: What’s next in AI appeared first on The AI Blog.

Model Launches· Microsoft AI· Oct 13, 2022· 4 years ago

From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative

The post From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative appeared first on The AI Blog.

Open Source· Microsoft AI· Oct 6, 2022· 4 years ago

Microsoft open sources its ‘farm of the future’ toolkit

The post Microsoft open sources its ‘farm of the future’ toolkit appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 25, 2022· 4 years ago

How data and AI will transform contact centres for financial services

The post How data and AI will transform contact centres for financial services appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 21, 2022· 4 years ago

AI-equipped drones study dolphins on the edge of extinction

The post AI-equipped drones study dolphins on the edge of extinction appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 13, 2022· 4 years ago

Online math tutoring service uses AI to help boost students’ skills and confidence

The post Online math tutoring service uses AI to help boost students’ skills and confidence appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 6, 2022· 4 years ago

AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing user in Japan

The post AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing user in Japan appeared first on The AI Blog.

Model Launches· Microsoft AI· Jun 22, 2022· 4 years ago

Microsoft’s framework for building AI systems responsibly

The post Microsoft’s framework for building AI systems responsibly appeared first on The AI Blog.

Summaries are aggregated for information only — follow the source link for the full story. Demo entries are illustrative.