LIVE
DEEPSEEK-V4-FL$0.20 26.3%
DEEPSEEK-V4-PR$0.87 4.6%
QWEN3.6-FLASH$1.13 24.9%
NEMOTRON-3-SUP$0.45 23.9%
LLAMA-4-MAVERI$0.60 11.9%
LLAMA-4-SCOUT$0.30 5.8%
GEMINI-3.1-FLA$1.50 10.0%
GEMINI-2.5-FLA$0.40 0.8%
MINIMAX-01$1.10 8.0%
MIMO-V2.5$0.28 5.8%
MIMO-V2.5-PRO$0.87 2.6%
MINIMAX-M3$1.20 6.1%
QWEN3.5-PLUS-2$1.80 6.0%
NOVA-2-LITE-V1$2.50 31.4%
GEMINI-2.5-FLA$2.50 9.5%
GROK-4.3$2.50 8.7%
QWEN3.6-PLUS$1.95 31.6%
NEMOTRON-3-ULT$2.50 25.9%
QWEN3.7-PLUS$1.60 21.4%
MINIMAX-M1$2.20 11.9%
PALMYRA-X5$6.00 26.9%
QWEN3.7-MAX$3.75 3.1%
GEMINI-3.5-FLA$9.00 8.3%
GEMINI-2.5-PRO$10.00 7.2%
GPT-5.4-NANO$1.25 10.0%
NOVA-LITE-V1$0.24 28.9%
KIMI-K2.5$1.90 9.8%
MINISTRAL-14B-$0.20 17.7%
DEEPSEEK-V4-FL$0.20 26.3%
DEEPSEEK-V4-PR$0.87 4.6%
QWEN3.6-FLASH$1.13 24.9%
NEMOTRON-3-SUP$0.45 23.9%
LLAMA-4-MAVERI$0.60 11.9%
LLAMA-4-SCOUT$0.30 5.8%
GEMINI-3.1-FLA$1.50 10.0%
GEMINI-2.5-FLA$0.40 0.8%
MINIMAX-01$1.10 8.0%
MIMO-V2.5$0.28 5.8%
MIMO-V2.5-PRO$0.87 2.6%
MINIMAX-M3$1.20 6.1%
QWEN3.5-PLUS-2$1.80 6.0%
NOVA-2-LITE-V1$2.50 31.4%
GEMINI-2.5-FLA$2.50 9.5%
GROK-4.3$2.50 8.7%
QWEN3.6-PLUS$1.95 31.6%
NEMOTRON-3-ULT$2.50 25.9%
QWEN3.7-PLUS$1.60 21.4%
MINIMAX-M1$2.20 11.9%
PALMYRA-X5$6.00 26.9%
QWEN3.7-MAX$3.75 3.1%
GEMINI-3.5-FLA$9.00 8.3%
GEMINI-2.5-PRO$10.00 7.2%
GPT-5.4-NANO$1.25 10.0%
NOVA-LITE-V1$0.24 28.9%
KIMI-K2.5$1.90 9.8%
MINISTRAL-14B-$0.20 17.7%
Model Launches

AI Model Launches

Every major model release and capability update — who shipped what, and how it shifts the price-performance frontier.

137 stories

Why it matters

New models reset the capability and price-performance frontier. Teams re-evaluate what to build on whenever a launch shifts what's possible per dollar.

Model Launches· MarkTechPost· Jun 6, 2026· 10 hours ago

Google DeepMind Releases Gemma 4 QAT Checkpoints: Q4_0 and a New Mobile Format Cut On-Device Memory

Compare Gemma 4 edge formats: BF16, Q4_0 QAT, and mobile QAT, on published memory numbers and design tradeoffs. The post Google DeepMind Releases Gemma 4 QAT Checkpoints: Q4_0 and a New Mobile Format Cut On-Device Memor…

Model Launches· The Verge· Jun 6, 2026· 12 hours ago

This is your laptop… on AI

We're now deep into developer conference season, and one of the themes so far is the relentless conviction from Big Tech companies that AI is going to change everything about how we do everything. Nvidia's Jensen Huang…

Model Launches· Ars Technica· Jun 5, 2026· 13 hours ago

The Fitbit Air is a good wearable weighed down by a chatty AI "coach"

The Air succeeds as a minimalist, reliable fitness tracker, but Google's AI Health Coach feels unnecessary.

Model Launches· The Decoder· Jun 5, 2026· 13 hours ago

Satya Nadella publicly torches a VP's plan to make Microsoft's AI agent deliberately addictive

Microsoft CEO Satya Nadella has sharply criticized an internal memo proposing to make users "addicted" to the company's new AI agent Scout. "Not sure who is writing and leaking this nonsense," Nadella wrote to about 50…

Model Launches· WIRED· Jun 5, 2026· 14 hours ago

Has Microsoft Lost Its Mojo (Again)?

Microsoft’s AI products aren’t selling, and Github’s been plagued with troubles. WIRED spoke with VP Scott Hanselman about whether the company is in catch-up mode.

Model Launches· TechCrunch· Jun 5, 2026· 14 hours ago

The token bill comes due: Inside the industry scramble to manage AI’s runaway costs

"The whole conversation shifted from tokenmaxxing and 'go fast' to 'we need guardrails, how do we control this?'"

Model Launches· Google AI· Jun 5, 2026· 14 hours ago

The latest AI news we announced in May 2026

Here are Google’s latest AI updates from May 2026

Model Launches· The Verge· Jun 5, 2026· 15 hours ago

This AI startup says it can tell if a script will make a hit film

When Quilty hit the industry trades earlier this year, the AI startup promised that its tool could accurately predict a film's success just by reading the script. When people actually got a chance to experiment with Qui…

Model Launches· The Decoder· Jun 5, 2026· 17 hours ago

Microsoft trained its MAI models on unlicensed web data despite promising "enterprise grade, clean and commercially licensed data"

Microsoft sells its LLM training approach as different from other AI companies. It isn't. The company trained its new MAI models partly on unlicensed web data like Common Crawl, despite claiming they used only "clean an…

Model Launches· The Decoder· Jun 5, 2026· 18 hours ago

Anthropic's Mythos model is reportedly powering NSA offensive cyber ops against China and Iran

Anthropic has reportedly stationed about half a dozen engineers directly at the NSA to adapt its Mythos AI model for offensive cyber operations. The model could be used to break into networks in China or Iran. That fits…

Model Launches· Simon Willison· Jun 5, 2026· 18 hours ago

Quoting Andreas Kling

<blockquote cite="https://ladybird.org/posts/changing-how-we-develop-ladybird/"><p>We will no longer accept public pull requests. [...]</p> <p>A substantial patch used to imply substantial effort, and that effort was a…

Model Launches· MarkTechPost· Jun 5, 2026· 19 hours ago

NVIDIA AI Releases Dynamo Snapshot: A CRIU-Based Fast Startup System for AI Inference on Kubernetes

NVIDIA Dynamo Snapshot checkpoints and restores vLLM inference workers on Kubernetes using CRIU and cuda-checkpoint tools. The post NVIDIA AI Releases Dynamo Snapshot: A CRIU-Based Fast Startup System for AI Inference o…

Model Launches· MarkTechPost· Jun 5, 2026· 19 hours ago

Perplexity AI Introduces Hybrid Local-Server Inference Orchestrator for Personal Computer: Automatic On-Device and Cloud Task Routing

Perplexity AI announces a hybrid local-server inference orchestrator for Personal Computer, automatically routing AI tasks between on-device and cloud models. The post Perplexity AI Introduces Hybrid Local-Server Infere…

Model Launches· MarkTechPost· Jun 5, 2026· 20 hours ago

Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatible Endpoint

A hands-on guide to running Microsoft Fara in Colab, testing the browser agent loop with a mock endpoint. The post Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatible Endpoint…

Model Launches· MIT Tech Review· Jun 5, 2026· 20 hours ago

The Meta hack shows there’s more to AI security than Mythos

On June 5, 404 Media reported that attackers had been using Meta’s AI customer support agent to steal Instagram accounts. Their approach was simple: They asked the agent to link the accounts to email addresses that they…

Model Launches· WIRED· Jun 5, 2026· 20 hours ago

AI Has Come for Serif Fonts

AI companies are using serif to project humanity. Critics are calling it “tasteslop.”

Model Launches· The Decoder· Jun 5, 2026· 20 hours ago

Anthropic says Claude now writes over 90% of its code and wants the world to have an AI pause button

Anthropic is sharing internal data showing how much Claude is speeding up its own AI development: more than 80 percent of production code now comes from Claude, and engineers are shipping eight times as much code per da…

Model Launches· NVIDIA· Jun 5, 2026· 23 hours ago

Seoul Purpose: How NVIDIA and South Korea Are Building the Future of AI

Home to cutting-edge sovereign AI infrastructure and robotics innovators, as well as one of the world’s most passionate gaming communities, South Korea is one of the world’s centers of AI. NVIDIA founder and CEO Jensen…

Model Launches· TechCrunch· Jun 5, 2026· yesterday

Mira Murati steps back into the spotlight, carefully

In the current environment, remaining heads down has diminishing returns; at some point, you have to make some noise just to remind the market you exist.

Model Launches· Simon Willison· Jun 5, 2026· yesterday

AI enthusiasts are in a race against time, AI skeptics are in a race against entropy

<p><strong><a href="https://charitydotwtf.substack.com/p/ai-enthusiasts-are-in-a-race-against">AI enthusiasts are in a race against time, AI skeptics are in a race against entropy</a></strong></p> Charity Majors neatly…

Model Launches· TechCrunch· Jun 5, 2026· yesterday

Airbnb’s Brian Chesky plans to launch a new AI lab

The Airbnb CEO said last year it hasn't struck an LLM partnership because existing products weren't quite ready.

Model Launches· MarkTechPost· Jun 5, 2026· yesterday

Building a Semantic Search Engine and Open-Status Classifier over the ResearchMath-14k Dataset

This tutorial walks through a complete NLP pipeline for research-level mathematics. Using the ResearchMath-14k dataset, we extract field-specific keywords with TF-IDF, generate sentence embeddings, visualize the problem…

Model Launches· Ars Technica· Jun 5, 2026· yesterday

The skeptic’s guide to humanoid robots going viral on the Internet

Robot demonstrations can distort public perceptions of robotic capabilities.

Model Launches· TechCrunch· Jun 5, 2026· yesterday

Apple approves Poke as the first AI agent on its Messages for Business platform

Poke, the startup that lets people use AI agents through simple text messages, has become the first AI agent approved for Apple’s Messages for Business platform.

Model Launches· Hugging Face· Jun 5, 2026· yesterday

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

Model Launches· The Decoder· Jun 5, 2026· yesterday

Cloudflare CEO says the web's future is "pay to crawl" as bots overtake human traffic

Bot traffic now outpaces human traffic on the internet, Cloudflare CEO Matthew Prince says, years ahead of his late 2027 forecast. He blames AI agents for the surge. His conclusion for the future of the web: "Clearly it…

Model Launches· AWS ML· Jun 5, 2026· 2 days ago

NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart

Deploy NVIDIA Nemotron 3 Ultra on Amazon SageMaker JumpStart. Get 5x faster inference and 30% lower cost for agentic AI workloads with this frontier reasoning model.

Model Launches· The Decoder· Jun 5, 2026· 2 days ago

ChatGPT now saves narrative dossiers about you sorted by work, hobbies, and travel preferences

ChatGPT's updated "Dreaming" memory system now builds coherent user profiles from conversations instead of saving scattered bullet points. OpenAI says the success rate for keeping information current jumped from 52.2 pe…

Model Launches· Simon Willison· Jun 5, 2026· 2 days ago

Quoting Emanuel Maiberg, 404 Media

<blockquote cite="https://www.404media.co/google-employees-internally-share-memes-about-how-its-ai-sucks/"><p>After this story was published Google's spokesperson reached out and asked us to publish a slightly different…

Model Launches· TechCrunch· Jun 5, 2026· 2 days ago

Meta rolls out a new AI creator assistant on Facebook

Creators often have to parse through charts and dashboards to understand their performance, but with the new AI assistant, they can get quick answers to questions like "When should I post?" and "What are people saying i…

Model Launches· TechCrunch· Jun 5, 2026· 2 days ago

What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates

Apple's WWDC nears: Here's what you can look forward to.

Model Launches· The Decoder· Jun 5, 2026· 2 days ago

Bain study finds companies miss AI savings targets because humans keep getting in the way

According to a Bain survey of 951 companies, almost 40 percent achieved less than 10 percent in AI cost savings, even though most had targeted 11 to 20 percent. One alleged reason is that only 7 percent actually run ful…

Model Launches· The Verge· Jun 4, 2026· 2 days ago

TSMC struggles to keep up with AI demand: &#8216;We can only support so much&#8217;

Taiwan Semiconductor Manufacturing Co. - the world's biggest semiconductor-maker - is struggling to meet demands from American customers even with its factory buildout in the US, according to reports from Reuters and Bl…

Model Launches· The Verge· Jun 4, 2026· 2 days ago

Elon Musk is steamrolling Wall Street to become a trillionaire

Today on Decoder, I’m talking to Ryan Mac, a technology reporter at The New York Times and coauthor of the excellent book Character Limit: How Elon Musk Destroyed Twitter, which came out in 2024. I can’t recommend it en…

Model Launches· The Decoder· Jun 4, 2026· 2 days ago

OpenAI CEO Sam Altman sees "proactive AI" as the next big phase after chatbots and agents

OpenAI CEO Sam Altman outlines the next phase of AI products: a "proactive AI" that runs constantly in the background and acts on its own instead of waiting for user prompts. Companies are also wrestling with spiraling…

Model Launches· NVIDIA· Jun 4, 2026· 2 days ago

Forecast: Fun Ahead — 18 Games Join in June to Stream on GeForce NOW

June’s forecast with GeForce NOW: 100% chance of gaming. GeForce NOW is lining up new adventures for the month, from big-name blockbusters to quirky indies ready for the spotlight. Members can dive into fresh worlds, sq…

Model Launches· Hugging Face· Jun 4, 2026· 2 days ago

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Model Launches· The Verge· Jun 4, 2026· 2 days ago

Let us filter AI slop, you cowards

Nobody should be subjected to seeing shrimp Jesus all over their social feeds. | Image: Cath Virginia / The Verge, Getty Images It's almost impossible to avoid seeing AI-generated content online, but it doesn't have to…

Model Launches· Hugging Face· Jun 4, 2026· 2 days ago

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios

Model Launches· The Verge· Jun 4, 2026· 2 days ago

AI leaders call for tougher protections against AI-aided bioweapons

Some of the AI industry's biggest rivals have put their many, many grievances aside for a common cause: making it harder for people to use their technology to develop biological weapons. In an open letter to US lawmaker…

Model Launches· OpenAI· Jun 4, 2026· 2 days ago

How Endava is redesigning software delivery around AI agents

Learn how Endava is using AI agents, ChatGPT Enterprise, and Codex to accelerate software delivery, automate workflows, and build an AI-native culture across the enterprise.

Model Launches· The Verge· Jun 4, 2026· 2 days ago

Amazon develops a warehouse robot that workers can speak to

The design hasn’t changed much from the original Proteus, which was announced in 2022. | Image: Amazon Amazon has announced a new version of its fully autonomous warehouse robot, Proteus, that will interact using langua…

Model Launches· OpenAI· Jun 4, 2026· 2 days ago

Dreaming: Better memory for a more helpful ChatGPT

ChatGPT introduces a new memory system to better remember preferences, keeping context fresh and relevant across conversations.

Model Launches· The Decoder· Jun 4, 2026· 2 days ago

xAI updates Grok Imagine to 1.5 with image-to-video generation at 720p resolution

xAI has released "grok-imagine-video-1.5-preview," an image-to-video model that turns still images into cinematic videos at up to 720p based on text prompts. Multiple clips can be stitched together into longer scenes. T…

Model Launches· HotON Desk· Jun 4, 2026· 2 days ago

New 1M-context multimodal model enters public preview

The model handles long documents, images and audio in a single context window, expanding the design space for agentic and retrieval-heavy workloads.

Model Launches· WIRED· Jun 4, 2026· 2 days ago

OpenAI and Anthropic Sign Letter to Prevent AI-Developed Biological Weapons

Leading AI labs, executives, and scientists are sending a letter to lawmakers urging them to improve tracking of synthetic DNA sequences that could be used for bioweapons.

Model Launches· OpenAI· Jun 4, 2026· 2 days ago

Biodefense in the Intelligence Age

An action plan for AI-powered biological resilience

Model Launches· Hugging Face· Jun 4, 2026· 2 days ago

Designing the hf CLI as an agent-optimized way to work with the Hub

Designing the hf CLI as an agent-optimized way to work with the Hub

Model Launches· TechCrunch· Jun 4, 2026· 2 days ago

Lovable signs multiyear deal with Google Cloud to up usage 5x, source says

Lovable and Google signed an expanded multiyear deal that involves a 5x expansion of Lovable's footprint on Google Cloud, and expanded access to Anthropic Claude.

Model Launches· Ars Technica· Jun 4, 2026· 2 days ago

Google ordered to put clearer links in AI search and let UK publishers opt out

Google must change AI Overviews after claiming users don't want "lots of sources."

Model Launches· AWS ML· Jun 4, 2026· 2 days ago

How to build self-driving AI operations on Amazon Bedrock at scale

In this post, we introduce Amazon Bedrock Ops Alert, a three-layer automated monitoring solution that proactively detects operational issues, dynamically adjusts alarm thresholds, classifies alarms by category, automati…

Model Launches· MarkTechPost· Jun 4, 2026· 2 days ago

How to Build a Document Intelligence Backend with iii Using Workers, Functions, and Cron Triggers

We build a document intelligence backend with iii by registering modular functions and reusing them across multiple triggers. The post How to Build a Document Intelligence Backend with iii Using Workers, Functions, and…

Model Launches· Ars Technica· Jun 4, 2026· 2 days ago

Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

Gemma 4 12B uses a new encoding scheme and token prediction to punch above its weight.

Model Launches· TechCrunch· Jun 4, 2026· 2 days ago

Google’s Dreambeans, its weirdest-named AI tool to date, will turn your life into a cartoon

Dreambeans is a curated list of AI-illustrated "stories" culled from the personal data in your Google account.

Model Launches· Ars Technica· Jun 4, 2026· 2 days ago

Trump plan to test AI models has a problem—US security teams were gutted by DOGE

Critics say Trump plan to test AI models is short-sighted, performative.

Model Launches· WIRED· Jun 4, 2026· 2 days ago

The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain

Spencer Huang, Nvidia’s robotics lead, tells WIRED that the new bot combines the best of both worlds.

Model Launches· AWS ML· Jun 4, 2026· 2 days ago

Fundamental’s Large Tabular Model NEXUS is now available on Amazon SageMaker JumpStart

In this post, we show you how to get started with NEXUS on Amazon SageMaker JumpStart, walk through the deployment process, and demonstrate how to run predictions against your enterprise datasets.

Model Launches· The Verge· Jun 4, 2026· 2 days ago

As AI gets better, it reveals an empty promise

This week we've got tandem hands-ons with Google's new Gemini AI agent - Spark - from my colleagues David Pierce and Jay Peters. Their takeaways are similar: It's so effective that it's scary. Spark knew that David's do…

Model Launches· AWS ML· Jun 4, 2026· 3 days ago

Reducing container cold start times using SOCI index on DLAMI and DLC

In this post, we look at how to use SOCI on publicly available Deep Learning AMIs and Containers, when to use the various SOCI modes provided by the tool, and how to quickly and efficiently use this tool in your workloa…

Model Launches· The Verge· Jun 4, 2026· 3 days ago

Amazon&#8217;s search bar will invent AI-generated products you can&#8217;t buy

Amazon's updated search bar will now show you AI-generated images of products as you describe them. For now, the in-app feature only surfaces AI images of clothing and home goods, allowing you to tap on the image that b…

Model Launches· AWS ML· Jun 3, 2026· 3 days ago

Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI

In this post, you learn how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) together to improve the tool-calling accuracy of a small language model (SLM). The example uses Amazon SageMaker A…

Model Launches· TechCrunch· Jun 3, 2026· 3 days ago

Amazon will show AI product images when you search for some reason

Amazon will use visual search and AI to show AI-generated product images that match your search queries. The retailer says it will help guide users to products.

Model Launches· NVIDIA· Jun 3, 2026· 3 days ago

NVIDIA Research Unlocks Advanced Grasping, Smarter Autonomous Driving and Agent Training at Scale

What makes a robot gripper useful isn’t that it can pick up one object — it’s that it can pick up the next one, and the one after that, with a tool it’s never held before. What makes an autonomous vehicle system safe is…

Model Launches· NVIDIA· Jun 3, 2026· 3 days ago

NVIDIA Enables the Next Era Of Physical AI Research With Agent Skills For Autonomous Vehicles, Robotics And Vision AI

At CVPR, NVIDIA is unveiling new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems. The core challenge in physical AI research isn’t…

Model Launches· Ars Technica· Jun 3, 2026· 3 days ago

Inside Meta's attempts to play catch-up with AI

Doubts linger over whether Meta can close the gap with rivals.

Model Launches· OpenAI· Jun 3, 2026· 3 days ago

Introducing new capabilities to GPT-Rosalind

GPT-Rosalind advances life sciences research with enhanced biological reasoning, medicinal chemistry expertise, genomics analysis, and experimental workflow capabilities.

Model Launches· Google AI· Jun 3, 2026· 3 days ago

5 ways Google Search can level up your thrift and vintage shopping

Uncover second-hand scores with AI tools in Google Search and Shopping.

Model Launches· Hugging Face· Jun 3, 2026· 3 days ago

Direct Preference Optimization Beyond Chatbots

Direct Preference Optimization Beyond Chatbots

Model Launches· Simon Willison· Jun 3, 2026· 3 days ago

Uber Caps Usage of AI Tools Like Claude Code to Manage Costs

<p><strong><a href="https://www.bloomberg.com/news/articles/2026-06-02/uber-caps-usage-of-ai-tools-like-claude-code-to-cut-costs">Uber Caps Usage of AI Tools Like Claude Code to Manage Costs</a></strong></p> I wrote <a…

Model Launches· OpenAI· Jun 3, 2026· 3 days ago

How Wasmer used Codex to build a Node.js runtime for the edge

See how Wasmer used Codex with GPT-5.5 to build a Node.js runtime for the edge, accelerating development 10x to 20x and shipping in weeks instead of months.

Model Launches· MarkTechPost· Jun 3, 2026· 3 days ago

Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with Streaming Tool Output

Hermes Desktop is a no-terminal GUI sharing one agent core, skills, and memory with the Hermes Agent CLI. The post Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with S…

Model Launches· MarkTechPost· Jun 3, 2026· 3 days ago

NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation

NVIDIA released Cosmos 3, open omnimodal world models pairing an autoregressive VLM reasoner with a diffusion generator for physical AI. The post NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation…

Model Launches· Hugging Face· Jun 3, 2026· 3 days ago

Adding MCP Tools to Reachy Mini

Adding MCP Tools to Reachy Mini

Model Launches· Simon Willison· Jun 3, 2026· 3 days ago

Microsoft's new MAI models

<p>Microsoft <a href="https://microsoft.ai/news/building-a-hillclimbing-machine-launching-seven-new-mai-models/">announced two new text LLMs</a> this morning - <strong><a href="https://microsoft.ai/news/introducing-mai-…

Model Launches· NVIDIA· Jun 3, 2026· 3 days ago

Industrial Software Leaders Build Secure, Autonomous AI Engineers With NVIDIA NemoClaw

Accelerated computing has revolutionized industrial engineering, compressing simulation times from weeks to hours. Today’s remaining challenges sit in the end-to-end workflow surrounding the simulations: computer-aided…

Model Launches· Ars Technica· Jun 3, 2026· 3 days ago

Microsoft's Project Solara is an Android OS designed for agents instead of apps

Microsoft missed the boat on apps, so get ready for agents.

Model Launches· Simon Willison· Jun 3, 2026· 3 days ago

datasette-agent-micropython 0.1a0

<p><strong>Release:</strong> <a href="https://github.com/datasette/datasette-agent-micropython/releases/tag/0.1a0">datasette-agent-micropython 0.1a0</a></p> <p>I want <a href="https://agent.datasette.io">Datasette Agent…

Model Launches· Simon Willison· Jun 3, 2026· 3 days ago

micropython-wasm 0.1a1

<p><strong>Release:</strong> <a href="https://github.com/simonw/micropython-wasm/releases/tag/0.1a1">micropython-wasm 0.1a1</a></p> <p>Fixes for some limitations that emerged while I was trying to use this to build <cod…

Model Launches· NVIDIA· Jun 3, 2026· 3 days ago

NVIDIA Partners With Microsoft on Unified Stack for Agentic AI Deployment, From Windows Devices to Cloud to Local

The agentic AI moment has arrived, but delivering on its promise requires more than good models. It also takes fast hardware, secure runtimes, a responsive data layer and models tuned for long-running reasoning. NVIDIA…

Model Launches· Ars Technica· Jun 3, 2026· 3 days ago

Mathematicians warn of AI threats to profession as industry encroaches

International Mathematical Union endorses warning about tech industry influence.

Model Launches· Simon Willison· Jun 3, 2026· 3 days ago

California Brown Pelican

<p><img src="https://static.inaturalist.org/photos/671786719/large.jpg" alt="California Brown Pelican"></p><p>California Brown Pelican, in Fort Mason, CA, US</p><p>I'm at the <a href="https://build.microsoft.com/">Micro…

Model Launches· Ars Technica· Jun 3, 2026· 3 days ago

Android phones will soon be able to detect spoofed calls and impersonation scams

Google's June Android feature drop includes more scam detection, more AirDrop, and yes, more AI.

Model Launches· AWS ML· Jun 3, 2026· 3 days ago

The art and science of hyperparameter optimization on Amazon Nova Forge

Fine-tuning for domain-specific tasks means improving performance in one area without degrading the model’s general capabilities, and getting that balance right is harder than it looks. This post walks through how to na…

Model Launches· AWS ML· Jun 3, 2026· 3 days ago

Object detection with Amazon Nova 2 Lite

In this post, we'll walk through implementing object detection with Amazon Nova 2 Lite. You'll learn how to deploy an object detection application using Amazon Bedrock, AWS Lambda, and Amazon API Gateway. You'll also le…

Model Launches· AWS ML· Jun 2, 2026· 4 days ago

How Baz improved its AI Agent Code Review accuracy using Amazon Bedrock AgentCore

This post walks through how Baz built their Spec Review agent using Amazon Bedrock and Amazon Bedrock AgentCore. We'll cover the architecture decisions, implementation details, and the business outcomes they achieved by…

Model Launches· Hugging Face· Jun 2, 2026· 4 days ago

Holo3.1: Fast & Local Computer Use Agents

Holo3.1: Fast & Local Computer Use Agents

Model Launches· OpenAI· Jun 2, 2026· 4 days ago

Travelers deploys AI-powered claims countrywide with OpenAI

Travelers built an AI-powered Claim Assistant with OpenAI to guide customers through filing claims, provide 24/7 support, and scale operations during peak demand.

Model Launches· MIT Tech Review· Jun 2, 2026· 4 days ago

Rehumanizing global health care with agentic AI

The global health care sector is under increasing strain. Decades of chronic underinvestment and constraints in recruitment have coincided with a surge in demand for services for aging populations. Gaps in provision are…

Model Launches· MIT Tech Review· Jun 2, 2026· 4 days ago

How small businesses can leverage AI

This article is from Making AI Work, MIT Technology Review’s limited-run newsletter examining how to apply LLMs across industries. To receive it in your inbox,sign up here. From accounting to design to market research a…

Model Launches· NVIDIA· Jun 2, 2026· 4 days ago

Why Financial Institutions Are Converging on Transaction Foundation Models to Build Their Own Intelligence

Financial institutions have spent years building AI: fraud models, credit models, recommendation engines and risk systems. While this sprawl of task-specific models has been effective, it’s also constrained by siloed sy…

Model Launches· Simon Willison· Jun 2, 2026· 4 days ago

Pasted File Editor

<p><strong>Tool:</strong> <a href="https://tools.simonwillison.net/pasted-file-editor">Pasted File Editor</a></p> <p>I really like how you can paste a large volume of text into <a href="https://claude.ail">claude.ai</a>…

Model Launches· NVIDIA· Jun 2, 2026· 4 days ago

NVIDIA Jetson Brings Agentic AI to the Physical World

Agentic AI is getting physical. At COMPUTEX on Tuesday, NVIDIA announced NVIDIA JetPack 7.2 and NVIDIA NemoClaw support on NVIDIA Jetson. JetPack 7.2 brings agentic AI skills, Yocto project support, NVIDIA CUDA 13 on NV…

Model Launches· Google AI· Jun 2, 2026· 5 days ago

How we used Gemini to build Google I/O 2026

Learn how Googlers used AI to produce Google I/O 2026.

Model Launches· Hugging Face· Jun 1, 2026· 5 days ago

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains

Model Launches· Hugging Face· Jun 1, 2026· 5 days ago

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic

Model Launches· HotON Desk· Jun 1, 2026· 6 days ago

Agentic model adds native tool-use and longer action horizons

An updated agentic model improves multi-step tool use and reliability on long tasks, a focus area as agent workloads move toward production.

Model Launches· Google AI· May 30, 2026· 7 days ago

Take our I/O 2026 quiz, vibe coded in Google AI Studio.

We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements.

Model Launches· Google AI· May 30, 2026· 7 days ago

9 demos of Gemini Omni and Gemini 3.5 in action

Watch 9 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.

Model Launches· Google AI· May 29, 2026· 8 days ago

Check out real-life AI prototypes from the Futures Lab.

University of Waterloo students develop AI prototypes like sign language tutors to reshape the future of education and work.

Model Launches· Google AI· May 28, 2026· 9 days ago

Catch up on 12 major I/O 2026 moments

Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.

Model Launches· MIT Tech Review· May 28, 2026· 9 days ago

The AI Hype Index: AI gets booed in graduation season

It is one thing to say AI will change the world. It is another to expect the class of 2026 to applaud it. In fact, when former Google CEO Eric Schmidt told University of Arizona graduates that their task is to help shap…

Model Launches· MIT Tech Review· May 26, 2026· 11 days ago

Rethinking organizational design in the age of agentic AI

Amid rapidly growing adoption of enterprise-level AI agents, there’s a disconnect emerging between ambition and execution. Although 85% of organizations say they want to be agentic within the next three years, 76% say t…

Model Launches· MIT Tech Review· May 26, 2026· 11 days ago

It’s time to address the looming crisis in entry-level work

Artificial intelligence has not so far produced a clean story of mass unemployment. Aggregate employment in developed countries remains broadly stable, and recent assessments have found limited evidence that AI has shif…

Model Launches· MIT Tech Review· May 26, 2026· 11 days ago

A reality check on the AI jobs hysteria

Haven’t you heard? White-collar jobs are going away, decimated by AI. Waves of layoffs in the tech sector (most recently at Coinbase and Meta and Cisco) are said to presage what will soon come for all of us knowledge wo…

Model Launches· Google AI· May 23, 2026· 14 days ago

Catch up on the Dialogues stage at Google I/O 2026.

A recap of the 2026 I/O Dialogues, where leaders discuss the future of AI, quantum computing, robotics and creativity.

Model Launches· Google DeepMind· May 22, 2026· 15 days ago

We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks

Model Launches· VentureBeat· May 20, 2026· 17 days ago

Google just redesigned the search box for the first time in 25 years — here’s why it matters more than you think.

For a quarter century, the Google search box has been one of the most recognizable interfaces in computing: a thin white rectangle, a blinking cursor, a few typed words, and a list of blue links. On Tuesday, Google will…

Model Launches· Google DeepMind· May 19, 2026· 18 days ago

Fast-tracking genetic leads to reverse cellular aging

Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.

Model Launches· Google DeepMind· May 18, 2026· 19 days ago

Simulate real-world places with Project Genie and Street View

We’re expanding access to Google AI Ultra subscribers globally and introducing a new capability powered by Street View.

Model Launches· Google DeepMind· May 18, 2026· 19 days ago

Introducing Gemini Omni

Model Launches· Google DeepMind· May 18, 2026· 19 days ago

Introducing Google Antigravity 2.0

Model Launches· Google DeepMind· May 17, 2026· 20 days ago

Gemini for Science: AI experiments and tools for a new era of discovery

A collection of science tools and experiments to expand the scale and precision of scientific exploration.

Model Launches· Google DeepMind· May 17, 2026· 20 days ago

Making it easier to understand how content was created and edited

We're expanding our tools to help you understand how content was created and edited across the web.

Model Launches· Google DeepMind· May 16, 2026· 21 days ago

Strengthening Singapore’s AI Future: A New National Partnership

Google DeepMind and Singapore partner to apply frontier AI to address complex challenges across health, education, and sustainability and more.

Model Launches· Berkeley AI (BAIR)· May 8, 2026· 29 days ago

Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling

.apr-fig { text-align: center; margin: 1.35em 0; line-height: 1.4; } .apr-fig--wide img { display: inline-block; width: 100%; max-width: 100%; height: auto; vertical-align: middle; } .apr-fig--wide-0-8 { max-width: 80%;…

Model Launches· Berkeley AI (BAIR)· Apr 20, 2026· 2 months ago

Gradient-based Planning for World Models at Longer Horizons

.grasp-results-table table { font-size: 0.875rem; line-height: 1.35; width: 100%; } .grasp-results-table th, .grasp-results-table td { padding: 0.35rem 0.5rem; } /* Consistent whitespace between major sections (this pos…

Model Launches· Berkeley AI (BAIR)· Mar 13, 2026· 3 months ago

Identifying Interactions at Scale for LLMs

--> Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to make the decisi…

Model Launches· VentureBeat· Jan 13, 2026· 5 months ago

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce on Tuesday launched an entirely rebuilt version of Slackbot, the company's workplace assistant, transforming it from a simple notification tool into what executives describe as a fully powered AI agent capabl…

Model Launches· Berkeley AI (BAIR)· Jan 10, 2026· 5 months ago

Information-Driven Design of Imaging Systems

An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how well measurements dist…

Model Launches· Berkeley AI (BAIR)· Nov 1, 2025· 7 months ago

RL without TD learning

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer. Unlike traditional methods, this algorithm is not based on temporal difference (TD) learning (…

Model Launches· Berkeley AI (BAIR)· Sep 1, 2025· 9 months ago

What exactly does word2vec learn?

What exactly does word2vec learn, and how? Answering this question amounts to understanding representation learning in a minimal yet interesting language modeling task. Despite the fact that word2vec is a well-known pre…

Model Launches· Berkeley AI (BAIR)· Jul 1, 2025· 11 months ago

Whole-Body Conditioned Egocentric Video Prediction

.modal { display: none; position: fixed; z-index: 9999; padding-top: 50px; left: 0; top: 0; width: 100%; height: 100%; overflow: auto; background-color: rgba(0,0,0,0.9); } .modal-content { margin: auto; display: block;…

Model Launches· Berkeley AI (BAIR)· Apr 11, 2025· last year

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1 threat by OWASP…

Model Launches· Meta Research· May 17, 2023· 3 years ago

How generational differences affect consumer attitudes towards ads

Our research study, in collaboration with CrowdDNA, aims to understand people's relationship with social media ads across different social media platforms.

Model Launches· Meta Research· Apr 17, 2023· 3 years ago

Every tree counts

Meta set a goal to reach net zero emissions by 2030. We are developing technology to mitigate our carbon footprint and making these openly available.

Model Launches· Meta Research· Apr 14, 2023· 3 years ago

How a non-traditional background led to cutting-edge XR tech

Model Launches· Meta Research· Apr 13, 2023· 3 years ago

A new, unique AI dataset for animating amateur drawings

Model Launches· Meta Research· Apr 12, 2023· 3 years ago

How the metaverse can transform education

Model Launches· Meta Research· Apr 5, 2023· 3 years ago

Announcing the 2023 Meta Research PhD Fellowship award winners

...

Model Launches· Meta Research· Apr 5, 2023· 3 years ago

Introducing Segment Anything: Working toward the first foundation model for image segmentation

Model Launches· Microsoft AI· Dec 7, 2022· 4 years ago

A conversation with Kevin Scott: What’s next in AI

The post A conversation with Kevin Scott: What’s next in AI appeared first on The AI Blog.

Model Launches· Microsoft AI· Oct 13, 2022· 4 years ago

From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative

The post From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 25, 2022· 4 years ago

How data and AI will transform contact centres for financial services

The post How data and AI will transform contact centres for financial services appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 21, 2022· 4 years ago

AI-equipped drones study dolphins on the edge of extinction

The post AI-equipped drones study dolphins on the edge of extinction appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 13, 2022· 4 years ago

Online math tutoring service uses AI to help boost students’ skills and confidence

The post Online math tutoring service uses AI to help boost students’ skills and confidence appeared first on The AI Blog.

Model Launches· Microsoft AI· Jul 6, 2022· 4 years ago

AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing user in Japan

The post AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing user in Japan appeared first on The AI Blog.

Model Launches· Microsoft AI· Jun 22, 2022· 4 years ago

Microsoft’s framework for building AI systems responsibly

The post Microsoft’s framework for building AI systems responsibly appeared first on The AI Blog.

Summaries are aggregated for information only — follow the source link for the full story. Demo entries are illustrative.