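AI Model Releases
Every major model release and capability update: who shipped what, and how it redraws the price-performance frontier.
137 items
Why it matters
New models reset the frontier of capability and price-performance. Each release changes what a dollar can buy, so teams have to re-evaluate which model to build on.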
Google DeepMind Releases Gemma 4 QAT Checkpoints: Q4_0 and a New Mobile Format Cut On-Device Memory
Compare Gemma 4 edge formats: BF16, Q4_0 QAT, and mobile QAT, on published memory numbers and design tradeoffs.
This is your laptop… on AI
We're now deep into developer conference season, and one of the themes so far is the relentless conviction from Big Tech companies that AI is going to change everything about how we do everything. Nvidia's Jensen Huang…
The Fitbit Air is a good wearable weighed down by a chatty AI "coach"
The Air succeeds as a minimalist, reliable fitness tracker, but Google's AI Health Coach feels unnecessary.
Satya Nadella publicly torches a VP's plan to make Microsoft's AI agent deliberately addictive
Microsoft CEO Satya Nadella has sharply criticized an internal memo proposing to make users "addicted" to the company's new AI agent Scout. "Not sure who is writing and leaking this nonsense," Nadella wrote to about 50…
Has Microsoft Lost Its Mojo (Again)?
Microsoft’s AI products aren’t selling, and GitHub’s been plagued with troubles. WIRED spoke with VP Scott Hanselman about whether the company is in catch-up mode.
The token bill comes due: Inside the industry scramble to manage AI’s runaway costs
"The whole conversation shifted from tokenmaxxing and 'go fast' to 'we need guardrails, how do we control this?'"
The latest AI news we announced in May 2026
Here are Google’s latest AI updates from May 2026
This AI startup says it can tell if a script will make a hit film
When Quilty hit the industry trades earlier this year, the AI startup promised that its tool could accurately predict a film's success just by reading the script. When people actually got a chance to experiment with Qui…
Microsoft trained its MAI models on unlicensed web data despite promising "enterprise grade, clean and commercially licensed data"
Microsoft sells its LLM training approach as different from other AI companies. It isn't. The company trained its new MAI models partly on unlicensed web data like Common Crawl, despite claiming they used only "clean an…
Anthropic's Mythos model is reportedly powering NSA offensive cyber ops against China and Iran
Anthropic has reportedly stationed about half a dozen engineers directly at the NSA to adapt its Mythos AI model for offensive cyber operations. The model could be used to break into networks in China or Iran. That fits…
Quoting Andreas Kling
We will no longer accept public pull requests. [...] A substantial patch used to imply substantial effort, and that effort was a… (source: https://ladybird.org/posts/changing-how-we-develop-ladybird/)
NVIDIA AI Releases Dynamo Snapshot: A CRIU-Based Fast Startup System for AI Inference on Kubernetes
NVIDIA Dynamo Snapshot checkpoints and restores vLLM inference workers on Kubernetes using CRIU and cuda-checkpoint tools.
Perplexity AI Introduces Hybrid Local-Server Inference Orchestrator for Personal Computer: Automatic On-Device and Cloud Task Routing
Perplexity AI announces a hybrid local-server inference orchestrator for Personal Computer, automatically routing AI tasks between on-device and cloud models.
Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatible Endpoint
A hands-on guide to running Microsoft Fara in Colab, testing the browser agent loop with a mock endpoint.
The Meta hack shows there’s more to AI security than Mythos
On June 5, 404 Media reported that attackers had been using Meta’s AI customer support agent to steal Instagram accounts. Their approach was simple: They asked the agent to link the accounts to email addresses that they…
AI Has Come for Serif Fonts
AI companies are using serif to project humanity. Critics are calling it “tasteslop.”
Anthropic says Claude now writes over 90% of its code and wants the world to have an AI pause button
Anthropic is sharing internal data showing how much Claude is speeding up its own AI development: more than 80 percent of production code now comes from Claude, and engineers are shipping eight times as much code per da…
Seoul Purpose: How NVIDIA and South Korea Are Building the Future of AI
Home to cutting-edge sovereign AI infrastructure and robotics innovators, as well as one of the world’s most passionate gaming communities, South Korea is one of the world’s centers of AI. NVIDIA founder and CEO Jensen…
Mira Murati steps back into the spotlight, carefully
In the current environment, remaining heads down has diminishing returns; at some point, you have to make some noise just to remind the market you exist.
AI enthusiasts are in a race against time, AI skeptics are in a race against entropy
Charity Majors neatly… (source: https://charitydotwtf.substack.com/p/ai-enthusiasts-are-in-a-race-against)
Airbnb’s Brian Chesky plans to launch a new AI lab
The Airbnb CEO said last year it hasn't struck an LLM partnership because existing products weren't quite ready.
Building a Semantic Search Engine and Open-Status Classifier over the ResearchMath-14k Dataset
This tutorial walks through a complete NLP pipeline for research-level mathematics. Using the ResearchMath-14k dataset, we extract field-specific keywords with TF-IDF, generate sentence embeddings, visualize the problem…
The skeptic’s guide to humanoid robots going viral on the Internet
Robot demonstrations can distort public perceptions of robotic capabilities.
Apple approves Poke as the first AI agent on its Messages for Business platform
Poke, the startup that lets people use AI agents through simple text messages, has become the first AI agent approved for Apple’s Messages for Business platform.
Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI
Cloudflare CEO says the web's future is "pay to crawl" as bots overtake human traffic
Bot traffic now outpaces human traffic on the internet, Cloudflare CEO Matthew Prince says, years ahead of his late 2027 forecast. He blames AI agents for the surge. His conclusion for the future of the web: "Clearly it…
NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart
Deploy NVIDIA Nemotron 3 Ultra on Amazon SageMaker JumpStart. Get 5x faster inference and 30% lower cost for agentic AI workloads with this frontier reasoning model.
ChatGPT now saves narrative dossiers about you sorted by work, hobbies, and travel preferences
ChatGPT's updated "Dreaming" memory system now builds coherent user profiles from conversations instead of saving scattered bullet points. OpenAI says the success rate for keeping information current jumped from 52.2 pe…
Quoting Emanuel Maiberg, 404 Media
After this story was published Google's spokesperson reached out and asked us to publish a slightly different… (source: https://www.404media.co/google-employees-internally-share-memes-about-how-its-ai-sucks/)
Meta rolls out a new AI creator assistant on Facebook
Creators often have to parse through charts and dashboards to understand their performance, but with the new AI assistant, they can get quick answers to questions like "When should I post?" and "What are people saying i…
What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates
Apple's WWDC nears: here's what you can look forward to.
Bain study finds companies miss AI savings targets because humans keep getting in the way
According to a Bain survey of 951 companies, almost 40 percent achieved less than 10 percent in AI cost savings, even though most had targeted 11 to 20 percent. One alleged reason is that only 7 percent actually run ful…
TSMC struggles to keep up with AI demand: ‘We can only support so much’
Taiwan Semiconductor Manufacturing Co. - the world's biggest semiconductor-maker - is struggling to meet demands from American customers even with its factory buildout in the US, according to reports from Reuters and Bl…
Elon Musk is steamrolling Wall Street to become a trillionaire
Today on Decoder, I’m talking to Ryan Mac, a technology reporter at The New York Times and coauthor of the excellent book Character Limit: How Elon Musk Destroyed Twitter, which came out in 2024. I can’t recommend it en…
OpenAI CEO Sam Altman sees "proactive AI" as the next big phase after chatbots and agents
OpenAI CEO Sam Altman outlines the next phase of AI products: a "proactive AI" that runs constantly in the background and acts on its own instead of waiting for user prompts. Companies are also wrestling with spiraling…
Forecast: Fun Ahead — 18 Games Join in June to Stream on GeForce NOW
June’s forecast with GeForce NOW: 100% chance of gaming. GeForce NOW is lining up new adventures for the month, from big-name blockbusters to quirky indies ready for the spotlight. Members can dive into fresh worlds, sq…
How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent
Let us filter AI slop, you cowards
It's almost impossible to avoid seeing AI-generated content online, but it doesn't have to…
EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios
AI leaders call for tougher protections against AI-aided bioweapons
Some of the AI industry's biggest rivals have put their many, many grievances aside for a common cause: making it harder for people to use their technology to develop biological weapons. In an open letter to US lawmaker…
How Endava is redesigning software delivery around AI agents
Learn how Endava is using AI agents, ChatGPT Enterprise, and Codex to accelerate software delivery, automate workflows, and build an AI-native culture across the enterprise.
Amazon develops a warehouse robot that workers can speak to
Amazon has announced a new version of its fully autonomous warehouse robot, Proteus, that will interact using langua…
Dreaming: Better memory for a more helpful ChatGPT
ChatGPT introduces a new memory system to better remember preferences, keeping context fresh and relevant across conversations.
xAI updates Grok Imagine to 1.5 with image-to-video generation at 720p resolution
xAI has released "grok-imagine-video-1.5-preview," an image-to-video model that turns still images into cinematic videos at up to 720p based on text prompts. Multiple clips can be stitched together into longer scenes. T…
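New multimodal model with a 1-million-token context window enters public preview
The model can process long documents, images, and audio within a single context window, widening the design space for agent and retrieval-heavy workloads.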
OpenAI and Anthropic Sign Letter to Prevent AI-Developed Biological Weapons
Leading AI labs, executives, and scientists are sending a letter to lawmakers urging them to improve tracking of synthetic DNA sequences that could be used for bioweapons.
Biodefense in the Intelligence Age
An action plan for AI-powered biological resilience
Designing the hf CLI as an agent-optimized way to work with the Hub
Lovable signs multiyear deal with Google Cloud to up usage 5x, source says
Lovable and Google signed an expanded multiyear deal that involves a 5x expansion of Lovable's footprint on Google Cloud, and expanded access to Anthropic Claude.
Google ordered to put clearer links in AI search and let UK publishers opt out
Google must change AI Overviews after claiming users don't want "lots of sources."
How to build self-driving AI operations on Amazon Bedrock at scale
In this post, we introduce Amazon Bedrock Ops Alert, a three-layer automated monitoring solution that proactively detects operational issues, dynamically adjusts alarm thresholds, classifies alarms by category, automati…
How to Build a Document Intelligence Backend with iii Using Workers, Functions, and Cron Triggers
We build a document intelligence backend with iii by registering modular functions and reusing them across multiple triggers.
Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM
Gemma 4 12B uses a new encoding scheme and token prediction to punch above its weight.
Google’s Dreambeans, its weirdest-named AI tool to date, will turn your life into a cartoon
Dreambeans is a curated list of AI-illustrated "stories" culled from the personal data in your Google account.
Trump plan to test AI models has a problem—US security teams were gutted by DOGE
Critics say Trump plan to test AI models is short-sighted, performative.
The Humanoid Robot of the Future Is a 6-Foot-Tall Beefcake With a Chinese Body and an American Brain
Spencer Huang, Nvidia’s robotics lead, tells WIRED that the new bot combines the best of both worlds.
Fundamental’s Large Tabular Model NEXUS is now available on Amazon SageMaker JumpStart
In this post, we show you how to get started with NEXUS on Amazon SageMaker JumpStart, walk through the deployment process, and demonstrate how to run predictions against your enterprise datasets.
As AI gets better, it reveals an empty promise
This week we've got tandem hands-ons with Google's new Gemini AI agent - Spark - from my colleagues David Pierce and Jay Peters. Their takeaways are similar: It's so effective that it's scary. Spark knew that David's do…
Reducing container cold start times using SOCI index on DLAMI and DLC
In this post, we look at how to use SOCI on publicly available Deep Learning AMIs and Containers, when to use the various SOCI modes provided by the tool, and how to quickly and efficiently use this tool in your workloa…
Amazon’s search bar will invent AI-generated products you can’t buy
Amazon's updated search bar will now show you AI-generated images of products as you describe them. For now, the in-app feature only surfaces AI images of clothing and home goods, allowing you to tap on the image that b…
Improve your agent’s tool-calling accuracy with SFT and DPO on Amazon SageMaker AI
In this post, you learn how to use Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO) together to improve the tool-calling accuracy of a small language model (SLM). The example uses Amazon SageMaker A…
Amazon will show AI product images when you search for some reason
Amazon will use visual search and AI to show AI-generated product images that match your search queries. The retailer says it will help guide users to products.
NVIDIA Research Unlocks Advanced Grasping, Smarter Autonomous Driving and Agent Training at Scale
What makes a robot gripper useful isn’t that it can pick up one object — it’s that it can pick up the next one, and the one after that, with a tool it’s never held before. What makes an autonomous vehicle system safe is…
NVIDIA Enables the Next Era Of Physical AI Research With Agent Skills For Autonomous Vehicles, Robotics And Vision AI
At CVPR, NVIDIA is unveiling new physical AI agent skills that help researchers and developers speed the development of autonomous vehicles, robots and vision AI systems. The core challenge in physical AI research isn’t…
Inside Meta's attempts to play catch-up with AI
Doubts linger over whether Meta can close the gap with rivals.
Introducing new capabilities to GPT-Rosalind
GPT-Rosalind advances life sciences research with enhanced biological reasoning, medicinal chemistry expertise, genomics analysis, and experimental workflow capabilities.
5 ways Google Search can level up your thrift and vintage shopping
Uncover second-hand scores with AI tools in Google Search and Shopping.
Direct Preference Optimization Beyond Chatbots
Uber Caps Usage of AI Tools Like Claude Code to Manage Costs
I wrote… (source: https://www.bloomberg.com/news/articles/2026-06-02/uber-caps-usage-of-ai-tools-like-claude-code-to-cut-costs)
How Wasmer used Codex to build a Node.js runtime for the edge
See how Wasmer used Codex with GPT-5.5 to build a Node.js runtime for the edge, accelerating development 10x to 20x and shipping in weeks instead of months.
Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with Streaming Tool Output
Hermes Desktop is a no-terminal GUI sharing one agent core, skills, and memory with the Hermes Agent CLI.
NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Physical Reasoning, World Generation, and Action Generation
NVIDIA released Cosmos 3, open omnimodal world models pairing an autoregressive VLM reasoner with a diffusion generator for physical AI.
Adding MCP Tools to Reachy Mini
Microsoft's new MAI models
Microsoft announced two new text LLMs (https://microsoft.ai/news/building-a-hillclimbing-machine-launching-seven-new-mai-models/) this morning - https://microsoft.ai/news/introducing-mai-…
Industrial Software Leaders Build Secure, Autonomous AI Engineers With NVIDIA NemoClaw
Accelerated computing has revolutionized industrial engineering, compressing simulation times from weeks to hours. Today’s remaining challenges sit in the end-to-end workflow surrounding the simulations: computer-aided…
Microsoft's Project Solara is an Android OS designed for agents instead of apps
Microsoft missed the boat on apps, so get ready for agents.
datasette-agent-micropython 0.1a0
Release: datasette-agent-micropython 0.1a0 (https://github.com/datasette/datasette-agent-micropython/releases/tag/0.1a0). I want Datasette Agent (https://agent.datasette.io)…
micropython-wasm 0.1a1
Release: micropython-wasm 0.1a1 (https://github.com/simonw/micropython-wasm/releases/tag/0.1a1). Fixes for some limitations that emerged while I was trying to use this to build…
NVIDIA Partners With Microsoft on Unified Stack for Agentic AI Deployment, From Windows Devices to Cloud to Local
The agentic AI moment has arrived, but delivering on its promise requires more than good models. It also takes fast hardware, secure runtimes, a responsive data layer and models tuned for long-running reasoning. NVIDIA…
Mathematicians warn of AI threats to profession as industry encroaches
International Mathematical Union endorses warning about tech industry influence.
California Brown Pelican
California Brown Pelican, in Fort Mason, CA, US. I'm at the Micro… (https://build.microsoft.com/)
Android phones will soon be able to detect spoofed calls and impersonation scams
Google's June Android feature drop includes more scam detection, more AirDrop, and yes, more AI.
The art and science of hyperparameter optimization on Amazon Nova Forge
Fine-tuning for domain-specific tasks means improving performance in one area without degrading the model’s general capabilities, and getting that balance right is harder than it looks. This post walks through how to na…
Object detection with Amazon Nova 2 Lite
In this post, we'll walk through implementing object detection with Amazon Nova 2 Lite. You'll learn how to deploy an object detection application using Amazon Bedrock, AWS Lambda, and Amazon API Gateway. You'll also le…
How Baz improved its AI Agent Code Review accuracy using Amazon Bedrock AgentCore
This post walks through how Baz built their Spec Review agent using Amazon Bedrock and Amazon Bedrock AgentCore. We'll cover the architecture decisions, implementation details, and the business outcomes they achieved by…
Holo3.1: Fast & Local Computer Use Agents
Travelers deploys AI-powered claims countrywide with OpenAI
Travelers built an AI-powered Claim Assistant with OpenAI to guide customers through filing claims, provide 24/7 support, and scale operations during peak demand.
Rehumanizing global health care with agentic AI
The global health care sector is under increasing strain. Decades of chronic underinvestment and constraints in recruitment have coincided with a surge in demand for services for aging populations. Gaps in provision are…
How small businesses can leverage AI
This article is from Making AI Work, MIT Technology Review’s limited-run newsletter examining how to apply LLMs across industries. To receive it in your inbox, sign up here. From accounting to design to market research a…
Why Financial Institutions Are Converging on Transaction Foundation Models to Build Their Own Intelligence
Financial institutions have spent years building AI: fraud models, credit models, recommendation engines and risk systems. While this sprawl of task-specific models has been effective, it’s also constrained by siloed sy…
Pasted File Editor
Tool: Pasted File Editor (https://tools.simonwillison.net/pasted-file-editor). I really like how you can paste a large volume of text into claude.ai…
NVIDIA Jetson Brings Agentic AI to the Physical World
Agentic AI is getting physical. At COMPUTEX on Tuesday, NVIDIA announced NVIDIA JetPack 7.2 and NVIDIA NemoClaw support on NVIDIA Jetson. JetPack 7.2 brings agentic AI skills, Yocto project support, NVIDIA CUDA 13 on NV…
How we used Gemini to build Google I/O 2026
Learn how Googlers used AI to produce Google I/O 2026.
Introducing Mellum2: A 12B Mixture-of-Experts Model by JetBrains
Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic
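Agent model adds native tool calling and longer action horizons
The updated agent model improves multi-step tool calling and long-task reliability, a key focus as agent workloads move into production.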
Take our I/O 2026 quiz, vibe coded in Google AI Studio.
We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements.
9 demos of Gemini Omni and Gemini 3.5 in action
Watch 9 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.
Check out real-life AI prototypes from the Futures Lab.
University of Waterloo students develop AI prototypes like sign language tutors to reshape the future of education and work.
Catch up on 12 major I/O 2026 moments
Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.
The AI Hype Index: AI gets booed in graduation season
It is one thing to say AI will change the world. It is another to expect the class of 2026 to applaud it. In fact, when former Google CEO Eric Schmidt told University of Arizona graduates that their task is to help shap…
Rethinking organizational design in the age of agentic AI
Amid rapidly growing adoption of enterprise-level AI agents, there’s a disconnect emerging between ambition and execution. Although 85% of organizations say they want to be agentic within the next three years, 76% say t…
It’s time to address the looming crisis in entry-level work
Artificial intelligence has not so far produced a clean story of mass unemployment. Aggregate employment in developed countries remains broadly stable, and recent assessments have found limited evidence that AI has shif…
A reality check on the AI jobs hysteria
Haven’t you heard? White-collar jobs are going away, decimated by AI. Waves of layoffs in the tech sector (most recently at Coinbase and Meta and Cisco) are said to presage what will soon come for all of us knowledge wo…
Catch up on the Dialogues stage at Google I/O 2026.
A recap of the 2026 I/O Dialogues, where leaders discuss the future of AI, quantum computing, robotics and creativity.
We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks
Google just redesigned the search box for the first time in 25 years — here’s why it matters more than you think.
For a quarter century, the Google search box has been one of the most recognizable interfaces in computing: a thin white rectangle, a blinking cursor, a few typed words, and a list of blue links. On Tuesday, Google will…
Fast-tracking genetic leads to reverse cellular aging
Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.
Simulate real-world places with Project Genie and Street View
We’re expanding access to Google AI Ultra subscribers globally and introducing a new capability powered by Street View.
Introducing Gemini Omni
Introducing Google Antigravity 2.0
Gemini for Science: AI experiments and tools for a new era of discovery
A collection of science tools and experiments to expand the scale and precision of scientific exploration.
Making it easier to understand how content was created and edited
We're expanding our tools to help you understand how content was created and edited across the web.
Strengthening Singapore’s AI Future: A New National Partnership
Google DeepMind and Singapore partner to apply frontier AI to address complex challenges across health, education, and sustainability and more.
Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling
Gradient-based Planning for World Models at Longer Horizons
Identifying Interactions at Scale for LLMs
Understanding the behavior of complex machine learning systems, particularly Large Language Models (LLMs), is a critical challenge in modern artificial intelligence. Interpretability research aims to make the decisi…
Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI
Salesforce on Tuesday launched an entirely rebuilt version of Slackbot, the company's workplace assistant, transforming it from a simple notification tool into what executives describe as a fully powered AI agent capabl…
Information-Driven Design of Imaging Systems
An encoder (optical system) maps objects to noiseless images, which noise corrupts into measurements. Our information estimator uses only these noisy measurements and a noise model to quantify how well measurements dist…
RL without TD learning
In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer. Unlike traditional methods, this algorithm is not based on temporal difference (TD) learning (…
What exactly does word2vec learn?
What exactly does word2vec learn, and how? Answering this question amounts to understanding representation learning in a minimal yet interesting language modeling task. Despite the fact that word2vec is a well-known pre…
Whole-Body Conditioned Egocentric Video Prediction
Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)
Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1 threat by OWASP…
How generational differences affect consumer attitudes towards ads
Our research study, in collaboration with CrowdDNA, aims to understand people's relationship with social media ads across different social media platforms.
Every tree counts
Meta set a goal to reach net zero emissions by 2030. We are developing technology to mitigate our carbon footprint and making these openly available.
How a non-traditional background led to cutting-edge XR tech
A new, unique AI dataset for animating amateur drawings
How the metaverse can transform education
Announcing the 2023 Meta Research PhD Fellowship award winners
...
Introducing Segment Anything: Working toward the first foundation model for image segmentation
A conversation with Kevin Scott: What’s next in AI
The post A conversation with Kevin Scott: What’s next in AI appeared first on The AI Blog.
From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative
How data and AI will transform contact centres for financial services
AI-equipped drones study dolphins on the edge of extinction
Online math tutoring service uses AI to help boost students’ skills and confidence
AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing user in Japan
Microsoft’s framework for building AI systems responsibly
Summaries are for reference only; follow the source link to read the full text. Demo entries are illustrative.