Các bản phát hành trọng lượng mở và các mô hình cộng đồng gây áp lực lên giá đóng và mở rộng quyền truy cập.
Câu chuyện 17
Trọng lượng mở giải phóng áp lực định giá đóng và mở rộng khả năng tiếp cận, thay đổi phép toán xây dựng và mua cho toàn bộ hệ sinh thái.
Zyphra đã phát hành Zamba2-VL, một nhóm mô hình ngôn ngữ tầm nhìn mở ở các tham số 1,2B, 2,7B và 7B. Các mô hình này sử dụng không gian trạng thái lai Mamba2 và đường trục Transformer, được vận chuyển theo Apache 2.0. Họ tiếp tục cạnh tranh…
Cohere's first developer coding model is a 30B mixture-of-experts running on a single H100 with 256K context length. The post Meet ‘North Mini Code’: Cohere’s 30B Open-Weight Mixture-of-Experts Model With 3B Active Para…
DiffusionGemma is Google DeepMind's experimental 26B open model using text diffusion for up to 4x faster generation on GPUs. The post Google AI Releases DiffusionGemma, a 26B MoE Open Model Using Text Diffusion for Up t…
Today, Google DeepMind released DiffusionGemma — an experimental open model built for exceptionally fast text generation. NVIDIA has optimized DiffusionGemma to run even faster across NVIDIA GeForce RTX GPUs, the NVIDIA…
How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces
Migrating Your GitHub CI to Hugging Face Jobs
In this tutorial, we explore the ClawHub Security Signals dataset to see how scanners assess AI skills. We load the data from the Hugging Face Parquet conversion and inspect verdicts, scanner outputs, and severity label…
In this post, we walk you through the Nova Sonic Test Harness, an open source framework that we built to solve both problems. It serves as a rapid iteration tool for tuning system prompts and tool configurations (run a…
The Open Source Community is backing OpenEnv for Agentic RL
Unlike GPT-4o or Qwen3.5-Omni, Audio Interaction doesn't wait for a recording to end: it translates, transcribes, chats, and picks up everyday noises like coughing in a single stream. Code, model weights, and download i…
Kimi Code CLI is Moonshot AI's open-source terminal coding agent, written in TypeScript with subagents and MCP configuration. The post Moonshot AI Releases Kimi Code CLI: A Terminal AI Coding Agent Built in TypeScript f…
Miso Labs has released MisoTTS, an open-weights 8B text-to-speech model. It uses residual vector quantization (RVQ) to scale its sonic range without scaling parameters, and conditions on both text and audio context to r…
Stanford researchers released OpenJarvis, an open-source framework that runs inference, agents, memory, and learning entirely on-device. It decomposes a personal AI system into five composable primitives — Intelligence,…
Google Deepmind's Gemma 4 12B is an open-source model that processes text, images, and audio natively and runs on laptops with just 16 GB of RAM. It nearly matches the twice-as-large 26B model in benchmarks and ships un…
Gemma 4 12B feeds vision and audio straight into the LLM backbone, running locally under an Apache 2.0 license. The post Google DeepMind Releases Gemma 4 12B: An Encoder-Free Multimodal Model with Native audio that runs…
A new open-weights release reports parity with leading closed models on several reasoning benchmarks, strengthening the open ecosystem's momentum.
Nous Research, the open-source artificial intelligence startup backed by crypto venture firm Paradigm, released a new competitive programming model on Monday that it says matches or exceeds several larger proprietary sy…
Các bản tóm tắt chỉ được tổng hợp để cung cấp thông tin - hãy nhấp vào liên kết nguồn để xem toàn bộ câu chuyện. Các mục demo có tính minh họa.