Model Launches· The Decoder· Jun 14, 2026· 23 hours ago· 1 min read

AI coding agents find the right file but miss the exact lines that matter, study shows

AI coding agents like Claude Code or Codex reliably find the right file but miss most of the critical lines within it. The new SWE-Explore benchmark is the first to test code search separately from the actual repair, an…

Why it matters

New models reset the capability and price-performance frontier. Teams re-evaluate what to build on whenever a launch shifts what's possible per dollar.

Explore the data behind this

Related HotON.ai pages

Models →Compare →

Read original (The Decoder) →

Summaries are aggregated for information only — follow the source link for the full story. Demo entries are illustrative.

More news

Model Launches2 hours ago

Z.ai Launches GLM-5.2 With a Usable 1M-Token Context, Two Thinking-Effort Levels, and No Benchmarks at Launch

Model Launches6 hours ago

Quoting Julia Evans

Model Launches7 hours ago

Claude Code Guide 2026: 25 Features with Examples + Demo

Model Launches8 hours ago

Why AI hasn’t replaced software engineers, and won’t