OpenAI's LifeSciBench evaluates whether frontier AI can handle real life-science research across 750 expert-authored tasks, seven workflows, and seven biological domains. Built by 173 PhD scientists with 19,020 rubric c…
New models reset the capability and price-performance frontier. Teams re-evaluate what to build on whenever a launch shifts what's possible per dollar.