Introducing Claude Sonnet 5
Introducing Claude Sonnet 5
Claude Sonnet 5 发布
Claude Sonnet 5 is built to be the most agentic Sonnet model yet. It can make plans, use tools like browsers and terminals, and run autonomously at a level that, just a few months ago, required larger and more expensive models. Claude Sonnet 5 旨在成为迄今为止最具代理能力的 Sonnet 模型。它能够制定计划、使用浏览器和终端等工具,并以几个月前还需要更大、更昂贵的模型才能达到的水平进行自主运行。
For many developers, the agentic AI era began with Sonnet-class models: Claude Sonnet 3.5, 3.6, and 3.7 were the first models that showed impressive skills in coding and tool use. More recently, though, the clearest gains in agentic capabilities have been in our Opus-class models. 对于许多开发者而言,代理 AI 时代始于 Sonnet 系列模型:Claude Sonnet 3.5、3.6 和 3.7 是首批在编程和工具使用方面展现出惊人技能的模型。然而,最近代理能力最显著的提升出现在我们的 Opus 系列模型中。
Sonnet 5 narrows the gap: its performance is close to that of Opus 4.8, but at lower prices. It’s a substantial improvement over its predecessor, Sonnet 4.6, on important aspects of agentic performance like reasoning, tool use, coding, and knowledge work. Sonnet 5 缩小了这一差距:其性能接近 Opus 4.8,但价格更低。与前代产品 Sonnet 4.6 相比,它在推理、工具使用、编程和知识工作等代理性能的重要方面有了实质性的提升。
Our safety assessments found that Sonnet 5 shows an overall lower rate of undesirable behaviors than Sonnet 4.6, and is generally safer to use in agentic contexts. Evaluations also show that it has a much lower ability to perform cybersecurity tasks than our current Opus models. 我们的安全评估发现,与 Sonnet 4.6 相比,Sonnet 5 的不良行为发生率总体较低,在代理场景中使用通常更安全。评估还显示,它执行网络安全任务的能力远低于我们目前的 Opus 模型。
From today, Claude Sonnet 5 is available across all plans: it is the default model for Free and Pro plans, and is available to Max, Team, and Enterprise users. It’s also available in Claude Code and on the Claude Platform, where it launches with introductory pricing of $2 per million input tokens and $10 per million output tokens through August 31, 2026, after which it will be priced at $3 per million input tokens and $15 per million output tokens. Developers can use claude-sonnet-5 via the Claude API. 从今天起,Claude Sonnet 5 已在所有计划中可用:它是 Free 和 Pro 计划的默认模型,并向 Max、Team 和 Enterprise 用户开放。它也可在 Claude Code 和 Claude Platform 上使用,推出时的优惠价格为每百万输入 token 2 美元,每百万输出 token 10 美元(有效期至 2026 年 8 月 31 日);此后价格将调整为每百万输入 token 3 美元,每百万输出 token 15 美元。开发者可以通过 Claude API 使用 claude-sonnet-5。
Working with Claude Sonnet 5
使用 Claude Sonnet 5
The charts below compare the performance of Sonnet 5 with Sonnet 4.6 and Opus 4.8 at different effort levels on the agentic search evaluation BrowseComp and the computer use evaluation OSWorld-Verified. Sonnet 5 (orange line) is a strict improvement over Sonnet 4.6 (gray line) and covers a much wider range of cost-performance options than Opus 4.8 (yellow line). It provides substantially improved cost efficiency at medium effort; its higher-effort performance can match Opus 4.8 on some tasks. Between Sonnet 5 and Opus 4.8, users can adjust the effort level to find the right balance of cost and performance. 下图比较了 Sonnet 5 与 Sonnet 4.6 和 Opus 4.8 在代理搜索评估 BrowseComp 和计算机使用评估 OSWorld-Verified 中不同努力程度下的性能。Sonnet 5(橙色线)较 Sonnet 4.6(灰色线)有显著提升,并涵盖了比 Opus 4.8(黄色线)更广泛的性价比选择。它在中等努力程度下提供了大幅提升的成本效率;其高努力程度下的性能在某些任务上可以媲美 Opus 4.8。用户可以在 Sonnet 5 和 Opus 4.8 之间调整努力程度,以找到成本与性能之间的最佳平衡点。
Feedback from our early access partners has been consistent: Sonnet 5 is much more agentic than its predecessors. Testers described how it finishes complex tasks where previous Sonnet models would stop short, how it checks its own output without explicitly being asked, and how it does all this agentic work at an attractive price point. 我们早期访问合作伙伴的反馈非常一致:Sonnet 5 比其前代产品更具代理能力。测试人员描述了它如何完成以前 Sonnet 模型无法完成的复杂任务,如何在未被明确要求的情况下检查自己的输出,以及它如何以极具吸引力的价格完成所有这些代理工作。