OpenAI unveils GPT-5.6 amid US AI regulatory drama
OpenAI unveils GPT-5.6 amid US AI regulatory drama
OpenAI 在美国 AI 监管风波中发布 GPT-5.6
Less than 24 hours after news broke that OpenAI would stagger its next model release at the request of the Trump administration, that model, GPT-5.6, is here. On Friday, the company unveiled the limited preview of its new GPT 5.6 model suite: Sol, the flagship; Terra, a medium-tier model for “high-volume work”; and Luna, a “fast and affordable” everyday model. OpenAI says it’s especially skilled at coding, cybersecurity, and biology, as well as staying focused during long-horizon agentic AI tasks.
在有消息称 OpenAI 应特朗普政府要求将推迟发布其下一代模型不到 24 小时后,该模型 GPT-5.6 便正式亮相。周五,该公司发布了其全新 GPT 5.6 模型套件的有限预览版:旗舰模型 Sol;用于“高负载工作”的中端模型 Terra;以及“快速且经济实惠”的日常模型 Luna。OpenAI 表示,该系列模型在编程、网络安全和生物学领域表现尤为出色,并能在长周期的智能体(Agentic)AI 任务中保持专注。
Per million tokens, GPT-5.6 Sol is priced at $5 input / $30 output (nearly half the cost of Anthropic’s Claude 5, which is $10 input / $50 output). Terra is half the cost of Sol, and Luna is less than half the cost of Terra. The company also debuted two additional modes for Sol: a “max” mode for deeper reasoning and an “ultra” mode for leveraging sub-agents — evoking OpenClaw, and perhaps a sign of OpenClaw creator Peter Steinberger’s work at OpenAI so far.
GPT-5.6 Sol 的定价为每百万 token 输入 5 美元 / 输出 30 美元(几乎是 Anthropic 公司 Claude 5 定价的一半,后者为输入 10 美元 / 输出 50 美元)。Terra 的成本是 Sol 的一半,而 Luna 的成本则不到 Terra 的一半。该公司还为 Sol 推出了两种额外模式:“max”模式用于更深度的推理,“ultra”模式用于调用子智能体——这让人联想到 OpenClaw,或许也标志着 OpenClaw 创始人 Peter Steinberger 在 OpenAI 目前的工作成果。
Unsurprisingly amid a security panic in Washington, D.C., OpenAI dedicated the majority of its announcement blog post to safety and potential misuse. It appeared to reference the recent jailbreaking travails of its rival Anthropic, writing that “GPT‐5.6 is trained to refuse prohibited cyber assistance, including when users attempt to disguise their intent or jailbreak the model.” It also said that flagship model Sol “is better at helping people find and fix vulnerabilities than reliably carrying out end-to-end attacks,” and that Sol doesn’t cross the cyber-critical threshold under OpenAI’s preparedness framework — though it should be noted that OpenAI recently revised its preparedness framework in April and removed some areas of previous study.
在华盛顿特区弥漫的安全恐慌氛围下,OpenAI 将其公告博客的大部分篇幅用于讨论安全性和潜在的滥用风险,这并不令人意外。文中似乎暗指了其竞争对手 Anthropic 最近遭遇的越狱困境,写道:“GPT-5.6 经过训练,能够拒绝违规的网络协助,包括用户试图伪装意图或对模型进行越狱的情况。”该公司还表示,旗舰模型 Sol “在帮助人们发现和修复漏洞方面表现更好,而不是可靠地执行端到端攻击”,并且 Sol 没有跨越 OpenAI 准备框架下的网络关键阈值——尽管需要注意的是,OpenAI 最近在 4 月份修订了其准备框架,并删除了此前研究的一些领域。
The company said Sol has the company’s “most robust safety stack to date” and that it “strengthened protections for higher-risk activity, sensitive cyber requests, and repeated misuse.” OpenAI said it had dedicated “approximately 700,000 A100e GPU hours” to automated red-teaming and also worked with third-party testers, the latter of which will continue to test it for the next two weeks.
该公司表示,Sol 拥有其“迄今为止最强大的安全堆栈”,并“加强了针对高风险活动、敏感网络请求和重复滥用行为的保护”。OpenAI 称,已投入“约 70 万小时的 A100e GPU 时间”用于自动化红队测试,并与第三方测试人员合作,后者将在未来两周内继续进行测试。
OpenAI also seemed to be taking an extra-sensitive approach during the preview period, which is being closely monitored by the Trump administration. The company wrote that “safeguards may occasionally intervene on legitimate work, particularly in dual-use areas where defensive and offensive activity can initially look similar. That is part of what the preview is designed to test.” The report earlier this week said that the Trump administration will approve customers on a case-by-case basis during the preview period.
在预览期间,OpenAI 似乎采取了格外谨慎的态度,该阶段正受到特朗普政府的密切监控。公司写道:“安全防护措施偶尔可能会干预合法工作,特别是在防御和攻击活动最初看起来相似的双重用途领域。这也是预览版旨在测试的内容之一。”本周早些时候的报道称,在预览期间,特朗普政府将对客户进行逐一审批。
OpenAI said the model suite should be generally available in the coming weeks because the company believes in “broad access,” and that the company cooperated with the US government ahead of this launch, but that it hopefully wouldn’t be the norm.
OpenAI 表示,该模型套件预计将在未来几周内全面开放,因为公司坚信“广泛访问”的理念。公司在发布前与美国政府进行了合作,但希望这不会成为常态。
“We don’t believe this kind of government access process should become the long-term default,” the company wrote. “It keeps the best tools from users, developers, enterprises, cyber defenders, and global partners who need them. We are taking this short-term step because we believe it is the strongest path to broader availability in the coming weeks, while we work with the Administration to develop the cyber Executive Order framework and a repeatable process for future model releases.”
“我们不认为这种政府准入流程应该成为长期的默认模式,”该公司写道。“这会让需要这些工具的用户、开发者、企业、网络防御者和全球合作伙伴无法获得最好的工具。我们采取这一短期措施,是因为我们相信这是在未来几周内实现更广泛可用性的最有力途径,同时我们也在与政府合作,制定网络行政命令框架以及未来模型发布的标准化流程。”