Statement on US government directive to suspend access to Fable 5 and Mythos 5
Statement on US government directive to suspend access to Fable 5 and Mythos 5
关于美国政府暂停 Fable 5 和 Mythos 5 访问权限指令的声明
Jun 12, 2026 2026年6月12日
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Anthropic models will not be affected. 美国政府援引国家安全授权,发布了一项出口管制指令,要求暂停任何外国公民(无论身处美国境内还是境外,包括 Anthropic 的外籍员工)对 Fable 5 和 Mythos 5 的所有访问权限。该指令的直接后果是,我们必须立即为所有客户禁用 Fable 5 和 Mythos 5 以确保合规。Anthropic 其他所有模型的访问权限将不受影响。
We received the directive from the government today at 5:21pm (ET). The letter did not provide specific details of its national security concern. Our understanding is that the government believes it has become aware of a method of bypassing, or “jailbreaking” Fable 5. We reviewed a demonstration of this specific technique being used to identify a small number of previously known, minor vulnerabilities. These vulnerabilities all appear relatively simple, and we have found that other publicly-available models are able to discover them as well without requiring a bypass. 我们于今日下午 5:21(美东时间)收到政府的指令。信函中并未提供有关其国家安全担忧的具体细节。据我们了解,政府认为其发现了一种绕过或“越狱”Fable 5 的方法。我们审查了该技术的演示,发现它被用于识别少量此前已知的轻微漏洞。这些漏洞看起来都相对简单,且我们发现其他公开可用的模型无需通过越狱也能发现这些漏洞。
Anthropic’s posture with respect to Fable’s safeguards, as laid out in our launch blog post, is the following: We have instituted strong safeguards that greatly reduce the likelihood that Fable is misused for tasks related to cybersecurity (among others). In fact, our safeguards are so strong that many users have complained that they are overly broad. 正如我们在发布博客中所述,Anthropic 对 Fable 安全防护措施的立场如下:我们已经建立了强大的安全防护机制,极大地降低了 Fable 被滥用于网络安全相关任务(以及其他任务)的可能性。事实上,我们的防护措施非常严密,以至于许多用户抱怨其限制范围过于宽泛。
In the weeks leading up to the launch of Fable, Anthropic worked with the US government, the UK AISI, multiple private third-party organizations and internal teams to red-team Fable’s safeguards for thousands of hours in total. These tests showed that Fable’s safeguards are substantially more effective than those of any previously deployed model. No testers have yet been able to find a universal jailbreak—a jailbreak method that can very broadly bypass the model’s safeguards, unblocking a wide range of cyber capabilities. 在 Fable 发布前的几周里,Anthropic 与美国政府、英国人工智能安全研究所(AISI)、多家私营第三方机构及内部团队合作,对 Fable 的安全防护进行了总计数千小时的红队测试。测试表明,Fable 的安全防护能力远高于此前部署的任何模型。目前还没有测试人员能够找到“通用越狱”方法——即一种能够广泛绕过模型安全防护、解锁多种网络攻击能力的越狱手段。
We suspect that perfect jailbreak resistance is not currently possible for any model provider. Every safeguard used in the industry is vulnerable to non-universal jailbreaks (which can elicit some cyber information in specific circumstances), and it is likely that universal jailbreaks will eventually be found in the future. We stated this clearly when we released Fable 5. 我们认为,目前没有任何模型提供商能够实现完美的越狱防御。行业内使用的每一项防护措施都容易受到非通用越狱的影响(在特定情况下可能诱导出部分网络信息),且未来很可能会出现通用越狱。我们在发布 Fable 5 时已明确说明了这一点。
Given that perfect jailbreak resistance does not appear to be possible today, Anthropic adopted a defense in depth strategy with Fable 5. We aimed to make jailbreaks either narrow (in the case of non-universal jailbreaks) or very expensive to produce (in the case of universal jailbreaks), and to combine this with thorough monitoring to quickly detect and shut down any successful attacks. This is also why Anthropic has required 30-day retention of customer data with Fable—a policy change that carries real costs for us with customers, but that allows us to research and mitigate jailbreaks. 鉴于目前似乎无法实现完美的越狱防御,Anthropic 在 Fable 5 上采取了纵深防御策略。我们的目标是使越狱要么局限在狭窄范围内(针对非通用越狱),要么使其产生的成本极其高昂(针对通用越狱),并结合全面的监控来快速检测并阻断任何成功的攻击。这也是 Anthropic 要求保留 Fable 客户数据 30 天的原因——这一政策调整虽然让我们在客户层面付出了实际成本,但它使我们能够研究并缓解越狱风险。
We stand by this defense in depth strategy. It reduces the risks posed by Fable, making them comparable to the risks of existing models already deployed across the industry. We have not even received a disclosure of a concerning non-universal potential jailbreak that led to a harmful result. The potential jailbreaks that have been disclosed to us are either entirely benign responses or are minor findings that provide no Mythos-specific uplift. 我们坚持这一纵深防御策略。它降低了 Fable 带来的风险,使其与行业内已部署的现有模型风险相当。我们甚至尚未收到任何关于导致有害结果的非通用潜在越狱的披露。目前向我们披露的潜在越狱要么是完全无害的响应,要么是无法提供任何 Mythos 特定提升的微小发现。
To date, the government has only given us verbal evidence of a potential narrow, non-universal jailbreak, which essentially consists of asking the model to read a specific codebase and fix any software flaws. Our understanding is that one potential jailbreak was shared with the government. We have reviewed a report that we believe is the basis of the government’s directive and validated that the level of capability displayed there is widely available from other models (including OpenAI’s GPT-5.5), and is used every day by the defenders who keep systems safe. We will share more details over the next 24 hours. 迄今为止,政府仅向我们提供了关于一种潜在的、狭窄的非通用越狱的口头证据,其本质是要求模型读取特定的代码库并修复软件缺陷。据我们了解,有一个潜在的越狱方法被分享给了政府。我们审查了一份我们认为是政府指令依据的报告,并验证了其中展示的能力水平在其他模型(包括 OpenAI 的 GPT-5.5)中广泛存在,且每天都被维护系统安全的防御者所使用。我们将在未来 24 小时内分享更多细节。
We are complying with the government’s legal directive and are removing access to Fable 5 and Mythos 5 for all users. However, we disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people. If this standard was applied across the industry, we believe it would essentially halt all new model deployments for all frontier model providers. 我们正在遵守政府的法律指令,并取消所有用户对 Fable 5 和 Mythos 5 的访问权限。然而,我们不同意将发现一个狭窄的潜在越狱作为召回已部署给数亿人的商业模型的理由。如果这一标准在全行业实施,我们认为这将实质上停止所有前沿模型提供商的新模型部署。
As we have stated publicly, we believe the government should have the ability to block unsafe deployments, as part of a statutory process that is transparent, fair, clear, and grounded in technical facts. This action does not adhere to those principles. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. 正如我们公开声明的那样,我们认为政府应当有能力阻止不安全的部署,但这必须作为透明、公平、清晰且基于技术事实的法定程序的一部分。此次行动并不符合这些原则。我们对给客户带来的不便深表歉意。我们相信这是一个误解,并正努力尽快恢复访问权限。