Introducing GPT-5.5
Introducing GPT-5.5
隆重推出 GPT-5.5
April 23, 2026 | Product Release | Introducing GPT‑5.5: A new class of intelligence for real work. 2026年4月23日 | 产品发布 | 隆重推出 GPT-5.5:赋能实际工作的新一代智能。
Update on April 24, 2026: GPT‑5.5 and GPT‑5.5 Pro are now available in the API. The system card has also been updated to describe the additional safeguards that apply. 2026年4月24日更新:GPT-5.5 和 GPT-5.5 Pro 现已在 API 中可用。系统卡片也已更新,详细说明了所采用的额外安全保障措施。
We’re releasing GPT‑5.5, our smartest and most intuitive to use model yet, and the next step toward a new way of getting work done on a computer. 我们正式发布 GPT-5.5,这是我们迄今为止最智能、最直观的模型,也是迈向计算机工作新方式的又一重要里程碑。
GPT‑5.5 understands what you’re trying to do faster and can carry more of the work itself. It excels at writing and debugging code, researching online, analyzing data, creating documents and spreadsheets, operating software, and moving across tools until a task is finished. Instead of carefully managing every step, you can give GPT‑5.5 a messy, multi-part task and trust it to plan, use tools, check its work, navigate through ambiguity, and keep going. GPT-5.5 能更快地理解您的意图,并能独立承担更多工作。它擅长编写和调试代码、在线研究、分析数据、创建文档和电子表格、操作软件,并能在不同工具间切换直至任务完成。您无需事无巨细地管理每一个步骤,只需将复杂的、多部分的任务交给 GPT-5.5,它就能自主规划、使用工具、核查工作、处理模糊信息并持续推进。
The gains are especially strong in agentic coding, computer use, knowledge work, and early scientific research—areas where progress depends on reasoning across context and taking action over time. GPT‑5.5 delivers this step up in intelligence without compromising on speed: larger, more capable models are often slower to serve, but GPT‑5.5 matches GPT‑5.4 per-token latency in real-world serving, while performing at a much higher level of intelligence. It also uses significantly fewer tokens to complete the same Codex tasks, making it more efficient as well as more capable. 在智能体编程、计算机操作、知识工作和早期科学研究等领域,GPT-5.5 的提升尤为显著——这些领域的发展依赖于跨上下文的推理和长期的行动能力。GPT-5.5 在提升智能水平的同时并未牺牲速度:通常情况下,更大、更强大的模型响应速度较慢,但 GPT-5.5 在实际应用中的每 Token 延迟与 GPT-5.4 持平,同时展现出更高水平的智能。此外,它在完成相同的 Codex 任务时使用的 Token 显著减少,使其在更强大的同时也更加高效。
We are releasing GPT‑5.5 with our strongest set of safeguards to date, designed to reduce misuse while preserving access for beneficial work. We evaluated this model across our full suite of safety and preparedness frameworks, worked with internal and external redteamers, added targeted testing for advanced cybersecurity and biology capabilities, and collected feedback on real use cases from nearly 200 trusted early-access partners before release. 我们在发布 GPT-5.5 时采用了迄今为止最严密的保障措施,旨在减少滥用的同时,确保其能用于有益的工作。我们在全套安全和准备框架下对该模型进行了评估,与内部和外部红队专家合作,针对高级网络安全和生物学能力进行了专项测试,并在发布前从近 200 家受信任的早期访问合作伙伴处收集了真实用例的反馈。
Today, GPT‑5.5 is rolling out to Plus, Pro, Business, and Enterprise users in ChatGPT and Codex, and GPT‑5.5 Pro is rolling out to Pro, Business, and Enterprise users in ChatGPT. API deployments require different safeguards and we are working closely with partners and customers on the safety and security requirements for serving it at scale. We’ll bring GPT‑5.5 and GPT‑5.5 Pro to the API very soon. 即日起,GPT-5.5 将向 ChatGPT 和 Codex 的 Plus、Pro、Business 和 Enterprise 用户推出,GPT-5.5 Pro 也将向 ChatGPT 的 Pro、Business 和 Enterprise 用户推出。API 部署需要不同的保障措施,我们正与合作伙伴和客户密切合作,以满足大规模服务所需的安全性要求。我们很快会将 GPT-5.5 和 GPT-5.5 Pro 引入 API。
Model capabilities
模型能力
OpenAI is building the global infrastructure for agentic AI, making it possible for people and businesses around the world to get work done with AI. Over the past year, we’ve seen AI dramatically accelerate software engineering. With GPT‑5.5 in Codex and ChatGPT, that same transformation is beginning to extend into scientific research and the broader work people do on computers. OpenAI 正在构建智能体 AI 的全球基础设施,使世界各地的人们和企业能够利用 AI 完成工作。在过去的一年中,我们见证了 AI 对软件工程的巨大推动作用。随着 GPT-5.5 在 Codex 和 ChatGPT 中的应用,这种变革正开始扩展到科学研究以及人们在计算机上进行的更广泛的工作中。
Across these domains, GPT‑5.5 is not just more intelligent; it is more efficient in how it works through problems, often reaching higher-quality outputs with fewer tokens and fewer retries. On Artificial Analysis’s Coding Index, GPT‑5.5 delivers state-of-the-art intelligence at half the cost of competitive frontier coding models. 在这些领域中,GPT-5.5 不仅更智能,而且解决问题的方式更高效,通常能以更少的 Token 和更少的重试次数获得更高质量的输出。在 Artificial Analysis 的编码指数中,GPT-5.5 以竞争对手前沿编码模型一半的成本提供了顶尖的智能水平。
Agentic coding
智能体编程
GPT‑5.5 is our strongest agentic coding model to date. On Terminal-Bench 2.0, which tests complex command-line workflows requiring planning, iteration, and tool coordination, it achieves a state-of-the-art accuracy of 82.7%. On SWE-Bench Pro, which evaluates real-world GitHub issue resolution, it reaches 58.6%, solving more tasks end-to-end in a single pass than previous models. On Expert-SWE, our internal frontier eval for long-horizon coding tasks with a median estimated human completion time of 20 hours, GPT‑5.5 also outperforms GPT‑5.4. GPT-5.5 是我们迄今为止最强大的智能体编程模型。在测试需要规划、迭代和工具协调的复杂命令行工作流的 Terminal-Bench 2.0 上,它达到了 82.7% 的顶尖准确率。在评估真实 GitHub 问题解决能力的 SWE-Bench Pro 上,它达到了 58.6%,在单次运行中端到端解决的任务数量超过了以往的模型。在 Expert-SWE(我们用于长周期编码任务的内部前沿评估,人类完成时间中位数为 20 小时)上,GPT-5.5 的表现也优于 GPT-5.4。
Across all three evals, GPT‑5.5 improves on GPT‑5.4’s scores while using fewer tokens. 在所有三项评估中,GPT-5.5 在使用更少 Token 的同时,得分均高于 GPT-5.4。
The model’s coding strengths show up especially clearly in Codex where it can take on engineering work ranging from implementation and refactors to debugging, testing, and validation. Early testing suggests GPT‑5.5 is better at the behaviors real engineering work depends on, like holding context across large systems, reasoning through ambiguous failures, checking assumptions with tools, and carrying changes through the surrounding codebase. 该模型的编码优势在 Codex 中表现得尤为明显,它可以承担从实现和重构到调试、测试和验证的各项工程工作。早期测试表明,GPT-5.5 在真实工程工作所依赖的行为上表现更好,例如在大型系统中保持上下文、通过模糊故障进行推理、使用工具检查假设以及在周围代码库中执行更改。
Beyond benchmarks, early testers said GPT‑5.5 shows a stronger ability to understand the shape of a system: why something is failing, where the fix needs to land, and what else in the codebase would be affected. 除了基准测试外,早期测试人员表示,GPT-5.5 在理解系统架构方面表现出更强的能力:能够理解故障原因、确定修复位置,以及预判代码库中其他受影响的部分。
“The first coding model I’ve used that has serious conceptual clarity.” “这是我用过的第一个具有真正概念清晰度的编码模型。”
Dan Shipper, Founder and CEO of Every, described GPT‑5.5 as “the first coding model I’ve used that has serious conceptual clarity.” Every 的创始人兼首席执行官 Dan Shipper 将 GPT-5.5 描述为“我用过的第一个具有真正概念清晰度的编码模型”。