Introducing computer use in Gemini 3.5 Flash
Introducing computer use in Gemini 3.5 Flash
在 Gemini 3.5 Flash 中引入计算机使用功能
Computer use is now a built-in tool in Gemini 3.5 Flash to build agents that can interact across platforms. 计算机使用功能现已成为 Gemini 3.5 Flash 的内置工具,旨在构建能够跨平台交互的智能体。
Computer use is now a built-in tool supported in Gemini 3.5 Flash, delivering our best performance yet for agentic computer use tasks. Previously only available as a standalone Gemini 2.5 computer use model, computer use is now integrated natively in the main Gemini Flash model. 计算机使用功能现已成为 Gemini 3.5 Flash 支持的内置工具,为智能体计算机使用任务提供了我们迄今为止最佳的性能。此前,该功能仅作为独立的 Gemini 2.5 计算机使用模型提供,现在已原生集成到 Gemini Flash 主模型中。
Gemini already excels at function calling and using built-in tools like Search and Maps grounding. With built-in computer use capability, developers can now use 3.5 Flash to reliably build custom agents that can see, reason and take action across browser, mobile and desktop environments. This unlocks improved performance for long-horizon and enterprise automation tasks like continuous software testing and knowledge work across professional applications. Gemini 在函数调用以及使用搜索和地图定位等内置工具方面表现卓越。借助内置的计算机使用能力,开发者现在可以使用 3.5 Flash 可靠地构建自定义智能体,使其能够在浏览器、移动端和桌面环境中进行观察、推理并采取行动。这为长周期任务和企业自动化任务(如持续软件测试和跨专业应用的知识工作)带来了性能提升。
Developers and enterprises can start using computer use in 3.5 Flash via the Gemini API and Gemini Enterprise Agent Platform. 3.5 Flash uses computer use to analyse the Gemini app and return a categorized list of features. 3.5 Flash with computer use audits its own documentation for accessibility issues. 开发者和企业可以通过 Gemini API 和 Gemini Enterprise Agent Platform 开始在 3.5 Flash 中使用计算机使用功能。3.5 Flash 利用计算机使用功能来分析 Gemini 应用并返回分类的功能列表。具备计算机使用功能的 3.5 Flash 还能审计其自身的文档,以发现可访问性问题。
Making computer use safe in 3.5 Flash: To mitigate some of the prompt injection risks for agents operating in live environments, we use targeted adversarial training for computer use in Gemini 3.5 Flash. We’re also releasing two optional enterprise safeguard systems that enable enterprises to: Require explicit user confirmation for sensitive or irreversible actions; Automatically stop tasks if an indirect prompt injection is identified. 确保 3.5 Flash 中计算机使用的安全性:为了减轻智能体在实时环境中运行时的部分提示词注入风险,我们针对 Gemini 3.5 Flash 的计算机使用功能采用了针对性的对抗训练。我们还发布了两个可选的企业级安全防护系统,使企业能够:要求用户对敏感或不可逆的操作进行明确确认;在识别到间接提示词注入时自动停止任务。
Taking a “defense-in-depth” approach, we encourage developers to combine these features with secure sandboxing, human-in-the-loop verification and strict access controls. Additional information on safety measures can be found in our best practices documentation. 我们采取“纵深防御”策略,鼓励开发者将这些功能与安全沙箱、人工介入验证以及严格的访问控制相结合。有关安全措施的更多信息,请参阅我们的最佳实践文档。
We are already seeing customers drive value with computer use. Here’s what some of them have to say: 我们已经看到客户通过计算机使用功能创造了价值。以下是其中一些客户的反馈:
To start building with computer use today: Try it now: Test the capabilities in a demo environment hosted by Browserbase. Start building: Dive into our reference implementation and documentation via Gemini API and Gemini Enterprise Agent Platform. 立即开始使用计算机使用功能进行构建:立即试用:在 Browserbase 托管的演示环境中测试各项功能。开始构建:通过 Gemini API 和 Gemini Enterprise Agent Platform 深入了解我们的参考实现和文档。