AI News Daily - 2026-05-20

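Today’s Highlights

Google I/O 2026 headline launches: Google is moving fully into the “agentic” era, releasing the Gemini 3.5 Flash model and giving the search box its largest AI overhaul in 25 years, with the aim of making it the hub for handling any task.
AI talent moves and legal battles: Andrej Karpathy has joined Anthropic; Elon Musk lost his lawsuit against OpenAI, which was found to be barred by the statute of limitations.
Enterprise AI adoption accelerates: KPMG, PwC, and other large firms are deploying Claude at scale; OpenAI and Dell are partnering to bring Codex into on-premises enterprise environments, as AI reshapes finance and consulting.
Security and compliance challenges: a CISA contractor leaked AWS GovCloud keys, raising security concerns; npm was hit by a large-scale malicious-package attack; and AI content provenance (for example, SynthID) is becoming an industry consensus.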

Hacker News

I’ve joined Anthropic

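Andrej Karpathy announced that he has joined Anthropic. Given his influence as an AI researcher, the move is seen as an important sign that Anthropic is strengthening its technical bench in its competition with OpenAI.

The last six months in LLMs in five minutes

In a five-minute lightning talk at PyCon US 2026, the author summarized the key developments in large language models (LLMs) over the past six months, tracing the technical progression through annotated slides.

Apple unveils new accessibility features

Apple announced new accessibility features powered by Apple Intelligence, including natural-language navigation improvements for VoiceOver, Magnifier, and Voice Control, plus a new capability that lets Vision Pro users control power wheelchairs.

I’ve built a virtual museum with nearly every operating system you can think of

A developer built a virtual operating-system museum: a Linux virtual machine preloaded and configured with nearly every mainstream and classic operating system, along with a custom launcher that supports snapshots for quick restores.

Show HN: Gaussian Splat of a Strawberry

A Gaussian splatting scene of a strawberry, showing a high-quality 3D reconstruction built from photos taken at multiple angles.

Gemini 3.5 Flash

Google released the Gemini 3.5 Flash model, which performs strongly on coding and agentic tasks and serves as the core engine of Google’s agentic AI strategy.

Click (2016)

A simple click-counter web page, with a current score of 363, that prompted community discussion about simple interaction design.

OpenBSD 7.9

OpenBSD released version 7.9 with a range of system updates and feature enhancements, maintaining its leading position in security and code quality.

CISA Admin Leaked AWS GovCloud Keys on GitHub

A CISA contractor exposed highly sensitive AWS GovCloud credentials and internal system information in a public GitHub repository, creating a serious security exposure that has drawn close attention from security experts.

Mini Shai-Hulud Strikes Again: 314 npm Packages Compromised

The npm account “atool” was compromised, and the attacker published 637 malicious package versions within 22 minutes, affecting several high-download projects including size-sensor and echarts-for-react.

Tesla’s lithium refinery discharges 231,000 gallons of polluted wastewater a day

Tesla’s lithium refinery in Texas was found to be discharging about 231,000 gallons of polluted wastewater per day; workers from the local drainage district discovered the unauthorized discharge pipe during routine maintenance.

Peter Neumann has died

Peter G. Neumann, a pioneer of computer security, has died. He made outstanding contributions to system security and risk analysis.

Pope Leo XIV’s first encyclical Magnifica humanitas to be published May 25

Pope Leo XIV will soon publish his first encyclical, Magnifica humanitas, which focuses on safeguarding human dignity in the age of artificial intelligence.

Google has substantially redesigned its search box with deep Gemini AI integration, marking a shift in the search experience from keyword matching to interaction with AI agents.

Minnesota becomes first state to ban prediction markets

Minnesota became the first U.S. state to ban prediction markets, prompting broad debate about the boundary between financial innovation and regulation.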

Google just declared itself a contender in AI design at IO 2026

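At I/O 2026, Google showed its ambitions in AI design, releasing easy-to-use AI apps aimed at teachers and small-business owners.

You can now talk to your Gmail inbox, as seen at Google IO 2026

Google expanded Gmail’s AI Inbox with conversational voice search, letting users ask Gemini directly for specific details in their email.

How to use Google’s new AI agents to go beyond your standard searches

Google introduced AI-powered “information agents” that monitor chosen topics in the background and proactively push updates and changes to users, going beyond the traditional search model.

Discord enables end-to-end encrypted voice and video calling for every user

Discord announced end-to-end encrypted voice and video calls for all users, so that not even Discord itself can listen in on their communications.

Mach Industries just spent $50M to solve a major defense tech problem

Defense tech company Mach Industries spent $50 million on an acquisition intended to improve the unit economics of its five vehicle programs as it works to meet rapidly growing demand.

From teen hacker to Iron Dome researcher, this founder raised $28M to fight AI phishing

Ocean, a platform focused on AI email security, recently raised $28 million to use agentic technology against increasingly sophisticated AI phishing attacks.

Elon Musk said Sam Altman “stole” a non-profit — but the trial showed he had similar aims

The trial in Elon Musk’s suit against Sam Altman showed that, although Musk accused Altman of stealing a non-profit, the evidence indicated that the two held highly similar goals for AI development.

Google takes a page out of Meta’s book, announces new audio-powered smart glasses

Google announced new audio smart glasses that let users interact with the Gemini ecosystem by voice to handle everyday tasks.

Google’s Genie world model can now simulate real streets with Street View

Google DeepMind integrated Street View with Project Genie to create immersive, interactive world simulations, with applications in robotics, gaming, and travel.

With Gemini 3.5 Flash, Google bets its next AI wave on agents, not chatbots

At I/O, Google released Gemini 3.5 Flash, billing it as its most powerful coding and agentic model, able to carry out complex tasks autonomously and build software from scratch.

Demis Hassabis said this might be the ‘foothills of the singularity.’ What?

At I/O, Google DeepMind CEO Demis Hassabis said humanity may be in the “foothills of the singularity,” emphasizing the enormous potential of AGI.

We react to Google I/O 2026

The Vergecast team went live after I/O to discuss a string of striking announcements, from Gmail voice bots to talk of the singularity being near.

The article argues that Google’s vision is to turn the search box into an all-purpose AI assistant that does not just retrieve information but completes tasks directly for the user.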

Nintendo’s $500 Switch 2 bundle includes a game, and it’s available now

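Nintendo launched a $499.99 Switch 2 “Choose Your Game” bundle, now available at multiple retailers.

Google’s AI future demands trust — and your personal data

Google showed AI agents such as Gemini Spark at I/O, but the article notes that their convenience depends heavily on users trusting Google and opening up their personal data.

Here are our favorite Memorial Day deals (so far)

A roundup of Memorial Day tech deals, including portable Bluetooth speakers and other gear suited to summer outdoor use.

Democrats preview how they’d go after the Ticketmaster settlement if they regain power

Democrats voiced strong dissatisfaction with the Justice Department’s settlement with Live Nation-Ticketmaster and previewed how they would revisit such antitrust agreements if they win the November elections.

Ugreen’s new soccer ball-shaped tracker has up to 7 years of battery life

Ugreen released the FineTrack 2 tracker, which has a distinctive soccer-ball design, supports Apple Find My, and offers up to 7 years of battery life.

Kickstarter just killed its new mature content rules

Kickstarter withdrew the content guidelines it published last week and restored its previous policy; the new rules had drawn controversy over restrictions on adult wellness products.

Gemini will use Volvo’s external cameras to interpret parking signs

Google and Volvo are working together to give Gemini access to the EX60 SUV’s external cameras, helping the vehicle recognize nearby road signs and explain them to the owner.

FBI seeks US-wide access to license plate cameras, wants “data in near real time”

The FBI plans to pay vendors for access to license-plate-reader camera data nationwide, with the aim of tracking vehicles in near real time.

Spider-Noir final trailer gives us a classic villain

Spider-Noir released its final trailer, which shows off a classic villain.

“I’ll buy 10 of those”—NASA science chief yearns for mass-produced satellites

NASA’s science chief expressed a desire for mass-produced satellites, aiming to lower costs and increase the number of space science missions.

Plex’s 200% Lifetime Pass price hike tries forcing users to another subscription

Plex raised the price of its Lifetime Pass by 200%, a move seen as an attempt to steer users toward its subscription plans.

Two AI-based science assistants succeed with drug-retargeting tasks

Two AI science assistants succeeded at drug-retargeting tasks, generating hypotheses and analyzing some of the experimental data.

Google’s SynthID AI watermarking tech is being adopted by OpenAI, Nvidia, and more

Google’s SynthID AI watermarking technology is being adopted by OpenAI, Nvidia, and others to help distinguish AI-generated content from authentic content.

In stunning display of stupid, secret CISA credentials found in public GitHub repo

Sensitive CISA credentials, including SSH keys and plaintext passwords, were found to have been exposed for an extended period in a public GitHub repository, to the dismay of security experts.

RFK Jr. forced to withdraw charter that opened CDC panel to anti-vaccine quacks

Robert F. Kennedy Jr. was forced to withdraw a charter that would have broadened eligibility for a CDC panel and focused on so-called “vaccine injuries.”

Gemini 3.5 Flash might be fast enough for gen AI to make sense

The article argues that the efficiency of Google’s Gemini 3.5 Flash model may be what finally makes generative AI practical.

The era of 1,000 Hz gaming monitors has arrived, but why?

LG introduced a gaming monitor with a 1,000 Hz refresh rate; the article examines whether such an extreme refresh rate is needed in actual play.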

CaseGap AI

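An AI tool that helps law firms find and fix revenue leaks.

Insights by Omnia

A tool that provides step-by-step action plans to help users improve their AI visibility.

VWFNDR™ + MBL

A tool that captures raw photos and offers verification that a photo is authentic rather than AI-generated.

Agora-1 by Odyssey

An interactive multi-agent world model.

CLI Market

A platform that gives AI agents a unified API across 3,760 retailers.

Trainer

A tool for training AI agents by recording your screen.

PollyReach

Gives AI agents real phone numbers and voice capabilities so they can place calls.

Motion

A video agent for motion design.

Mantle Chat

A platform where teams collaborate and work alongside AI.

Drizz

A mobile testing tool that writes, runs, and repairs its own tests automatically.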

Roundtables: Inside the Musk v. Altman Trial

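MIT Technology Review covers the trial in Musk’s suit against OpenAI and brings in legal experts for in-depth analysis.

Understanding the modern cybercrime landscape

An HPE report finds that cybercrime is industrializing, with criminals using AI and automation to sharply increase the scale and speed of attacks.

The Download: Musk v. Altman, smart glasses for warfare, and Google I/O

A daily tech briefing covering the outcome of the Musk v. Altman trial, smart glasses for warfare, and highlights from Google I/O.

Colossal Biosciences is growing chickens in a 3D-printed artificial eggshell

Biotech company Colossal Biosciences has developed a 3D-printed “artificial eggshell” for hatching chicks, part of its plan to bring back extinct birds.

Here’s why Elon Musk lost his suit against OpenAI

Elon Musk’s suit against OpenAI failed because it was filed after the statute of limitations had run; Musk said he plans to appeal.

What to expect from Google this week

An analysis of the challenges facing Google ahead of I/O, arguing that it sits third in the foundation-model race and urgently needs new products to regain the lead.

The Signals That Matter – MIT Insider’s Panel

A panel of MIT insiders discusses the most important signals in technology today.

Inside Anduril and Meta’s quest to make smart glasses for warfare

Anduril and Meta are jointly developing a military augmented-reality headset designed to enable drone strikes through eye tracking and voice commands.

The Download: Musk v. Altman week 3, and Trump’s tech trading

The briefing recaps the final week of the Musk v. Altman trial and Trump’s tech trading.

Musk v. Altman week 3: Elon Musk and Sam Altman traded blows over each other’s credibility.

A detailed recap of the heated courtroom exchanges between Musk and Altman; in the end the jury sided with OpenAI, finding that the suit was filed too late.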

tinyhumansai / openhuman

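A personal AI superintelligence that emphasizes privacy, simplicity, and power.

HKUDS / CLI-Anything

A CLI tool that aims to make all software “agent-native.”

Imbad0202 / academic-research-skills

An academic research skills framework for Claude Code, covering research, writing, review, revision, and finalization.

obra / superpowers

An agentic skills framework and software development methodology.

anthropics / claude-plugins-official

The official, Anthropic-managed directory of Claude Code plugins.

rohitg00 / agentmemory

A persistent memory system for AI coding agents, based on real-world benchmarks.

CloakHQ / CloakBrowser

A stealth Chromium browser described as passing every bot-detection test and as a drop-in replacement for Playwright.

rtk-ai / rtk

A CLI proxy that reduces LLM token consumption on development commands by 60-90%.

msitarzewski / agency-agents

A complete AI agency toolkit, with specialist agents covering everything from front-end development to community operations.

colbymchenry / codegraph

A pre-indexed code knowledge graph for tools such as Claude Code and Cursor, designed to cut token consumption and improve efficiency.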

Advancing content provenance for a safer, more transparent AI ecosystem

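OpenAI is advancing AI content provenance, using tools such as Content Credentials and SynthID to help users identify and trust AI-generated media.

OpenAI and Dell partner to bring Codex to hybrid and on-premise enterprise environments

OpenAI and Dell are partnering to bring Codex to hybrid-cloud and on-premises enterprise environments, helping companies deploy AI coding agents securely.

OpenAI and Malta partner to bring ChatGPT Plus to all citizens

OpenAI is partnering with the government of Malta to provide ChatGPT Plus and AI skills training to all citizens.

How sales teams use Codex

Shows how sales teams use Codex to prepare meeting materials, run forecast analyses, and handle stalled deals.

Databricks brings GPT-5.5 to enterprise agent workflows

Databricks is bringing GPT-5.5 to enterprise agent workflows; the model set a new high on the OfficeQA Pro benchmark.

How business operations teams use Codex

Shows how business operations teams use Codex to create strategy updates and leadership decision packets.

A new personal finance experience in ChatGPT

ChatGPT is rolling out a personal finance experience for Pro users in the U.S., with secure connections to financial accounts and AI insights based on financial goals.

How data science teams use Codex

Shows how data science teams use Codex to build root-cause analyses and KPI memos.

Sea’s View on the Future of Agentic Software Development with Codex

Sea Limited’s CPO explains why the company is deploying Codex across its engineering teams to accelerate AI-native software development.

Work with Codex from anywhere

Users can now monitor, steer, and approve coding tasks from anywhere through the ChatGPT mobile app.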

Introducing Claude Opus 4.7

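Anthropic released Claude Opus 4.7, which is stronger at coding, agentic tasks, and vision, and is more stable.

Introducing Claude Design by Anthropic Labs

Introduces Claude Design, which lets users collaborate with Claude to create presentations, prototypes, and design documents.

Claude is a space to think

Anthropic commits to keeping Claude ad-free, arguing that advertising incentives conflict with the core value of an AI assistant.

KPMG integrates Claude across its core business and workforce of more than 276,000 in strategic alliance

KPMG and Anthropic formed a strategic alliance to integrate Claude into KPMG’s core business and across its global workforce of 276,000.

Anthropic acquires Stainless

Anthropic announced its acquisition of Stainless.

PwC is deploying Claude to build technology, execute deals, and reinvent enterprise functions for clients

PwC is deploying Claude to build technology solutions, execute deals, and reinvent enterprise functions.

Anthropic forms $200 million partnership with the Gates Foundation

Anthropic formed a $200 million partnership with the Gates Foundation.

Introducing Claude for Small Business

Introduces a Claude offering for small businesses.

Higher usage limits for Claude and a compute deal with SpaceX

Raises Claude’s usage limits and announces a compute deal with SpaceX.

Agents for financial services

Introduces agent solutions for the financial services industry.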

I/O 2026

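A recap of Google I/O 2026, showing how AI can be made more helpful for everyone.

How AI Mode is changing the way people search in the U.S.

One year after AI Mode launched, data shows users shifting from keyword searches to natural-language queries.

New ways to create and get things done in Google Workspace

Google Workspace adds voice features, launches the design tool Google Pics, and updates AI Inbox.

I/O 2026: Welcome to the agentic Gemini era

Sundar Pichai announced the arrival of the agentic Gemini era, emphasizing that AI will help users get more done.

Everything new in our Google AI subscriptions, fresh from I/O 2026

Introduces a $100 AI Ultra subscription plan and updates the benefits of existing AI subscriptions.

The new AI-powered Google Finance is expanding to Europe.

The AI-powered Google Finance is expanding to Europe with local-language support.

See what happens when creative legends use AI to make ads for small businesses.

Launches “The Small Brief,” a program that invites advertising legends to use AI to make ads for small businesses.

5 gardening tips you can try right in Search

Explains how Google’s AI Mode and Search Live can help your plants grow.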

OlmoEarth v1.1: A more efficient family of models

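Releases the OlmoEarth v1.1 model family, with an emphasis on greater efficiency.

Introducing the Ettin Reranker Family

Introduces the Ettin family of reranker models.

Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation

Explains how to fine-tune NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for robot video generation.

PaddleOCR 3.5: Running OCR and Document Parsing Tasks with a Transformers Backend

PaddleOCR 3.5 supports running OCR and document-parsing tasks with a Transformers backend.

The Open Agent Leaderboard

Announces the Open Agent Leaderboard.

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context

Releases the Granite multilingual embedding model, which supports a 32K context and performs well among models under 100M parameters.

Unlocking asynchronicity in continuous batching

Explores ways to unlock asynchronicity in continuous batching.

Building Blocks for Foundation Model Training and Inference on AWS

Describes the building blocks for foundation-model training and inference on AWS.

vLLM V0 to V1: Correctness Before Corrections in RL

Discusses vLLM’s evolution from V0 to V1, emphasizing correctness in reinforcement learning.

Adding Benchmaxxer Repellant to the Open ASR Leaderboard

Adds anti-benchmark-gaming measures to the Open ASR Leaderboard.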

After Orthogonality: Virtue-Ethical Agency and AI Alignment

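Examines how goals are set for rational humans and rational AI, and proposes a practice-based perspective on alignment.

AGI Is Not Multimodal

Argues that multimodal language models alone cannot achieve AGI because they overlook the importance of embodied understanding in human intelligence.

Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research

Explores the changing role of mathematics in modern machine learning research, from principled architectures toward compute-intensive engineering.

What’s Missing From LLM Chatbots: A Sense of Purpose

Points out that today’s LLM chatbots score highly on benchmarks but lack a real “sense of purpose.”

We Need Positive Visions for AI Grounded in Wellbeing

Calls for positive visions of AI grounded in human wellbeing.

Financial Market Applications of LLMs

Explores the potential applications of LLMs in financial markets.

A Brief Overview of Gender Bias in AI

A brief overview of gender bias in AI.

Mamba Explained

Explains the Mamba model, an alternative to the Transformer that is more efficient on long sequences.

Car-GPT: Could LLMs finally make self-driving cars happen?

Explores the prospects and challenges of applying LLMs to self-driving.

Do text embeddings perfectly encode text?

Introduces the “Vec2text” tool, which can recover text from embeddings, underscoring the importance of securing embedding data.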

AgentWall: A Runtime Safety Layer for Local AI Agents

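Proposes AgentWall, a runtime safety layer for local AI agents.

ANNEAL: Adapting LLM Agents via Governed Symbolic Patch Learning

Proposes ANNEAL, which adapts LLM agents through governed symbolic patch learning.

From Prompts to Protocols: An AI Agent for Laboratory Automation

Presents an AI agent for laboratory automation, intended to speed up scientific discovery.

Skim: Speculative Execution for Fast and Efficient Web Agents

Proposes Skim, a speculative-execution framework that makes web agents more efficient.

Scalable Uncertainty Reasoning in Knowledge Graphs

Explores scalable uncertainty reasoning in knowledge graphs.

Counterparty Modeling is Not Strategy: The Limits of LLM Negotiators

Studies the limits of LLM agents in negotiation, showing that modeling the counterparty is not the same as having a strategy.

PRISMat: Policy-Driven, Permutation-Invariant Autoregressive Material Generation

Proposes PRISMat, a policy-driven model for material generation.

TTE-Flash: Accelerating Reasoning-based Multimodal Representations via Think-Then-Embed Tokens

Proposes TTE-Flash, which accelerates multimodal representations using “think-then-embed” tokens.

The Scaling Laws of Skills in LLM Agent Systems

Studies the scaling laws of skills in LLM agent systems.

PQR: A Framework to Generate Diverse and Realistic User Queries that Elicit QA Agent Failures

Proposes the PQR framework for generating diverse, realistic user queries that expose failures in QA agents.

Scaling Accessible Mathematics on arXiv: HTML Conversion and MathML 4

Reports on arXiv’s progress on HTML conversion and MathML 4 to improve the accessibility of mathematical papers.

Beyond Sentiment Classification: A Generative Framework for Emotion Intensity Evaluation in Text

Proposes a generative framework for evaluating emotion intensity in text.

SKG-Eval: Stateful Evaluation of Multi-Turn Dialogue via Incremental Semantic Knowledge Graphs

Proposes SKG-Eval, which performs stateful evaluation of multi-turn dialogue using incremental semantic knowledge graphs.

A Scalable Tool for Measuring Manner and Result Verbs in Developmental Language Research

Develops a scalable tool for measuring manner and result verbs in developmental language research.

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

Proposes CHI-Bench, which evaluates how well AI agents can automate complex healthcare workflows.

Language Acquisition Device in Large Language Models

Explores language-acquisition mechanisms in LLMs, with the aim of improving data efficiency.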

California’s Wildfire Season Is Already Overactive

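California’s wildfire season has arrived early, with a hot, dry winter driving a surge in fire risk.

Everything Announced at Google I/O 2026: Gemini, Search, Smart Glasses

A full roundup of Google I/O 2026: Gemini model upgrades, the search overhaul, and new smart glasses.

Meta Employees Are Scrambling to Use Up Benefits Ahead of Layoffs

Meta employees are rushing to use up benefits, including headphone stipends, ahead of layoffs.

Google Makes It Easy to Deepfake Yourself

Google updated its AI creation software Flow, adding video models and an “avatar” tool for generating selfie videos.

Demis Hassabis Thinks AI Job Cuts Are Dumb

DeepMind CEO Demis Hassabis argues that companies should use AI to raise productivity rather than cut jobs.

Google Search Goes Agentic—and Doesn’t Need You Anymore

Google Search is going fully agentic, aiming for a highly personalized and automated experience.

Hands-On With All of Google’s New Upcoming Android XR Smart Glasses

A hands-on look at Warby Parker and Gentle Monster smart glasses running on the XR platform from Google and Samsung.

Gemini Spark Is Google’s Response to OpenClaw’s 24/7 AI Agent

Gemini Spark is Google’s response to OpenClaw’s 24/7 AI agent, designed to handle tasks such as payments and email.

Former OpenAI Staffers Warn That xAI’s Poor Safety Record Could Complicate SpaceX’s IPO

Former OpenAI employees warn that xAI’s safety record could affect SpaceX’s IPO.

The Zuckerbergs Are Hiring a Lifeguard but Calling It a ‘Beach Water Person’

The Zuckerberg family office is hiring a lifeguard in Hawaii under the job title “Beach Water Person.”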

Type out the code

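Discusses the value of typing out code by hand for learning and understanding.

My domain got abused on Github Pages

Recounts the experience of having a domain abused on GitHub Pages.

What would you want from a forge?

Explores what users want from a code-hosting platform (a forge).

Software’s Centaur Era

Explores software development’s “centaur era” of human-machine collaboration.

The Quiet Renovation at Bitwarden

Discusses internal changes at Bitwarden.

Comprehensive Response to Bambu’s AGPLv3 Violations

A comprehensive response to Bambu’s violations of the AGPLv3 license.

The Super Tiny Compiler, but in Ada

A super tiny compiler implemented in Ada.

Better generated branch names with jj

Discusses how to generate better branch names with jj.

How we used Quint to find over 10 bugs in SQLite while hardening Turso

Describes how Quint was used to find more than 10 bugs in SQLite.

Your Outlier Detection is Lying to You

Explores the limitations of DBSCAN on high-dimensional data, along with alternatives.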
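
One commonly cited reason that distance-based methods such as DBSCAN struggle in high dimensions is distance concentration: the nearest and farthest neighbors of a point end up almost equally far away, so a single radius threshold stops separating dense regions from sparse ones. Whether the article makes exactly this argument is an assumption; the sketch below only illustrates the general effect on uniform random data, and the function name `distance_contrast` is hypothetical.

```python
import math
import random


def distance_contrast(dim, n_points=200, seed=0):
    """Relative gap (max - min) / min between distances from one query point
    to n_points uniform random points in the unit cube of the given dimension."""
    rng = random.Random(seed)
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))
    return (max(dists) - min(dists)) / min(dists)


low = distance_contrast(dim=2)
high = distance_contrast(dim=1000)
print(f"relative contrast in 2-D:    {low:.2f}")
print(f"relative contrast in 1000-D: {high:.2f}")
```

In two dimensions the farthest point is many times farther away than the nearest one, while in 1,000 dimensions the gap shrinks to a small fraction of the nearest distance, which is the regime where a fixed neighborhood radius becomes hard to choose.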

I built a self-hosted Linux fleet manager with no database and zero pip dependencies

A developer built a self-hosted Linux fleet manager that needs no database and has zero pip dependencies.

DevOps Dash: practice incident response on your phone

A mobile DevOps simulator that helps users practice incident response.

Is Python Beneficial in the Future? Career, Salary & Industry Outlook (2026–2030)

Analyzes Python’s career prospects and industry standing for 2026-2030.

I Built an AI App That Keeps You Consistent (Not Just Motivated) 🚀

Introduces Momentum AI, an AI app designed to help users stay consistent.

What Nanochon’s Series A Tells Us About the Bio 3D Printing Commercialization Threshold

Analyzes what Nanochon’s Series A means for the commercialization of bio 3D printing.

Understanding Solana’s Account Model From a Web2 Perspective

An in-depth look at Solana’s account model from a Web2 perspective.

Why There’s a Tanker in Central Madrid

Explores the challenges of processing AIS data.

Harness Engineering: The ‘New Software’ in the AI Era?

Asks whether harness engineering will become the “new software” of the AI era.

Keccak256 From Scratch in 200 Lines of Kotlin (Because Web3j Was 8 MB)

To shrink APK size, a developer implemented the Keccak256 hash function from scratch in 200 lines of Kotlin.

Meta Engineering

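Describes how Meta built “Friend Bubbles,” a social discovery feature that serves billions of users.

Migrating Data Ingestion Systems at Meta Scale

Shares Meta’s experience with the architectural upgrade and large-scale migration of its data ingestion systems.

Labyrinth 1.1: Making End-to-End Encrypted Backups Even More Reliable

Announces Labyrinth 1.1, which improves the reliability of Messenger’s end-to-end encrypted backups.

How Meta Is Strengthening End-to-End Encrypted Backups

Describes how Meta uses an HSM-based backup key vault to strengthen end-to-end encrypted backups.

Modernizing the Facebook Groups Search to Unlock the Power of Community Knowledge

Meta upgraded Facebook Groups search with a hybrid retrieval architecture to make community content easier to discover.

Capacity Efficiency at Meta: How Unified AI Agents Optimize Performance at Hyperscale

Describes how Meta uses an AI agent platform to optimize the performance of hyperscale infrastructure.

Post-Quantum Cryptography Migration at Meta: Framework, Lessons, and Takeaways

Shares Meta’s framework and lessons from its post-quantum cryptography migration.

Escaping the Fork: How Meta Modernized WebRTC Across 50+ Use Cases

Describes how Meta resolved the fork of its internal WebRTC version and got back in sync with upstream.

Trust But Canary: Configuration Safety at Scale

Explores how Meta uses canary releases and progressive rollouts to keep configuration changes safe.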
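
For readers unfamiliar with the pattern, a canary-style progressive rollout can be sketched roughly as follows. This is a generic illustration, not Meta’s system: the function `progressive_rollout`, its callbacks, the stage fractions, and the host names are all hypothetical.

```python
def progressive_rollout(hosts, apply_change, revert_change, is_healthy,
                        stages=(0.01, 0.10, 0.50, 1.0)):
    """Apply a change to growing fractions of hosts, reverting everything on failure."""
    updated = []
    for fraction in stages:
        target = max(1, round(len(hosts) * fraction))
        # Extend the rollout from the hosts already updated up to this stage's target.
        for host in hosts[len(updated):target]:
            apply_change(host)
            updated.append(host)
        # Gate the next stage on the health of every host touched so far.
        if not all(is_healthy(host) for host in updated):
            for host in reversed(updated):
                revert_change(host)
            return {"status": "rolled_back", "failed_at": fraction, "touched": len(updated)}
    return {"status": "complete", "touched": len(updated)}


# Hypothetical fleet: host-7 rejects the new config, so the 10% stage fails.
hosts = [f"host-{i}" for i in range(100)]
live_config = {}
result = progressive_rollout(
    hosts,
    apply_change=lambda h: live_config.__setitem__(h, "v2"),
    revert_change=lambda h: live_config.pop(h, None),
    is_healthy=lambda h: h != "host-7",
)
print(result)
```

In this example the single canary host passes, the 10% stage catches the unhealthy host, and all ten touched hosts are reverted before the change can reach the remaining 90% of the fleet.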

Fast-tracking genetic leads to reverse cellular aging

Biologists used Co-Scientist to discover new factors that reverse aging in human cells.

Simulate real-world places with Project Genie and Street View

Project Genie integrates Street View to support simulation of real-world places.

Introducing Gemini Omni

Announces Gemini Omni.

Introducing Google Antigravity 2.0

Announces Google Antigravity 2.0.

Gemini for Science: AI experiments and tools for a new era of discovery

Introduces the Gemini for Science toolset to support scientific discovery.

Making it easier to understand how content was created and edited

Expands content provenance tools to help users understand how content was created and edited.

Finding the molecular switches behind new infectious diseases

Uses Co-Scientist to identify the genetic triggers of emerging infectious diseases.

Opening new paths in aging research

Calico Life Sciences used Co-Scientist to make new progress in aging research.

Accelerating discovery of liver disease mechanisms

Uses Co-Scientist to identify new therapeutic targets for liver disease.

Uniting biological toolkits for a new approach to ALS

Co-Scientist helps Boston Children’s Hospital and MIT explore RNA therapies for ALS.

A conversation with Kevin Scott: What’s next in AI

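A conversation with Kevin Scott about where AI is headed.

From Hot Wheels to handling content: How brands are using Microsoft AI to be more productive and imaginative

Explores how brands are using Microsoft AI to boost productivity and creativity.

Microsoft open sources its ‘farm of the future’ toolkit

Microsoft open-sources its “farm of the future” toolkit.

How data and AI will transform contact centres for financial services

Explores how data and AI will reshape contact centres in financial services.

AI-equipped drones study dolphins on the edge of extinction

AI-equipped drones are being used to study endangered dolphins.

Online math tutoring service uses AI to help boost students’ skills and confidence

An online math tutoring service uses AI to build students’ skills and confidence.

AI-Mimi is building inclusive TV experiences for Deaf and Hard of Hearing user in Japan

AI-Mimi builds inclusive TV experiences for deaf and hard-of-hearing users in Japan.

Microsoft’s framework for building AI systems responsibly

Microsoft’s framework for building AI systems responsibly.

Singapore develops Asia’s first AI-based mobile app for shark and ray fin identification to combat illegal wildlife trade

Singapore developed Asia’s first AI-based app for identifying shark and ray fins, to combat the illegal wildlife trade.

The opportunity at home – can AI drive innovation in personal assistant devices and sign language?

Explores AI’s potential to drive innovation in personal assistant devices and sign language.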

VentureBeat AI

Google has rebuilt its search box for the first time in 25 years, marking a fundamental change in the search paradigm.

Railway secures $100 million to challenge AWS with AI-native cloud infrastructure

Railway raised $100 million to challenge AWS with AI-native cloud infrastructure.

Claude Code costs up to $200 a month. Goose does the same thing for free.

Compares Anthropic’s Claude Code with Goose, a free coding tool.

Listen Labs raises $69M after viral billboard hiring stunt to scale AI customer interviews

Listen Labs raised $69 million after drawing attention with a viral billboard hiring stunt.

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce released a new Slackbot AI agent as it competes with Microsoft and Google in workplace AI.

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Anthropic launched Cowork, a Claude Desktop agent that works on local files with no coding required.

Nous Research’s NousCoder-14B is an open-source coding model landing right in the Claude Code moment

Nous Research released NousCoder-14B, an open-source coding model with strong performance.

Systematic Optimization of Real-Time Diffusion Model Inference on Apple M3 Ultra

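Studies the systematic optimization of real-time diffusion-model inference on Apple M3 Ultra.

Mirror Descent-Type Algorithms for the Variational Inequality Problem with Functional Constraints

Proposes mirror-descent-type algorithms for variational inequality problems with functional constraints.

Reducing Credit Assignment Variance via Counterfactual Reasoning Paths

Reduces credit-assignment variance through counterfactual reasoning paths.

SignMuon: Communication-Efficient Distributed Muon Optimization

Proposes SignMuon, a communication-efficient distributed Muon optimization algorithm.

When Actions Disappear: Adversarial Action Removal in Self-Play Reinforcement Learning

Studies adversarial action removal in self-play reinforcement learning.

A Structural Threshold in Decision Capacity Governs Collapse in Self-Play Reinforcement Learning

Identifies a structural threshold in decision capacity in self-play reinforcement learning.

Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning

Studies action encodings in recurrent neural networks for reinforcement learning.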

Forecasting Medium-Horizon Alzheimer’s Disease Progression: Residual Gap-Aware Transformers for 24-Month CDR-SB Change from ADNI Clinical and Biomarker Histories

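Uses residual gap-aware Transformers to forecast the medium-horizon progression of Alzheimer’s disease.

Noise2Params: Unification and Parameter Determination from Noise via a Probabilistic Event Camera Model

Proposes Noise2Params, which achieves unification and parameter determination from noise via a probabilistic event-camera model.

StrLoRA: Towards Streaming Continual Visual Instruction Tuning for MLLMs

Proposes StrLoRA for streaming continual visual instruction tuning of multimodal LLMs.

How Many Visual Tokens Do Multimodal Language Models Need? Scaling Visual Token Pruning with F^3A

Studies how many visual tokens multimodal language models need and proposes the F^3A pruning method.

Fre-Res: Frequency-Residual Video Token Compression for Efficient Video MLLMs

Proposes Fre-Res, a frequency-residual video token compression method for efficient video multimodal LLMs.

GeoSym127K: Scalable Symbolically-verifiable Synthesis for Multimodal Geometric Reasoning

Proposes GeoSym127K, a scalable, symbolically verifiable synthesis approach for multimodal geometric reasoning.

SwordBench: Evaluating Orthogonality of Steering Image Representations

Proposes SwordBench for evaluating the orthogonality of steering image representations.

Cross-Source Supervision for Bone Infection Segmentation in Dual-Modality PET-CT

Proposes a cross-source supervision method for bone-infection segmentation in dual-modality PET-CT.

StreamPro: From Reactive Perception to Proactive Decision-Making in Streaming Video

Proposes StreamPro, which moves streaming-video systems from reactive perception to proactive decision-making.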

Deploying a Multistage Multimodal Recommender System on Amazon Elastic Kubernetes Service

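A practical guide to deploying a multistage multimodal recommender system on Amazon EKS.

Introduction to Lean for Programmers

An introduction to the Lean theorem prover and programming language, aimed at programmers.

Grounding LLMs with Fresh Web Data to Reduce Hallucinations

Explores how to ground LLMs in fresh web data to reduce hallucinations.

Proxy-Pointer RAG: Solving Entity and Relationship Sprawl in Large Knowledge Graphs

Proposes Proxy-Pointer RAG to address entity and relationship sprawl in large knowledge graphs.

Six Choices Every AI Engineer Has to Make (and Nobody Teaches)

Summarizes six production trade-offs that AI engineers must make once a model is deployed.

One Flexible Tool Beats a Hundred Dedicated Ones

Explores why, in the age of agents, a flexible tool often beats dedicated ones.

Why Your AI Demo Will Die in Production

Analyzes why 95% of enterprise AI pilots fail to reach production.

How to Maximize OpenAI’s Codex

Shows how to get the most out of OpenAI’s coding agent.

Pandas Isn’t Going Anywhere: Why It’s Still My Go-To for Data Wrangling

Explains why Pandas remains a go-to tool for data wrangling.

LLM Evals Are Based on Vibes — I Built the Missing Layer That Decides What Ships

The author built a lightweight evaluation layer that turns LLM outputs into repeatable decisions, replacing “vibes”-based evaluation.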