My first 24 hours with Siri AI on the Mac

我与 Mac 版 Siri AI 相处的最初 24 小时

Siri is better, but its limitations are much more obvious on a Mac than an iPhone. Siri 确实变强了，但在 Mac 上，它的局限性比在 iPhone 上表现得更为明显。

I turned off Siri on the Mac years ago and never looked back. Similarly, I found Apple Intelligence so fruitless I never engage with it. But the new Siri AI coming to macOS 27 Golden Gate has at least got me slightly rethinking things. 多年前我就关闭了 Mac 上的 Siri，从此再未开启。同样，我觉得 Apple Intelligence 毫无用处，所以也从不使用。但 macOS 27 Golden Gate 中搭载的新版 Siri AI 至少让我开始重新审视它。

I’m still early in testing Siri AI, as I’ve only had access to it in the macOS 27 developer beta for little more than 24 hours. It’s also in an early preview state on the dev beta, so there should be lots of runway for improvements before it releases later this year. I don’t even know if it’s done indexing my files and folders on our review unit M5 MacBook Air and M5 Max MacBook Pro. Unlike on the iOS 27 dev beta, there’s no “indexing in progress” box in the settings page. I asked Siri if it could tell me, but it told me to click a button in Settings that isn’t there. 我目前还处于测试 Siri AI 的早期阶段，因为我接触 macOS 27 开发者预览版的时间刚过 24 小时。它在开发者预览版中仍处于早期预览状态，因此在今年晚些时候正式发布前，还有很大的改进空间。我甚至不知道它是否已经完成了对我们评测机（M5 MacBook Air 和 M5 Max MacBook Pro）中文件和文件夹的索引。与 iOS 27 开发者预览版不同，设置页面中没有“正在索引”的提示框。我问 Siri 能否告知进度，它却让我去点击设置里一个根本不存在的按钮。

My colleagues got a headstart testing Siri AI on the iPhone and Apple Watch, and getting a read on its general vibe, and they’ve so far had some positive feedback from using it. My feelings are a little more mixed. 我的同事们比我更早开始在 iPhone 和 Apple Watch 上测试 Siri AI，并对其整体体验有了初步了解，目前他们的反馈还算积极。而我的感受则更为复杂。

When I sit down at a laptop I don’t need a voice assistant for searching things I’m randomly curious about or checking the weather like I would on a phone; I can do that faster and more accurately with keyboard and mouse. So I tried to think of ways to let Siri AI help me on macOS — things that might actually be useful to me in my everyday work. 当我坐在笔记本电脑前时，我并不像在手机上那样需要语音助手来搜索随手想到的问题或查看天气；用键盘和鼠标操作会更快、更准确。因此，我试图寻找让 Siri AI 在 macOS 上帮助我的方式——即那些在日常工作中真正有用的功能。

I’d be happy to automate some of the time-consuming benchmarking I do when reviewing laptops, but although Siri AI can launch apps, it can’t take actions inside them (not that Apple ever claimed it could). I then tried to see if vibe coding a couple Shortcuts could get me there instead. This isn’t a Siri AI feature, but it is a new part of Apple Intelligence. I asked Shortcuts to run a test in either Geekbench or Cinebench, capture the results in a screenshot, wait a few minutes, and repeat the process two more times. But the resulting automations couldn’t actually run the tests either. Apple Intelligence made a shortcut that opened Geekbench and took screenshots (but forgot about actually running the benchmark), and it made a Cinebench shortcut that had “Wait for you to run the test” as an actual step. Maybe if developers continue expanding App Intents this could one day work. 我很乐意将评测笔记本电脑时那些耗时的基准测试自动化，但尽管 Siri AI 可以启动应用程序，它却无法在应用内部执行操作（当然，苹果从未声称它能做到这一点）。于是我尝试通过“灵感编程”（vibe coding）创建几个快捷指令来实现。这虽然不是 Siri AI 的功能，但却是 Apple Intelligence 的新特性。我要求快捷指令在 Geekbench 或 Cinebench 中运行测试，截取结果，等待几分钟，然后再重复两次。但生成的自动化流程实际上也无法运行测试。Apple Intelligence 创建的快捷指令只会打开 Geekbench 并截图（却忘了运行测试本身），而它创建的 Cinebench 快捷指令中，甚至包含了一个“等待你运行测试”的步骤。也许随着开发者不断扩展 App Intents，未来有一天这能行得通。

So if Siri can’t help me run my benchmarks, maybe it can at least help me be a little faster in logging the data. In my normal workflow, I run each benchmark three times, taking screenshots as I go, and later average out the results before cataloging them in a spreadsheet. Apple’s WWDC keynote showed someone using Ask Siri in Spotlight to analyze data in local files. So I tried selecting batches of those screenshots in Finder and asking Siri to calculate the average scores for me. It worked pretty well — most of the time. 既然 Siri 无法帮我运行基准测试，那它至少能帮我加快记录数据的速度吧？在我的日常工作中，我会将每个基准测试运行三次，边测边截图，最后计算平均值并录入电子表格。苹果在 WWDC 主题演讲中展示了用户通过 Spotlight 中的“询问 Siri”来分析本地文件数据。于是，我尝试在访达（Finder）中选中一批截图，让 Siri 为我计算平均分。大多数情况下，它的表现还不错。

It was smart enough to distinguish single-core CPU scores from multicore CPU scores and GPU scores, average the test results, and arrange them in easy-to-read tables. But it could get thrown off if I included screenshots of too many different types of tests, especially if I mixed ones with synthetic score results (Geekbench, PugetBench, etc.) and time-based results (Blender render tests and our 4K video export test). And it sometimes got thrown off by the CPU rankings data that’s visible in Cinebench screenshots. Ideally, I’d be able to have Siri AI accurately calculate the 15 or so averages from my dozens of screenshots all at once — that would save me some serious time. But for now, it can at best only help me a little bit. And unless it gets better I’m still inclined to continue doing it all myself, especially since Siri messed up the numbers a couple times by pulling the wrong data. 它足够聪明，能区分 CPU 单核分数、多核分数和 GPU 分数，并能将测试结果取平均值，整理成易于阅读的表格。但如果我包含的测试类型太多，尤其是将合成评分结果（如 Geekbench、PugetBench 等）与基于时间的测试结果（如 Blender 渲染测试和我们的 4K 视频导出测试）混在一起时，它就会出错。有时，Cinebench 截图中的 CPU 排名数据也会干扰它的判断。理想情况下，我希望 Siri AI 能一次性从几十张截图中准确计算出 15 个左右的平均值——那将为我节省大量时间。但就目前而言，它顶多只能帮我一点小忙。除非它有所改进，否则我还是倾向于亲力亲为，毕竟 Siri 已经好几次因为提取了错误数据而搞乱了数字。

So far, Siri AI seems a lot more capable within Apple’s ecosystem than it is outside of it, even for apps and files that are already on my Mac but in non-Apple apps. When I asked Siri to find my pictures of cats or babies, it pulled up results from Apple’s Photos and Messages apps. This could be enough for plenty of people, but not for me. Most of my messaging is done in Signal, and photos from my phone are uploaded to Google Photos, not iCloud. Siri also missed the thousands of images I have in my Lightroom Classic catalog, even though the files are stored locally in the Pictures folder and I kept asking it to access them directly. It’s possible those files haven’t been indexed yet, but I have no way to tell. 到目前为止，Siri AI 在苹果生态系统内的表现似乎远强于生态系统之外，即使是对于那些已经存在于我 Mac 上但属于非苹果应用的程序和文件也是如此。当我让 Siri 查找猫或婴儿的照片时，它只从苹果的“照片”和“信息”应用中调取了结果。这对很多人来说可能足够了，但对我而言不行。我的大部分聊天都在 Signal 上进行，手机照片也上传到了 Google Photos 而非 iCloud。Siri 还漏掉了我 Lightroom Classic 目录中的数千张图片，尽管这些文件就存储在本地的“图片”文件夹中，且我多次要求它直接访问。可能这些文件尚未完成索引，但我无从得知。

For now, I’m getting similar vibes to when I tested Copilot Vision last year. Like Copilot Vision, you can use Siri’s Visual Intelligence to ask questions about things on your screen. And like Copilot, it’s limited. I asked Siri to evaluate benchmark results on a spreadsheet in Google Sheets, but it can’t see all the data if it’s not visible onscreen all at once. I could get it to see the whole spreadsheet by downloading it as an Excel file and pointing Siri at it in Finder, but when I asked for the laptop with the highest single-core Geekbench score it gave me multicore data. Not great. 目前，我的感受与去年测试 Copilot Vision 时如出一辙。和 Copilot Vision 一样，你可以利用 Siri 的“视觉智能”来询问屏幕上的内容。但也和 Copilot 一样，它非常有限。我让 Siri 评估 Google Sheets 电子表格中的基准测试结果，但如果数据不能一次性全部显示在屏幕上，它就无法读取。我可以通过将其下载为 Excel 文件并让 Siri 在访达中读取来让它看到整个表格，但当我询问哪台笔记本电脑的 Geekbench 单核分数最高时，它给出的却是多核数据。表现并不理想。