The best AI dictation apps, tested and ranked
The best AI dictation apps, tested and ranked
最佳 AI 听写应用测评与排名
AI dictation apps have come a long way in a short time. For years they were slow and inaccurate — unless you spoke with a particular accent and enunciated clearly. Advances in large language models (LLMs) and speech-to-text models have changed that, producing systems that can decipher speech more accurately while retaining enough context to format the text correctly. Developers have also built in features to automatically remove filler words, fix stumbles, and handle punctuation — outputting text that needs far fewer edits. With dozens of such apps now on the market, we’ve rounded up our picks for the best and most useful dictation apps available right now.
AI 听写应用在短时间内取得了长足的进步。多年来,它们一直运行缓慢且不够准确——除非你带着特定的口音并清晰地发音。大型语言模型(LLM)和语音转文字模型的进步改变了这一现状,现在的系统能够更准确地识别语音,同时保留足够的上下文来正确格式化文本。开发者还内置了自动删除填充词、修正口误和处理标点符号的功能,输出的文本需要修改的地方大大减少。目前市面上已有数十款此类应用,我们为您精选了当下最好用、最实用的听写应用。
Wispr Flow
Wispr Flow is a well-funded AI dictation app that lets you add custom words and instructions for dictation. It has native apps for macOS, Windows, and iOS; an Android version is in the works. The app lets you customize how it transcribes your text by choosing from “formal,” “casual,” and “very casual” styles for different kinds of writing, such as personal messaging, work, and email. And if you use it with vibe-coding tools like Cursor, you can turn on a feature to automatically recognize variables or tag files in the chat. The app lets you transcribe up to 2,000 words per week for free on desktop, and 1,000 words per month on iOS. Paid subscription plans offer unlimited transcription and start at $15 per month.
Wispr Flow 是一款资金雄厚的 AI 听写应用,允许你添加自定义词汇和听写指令。它拥有 macOS、Windows 和 iOS 的原生应用,Android 版本正在开发中。该应用允许你通过选择“正式”、“随意”和“非常随意”等风格来定制文本转录方式,以适应个人消息、工作和电子邮件等不同写作场景。如果你将其与 Cursor 等“氛围编程”(vibe-coding)工具配合使用,还可以开启自动识别变量或在聊天中标记文件的功能。该应用在桌面端每周免费提供 2,000 字的转录额度,iOS 端每月 1,000 字。付费订阅计划提供无限转录,起价为每月 15 美元。
Willow
Willow advertises itself as a big time-saver for those who don’t like to type. Alongside common features like automatic editing and formatting, the app uses large language models to generate a full passage of text from just a few dictated words. Willow also takes a more privacy-focused approach by storing all transcripts locally on your device and lets you opt out of model training entirely. It also lets you add custom vocabulary to help it adapt to your industry’s terminology, or your local dialect. Willow lets you dictate 2,000 words per month on its desktop app for free. Individual subscription plans start at $15 per month, unlocking unlimited dictation and enabling the app to remember your writing style.
Willow 宣称自己是那些不喜欢打字的人的“时间杀手”。除了自动编辑和格式化等常见功能外,该应用还利用大型语言模型,仅凭你听写的几个词就能生成完整的段落。Willow 在隐私保护方面也更为注重,它将所有转录内容本地存储在你的设备上,并允许你完全选择退出模型训练。它还支持添加自定义词汇,以帮助其适应你所在行业的术语或当地口音。Willow 的桌面版每月免费提供 2,000 字的听写额度。个人订阅计划起价为每月 15 美元,解锁无限听写功能,并能让应用记住你的写作风格。
Monologue
If privacy is your priority, Monologue lets you download its AI model directly to your device for transcriptions, keeping your data off the cloud entirely. What’s more, the app lets you customize its tone depending on the app you use it with. Monologue lets you transcribe 1,000 words per month for free; a subscription costs $10 per month or $100 per year. The company also sends its most active users a physical shortcut device called the Monokey to use with the app.
如果隐私是你的首要考虑因素,Monologue 允许你将 AI 模型直接下载到设备上进行转录,从而完全避免数据上传至云端。此外,该应用还允许你根据所使用的软件自定义语气。Monologue 每月免费提供 1,000 字的转录额度;订阅费用为每月 10 美元或每年 100 美元。该公司还会向最活跃的用户赠送一款名为 Monokey 的实体快捷键设备,用于配合应用使用。
Superwhisper
Superwhisper is primarily a dictation app, but it can also transcribe from audio or video files. The app lets you choose and download AI models, including several of its own at different speeds and accuracy levels, along with Nvidia’s Parakeet speech-recognition models. The app also lets you write custom prompts to steer the output, and you can view both processed and unprocessed transcripts directly from your system keyboard. The basic voice-to-text feature is free to use, and you get 15 minutes to test Pro features such as translation and transcription. The paid tier lets you use your own AI API keys and connect cloud and local models without any usage caps. The monthly plan costs $8.49 per month, the annual plan costs $84.99 per month, or you can pay $249.99 for a lifetime subscription.
Superwhisper 主要是一款听写应用,但它也可以转录音频或视频文件。该应用允许你选择和下载 AI 模型,包括其自有的几种不同速度和准确度等级的模型,以及 Nvidia 的 Parakeet 语音识别模型。该应用还支持编写自定义提示词来引导输出,你可以直接从系统键盘查看处理前后的转录文本。基础的语音转文字功能免费,并提供 15 分钟的 Pro 功能(如翻译和转录)测试时长。付费层级允许你使用自己的 AI API 密钥,并连接云端和本地模型,没有任何使用限制。月度计划为每月 8.49 美元,年度计划为每月 84.99 美元,或者支付 249.99 美元购买终身订阅。
VoiceTypr
The VoiceTypr app takes an offline-first, no-subscription approach, letting you use local models for transcription. It also has a GitHub repository for those who want to host and run the open source version themselves. VoiceTypr supports over 99 languages and works on both Mac and Windows. The app is available to try for three days for free, and after that, it will allow you to buy a lifetime license. The app costs $35 for one device, $56 for two, and $98 for four devices.
VoiceTypr 应用采取“离线优先、无需订阅”的策略,让你使用本地模型进行转录。它还提供了一个 GitHub 仓库,供那些希望自行托管和运行开源版本的用户使用。VoiceTypr 支持超过 99 种语言,适用于 Mac 和 Windows。该应用提供 3 天免费试用,之后需购买终身许可。单设备授权费用为 35 美元,两台设备 56 美元,四台设备 98 美元。
Aqua
Aqua is a Y Combinator-backed voice-typing app for Windows and macOS that claims to be one of the fastest tools in the category in terms of latency (the delay between when you speak and when text appears on screen). Besides handling grammar and punctuation, Aqua also lets you autofill text by saying phrases — you can say “my address” and have Aqua type it in, for example. The app also offers its own speech-to-text API, letting other apps plug into Aqua’s transcription engine. The free tier gets you 1,000 words per month. Paid plans start at $8 per month billed annually and unlock unlimited words and 800 custom dictionary values.
Aqua 是一款由 Y Combinator 支持的 Windows 和 macOS 语音输入应用,号称是同类工具中延迟(从说话到文字出现在屏幕上的时间差)最低的应用之一。除了处理语法和标点符号外,Aqua 还允许你通过说出短语来自动填充文本——例如,你可以说“我的地址”,Aqua 就会自动输入它。该应用还提供自己的语音转文字 API,允许其他应用接入 Aqua 的转录引擎。免费层级每月提供 1,000 字额度。付费计划起价为每月 8 美元(按年计费),可解锁无限字数和 800 个自定义词典条目。
Handy
Handy is an open-source, free transcription tool that runs on Mac, Windows, and Linux. The app is pretty basic and doesn’t offer much customization, but if you want to start using your voice more and don’t want to pay, it is a good option. The app has a basic settings menu that lets you toggle push-to-talk and change the hotkey to activate transcription.
Handy 是一款开源、免费的转录工具,可在 Mac、Windows 和 Linux 上运行。该应用非常基础,没有提供太多自定义选项,但如果你想开始更多地使用语音输入且不想付费,它是一个不错的选择。该应用拥有一个基础设置菜单,允许你切换“按键说话”模式并更改激活转录的快捷键。
Typeless
Typeless stands out for its high free word count. The company claims it doesn’t retain any data or use it to train AI models. Typeless also offers to rewrite sentences you may have fumbled. The app lets you dictate up to 4,000 words per week (roughly 16,000 words per month) on its free tier. You can pay $12 per month (billed annually) to unlock unlimited words and get access to new features. Typeless is available for Windows and macOS only.
Typeless 以其高额的免费字数脱颖而出。该公司声称不保留任何数据,也不会使用数据来训练 AI 模型。Typeless 还提供重写功能,可以帮你修改可能说错的句子。该应用在免费层级下每周允许听写最多 4,000 字(每月约 16,000 字)。你可以支付每月 12 美元(按年计费)来解锁无限字数并获得新功能。Typeless 仅适用于 Windows 和 macOS。
VoiceInk
VoiceInk is an open-source private dictation app for Mac. The app supports global shortcuts for recording start/stop, along with a push-to-talk mode. It reads the context on screen and adjusts its output accordingly. The app can automatically detect certain apps and URLs and apply custom formatting or rules to.
VoiceInk 是一款适用于 Mac 的开源隐私听写应用。该应用支持用于录音开始/停止的全局快捷键,以及“按键说话”模式。它能读取屏幕上的上下文并相应地调整输出。该应用可以自动检测特定的应用和 URL,并应用自定义的格式或规则。