Do You Actually Need to Pay for Transcription Software?
Do You Actually Need to Pay for Transcription Software?
你真的需要为转录软件付费吗?
I’m constantly seeing ads for Wispr Flow, an AI-powered transcription tool. The pitch—that you’ll be able to write faster by talking out loud instead of typing—is compelling, especially if you’re a slow typist. The marketing promises you’ll be able to “write at the speed of thought, 4x faster than your keyboard.” 我经常看到 AI 转录工具 Wispr Flow 的广告。它的卖点——通过口述而非打字来提高写作速度——非常有吸引力,尤其是对于打字慢的人来说。其营销承诺你能够“以思维的速度写作,比键盘输入快 4 倍”。
I already type faster than I can think. (Fast typist, or slow thinker? You decide.) But Wispr Flow’s core promise isn’t just transcription—it’s post-processing. The tool uses two steps. First, modern AI transcription tools turn your voice into text; second, a large language model (LLM) removes filler words and formats your words into complete sentences and paragraphs. The idea is that you can talk out your ideas and watch them turn into properly formatted text. This works inside any text box on your computer or phone. 我的打字速度已经快过我的思考速度了。(是打字快,还是思考慢?由你决定。)但 Wispr Flow 的核心承诺不仅仅是转录,而是后期处理。该工具分为两步:首先,现代 AI 转录工具将你的语音转化为文字;其次,大语言模型(LLM)会删除口头禅,并将你的话语整理成完整的句子和段落。其理念是,你可以口述想法,并看着它们变成格式规范的文本。这可以在电脑或手机上的任何文本框中使用。
I’ve tested this a few times and have to admit the results are pretty good. Apple’s dictation feature, free on all its devices, works well enough—so does Google’s Assistant Voice Typing on Pixel phones (which is getting another AI upgrade soon). But there’s real value in software that removes filler words and formats everything into paragraphs. And Wispr Flow is sleekly designed, guiding you through the setup process with snappy graphics. 我测试了几次,不得不承认效果相当不错。苹果设备上免费的听写功能表现尚可,Pixel 手机上的谷歌语音输入(即将迎来另一次 AI 升级)也是如此。但能够删除口头禅并将内容整理成段落的软件确实有其价值。而且 Wispr Flow 设计精美,通过简洁的图形引导你完成设置过程。
So what’s the catch? Price. WisprFlow costs $144 per year (billed annually) or $15 a month after an extremely limited free trial. But the technology Wispr Flow is built around—AI-based transcription and LLMs—is widely available. On the speech-to-text side, Nvidia’s Canary and OpenAI’s Whisper are both open source, meaning they’re completely free to run on your own device. And most AI enthusiasts are already paying for OpenAI, Claude, or Google’s Gemini, any of which can handle the post-processing part of Wispr Flow. So can free local tools like Ollama, Google Recorder, or Apple Intelligence. 那么问题出在哪里?价格。Wispr Flow 每年收费 144 美元(按年计费),或者在极其有限的免费试用后每月收费 15 美元。但 Wispr Flow 所依赖的技术——基于 AI 的转录和 LLM——已经非常普及。在语音转文字方面,Nvidia 的 Canary 和 OpenAI 的 Whisper 都是开源的,这意味着它们可以在你自己的设备上完全免费运行。大多数 AI 爱好者已经订阅了 OpenAI、Claude 或 Google Gemini,它们中的任何一个都能处理 Wispr Flow 的后期处理部分。像 Ollama、Google Recorder 或 Apple Intelligence 这样的免费本地工具也可以做到。
With all this in mind, I’ve been wondering: Is there a good, free platform-agnostic alternative to Wispr Flow? I tried out several applications—here’s what I found. 考虑到这些,我一直在想:有没有一种好的、免费的、跨平台的 Wispr Flow 替代品?我尝试了几款应用程序,以下是我的发现。
Spokenly, the Best Free Alternative
Spokenly:最佳免费替代品
If you want to get the benefits of Wispr Flow without a subscription quickly, you could do worse than Spokenly, available on both macOS and Windows. It’s not open source, but it is free to download and does not require an account to use. There’s a Pro plan that costs $10 a month or $100 a year. The paid plan is only necessary if you’re using Spokenly’s cloud models. You can opt to use a local model instead, which is free. Alternatively, if you’re already paying for a service like OpenAI or Groq, you can add your API key and use that for transcribing—that’s free with Spokenly. 如果你想快速获得 Wispr Flow 的功能而无需订阅,Spokenly 是个不错的选择,它支持 macOS 和 Windows。它不是开源的,但可以免费下载且无需注册账号即可使用。它有一个每月 10 美元或每年 100 美元的专业版计划。只有当你使用 Spokenly 的云端模型时才需要付费。你可以选择使用免费的本地模型。或者,如果你已经订阅了 OpenAI 或 Groq 等服务,你可以添加 API 密钥并将其用于转录——这在 Spokenly 中是免费的。
Spokenly offers optional post-transcription formatting. You can also choose a different LLM provider for the post-transcription formatting of text. As a Mac user, I opted to use Apple Intelligence—it’s totally free and worked really well in my tests. But it supports OpenAI, Anthropic, and Groq, plus a few other LLM providers. The application also allows you to write as many custom prompts for post-transcription processing as you like, each with its own keyboard shortcut. Spokenly 提供可选的转录后格式化功能。你还可以选择不同的 LLM 提供商来进行文本格式化。作为 Mac 用户,我选择了 Apple Intelligence——它完全免费,在我的测试中表现非常好。它还支持 OpenAI、Anthropic、Groq 以及其他一些 LLM 提供商。该应用程序还允许你编写任意数量的自定义提示词用于后期处理,每个提示词都可以设置专属的键盘快捷键。
One of my favorite things is that Spokenly can work entirely offline. If you use a local model for transcription and a local model like Apple Intelligence for the post-transcription formatting, the entire thing works without any data leaving your computer. That’s nice from a privacy perspective, and from a functionality standpoint, the feature will work even when your internet is shaky. 我最喜欢的一点是 Spokenly 可以完全离线工作。如果你使用本地模型进行转录,并使用像 Apple Intelligence 这样的本地模型进行后期格式化,整个过程无需任何数据离开你的电脑。从隐私角度来看这很好,从功能角度来看,即使在网络不稳定的情况下,该功能也能正常工作。
This is, without a doubt, more work than setting up Wispr Flow. When you’re done, though, you have a working application with no monthly subscription. I recommend trying it out. 毫无疑问,这比设置 Wispr Flow 要麻烦一些。但完成后,你就拥有了一个无需每月订阅的实用程序。我建议尝试一下。
A Few Other Free Alternatives
其他几个免费替代品
Like I said before: AI transcription and LLMs are both widely available technologies. It should be no surprise, then, that there are many Wispr Flow alternatives out there right now. 正如我之前所说:AI 转录和 LLM 都是非常普及的技术。因此,现在市面上有很多 Wispr Flow 的替代品也就不足为奇了。
For Mac users, the completely free and open source MacParakeet is a great option. It’s open source and completely free to download and use without an account. There’s also no upselling in the application. Transcribing is handled using local models, either Parakeet or Whisper, and a variety of LLMs—both local and online—are supported for the formatting step. That’s the closest completely free app to Wispr Flow I’ve found. 对于 Mac 用户来说,完全免费且开源的 MacParakeet 是一个很好的选择。它是开源的,下载和使用完全免费,无需账号。应用程序内也没有任何推销。转录使用本地模型(Parakeet 或 Whisper)处理,格式化步骤支持多种本地和在线 LLM。这是我发现的最接近 Wispr Flow 的完全免费应用。
VoiceInk, another Mac-only option, is open source and free to use if you download the code from GitHub and compile it yourself. The app otherwise costs $25, one time, after which you can use all features without any ongoing payments. Note that the formatting step for this requires an API key from a service such as Gemini, Anthropic, OpenAI, or Claude. VoiceInk 是另一个仅限 Mac 的选择,如果你从 GitHub 下载代码并自行编译,它是开源且免费的。否则该应用一次性收费 25 美元,之后你可以使用所有功能而无需持续付费。请注意,其格式化步骤需要 Gemini、Anthropic、OpenAI 或 Claude 等服务的 API 密钥。
Windows and Linux users should look into FOSS Voquill, which is completely free, open source software (hence the FOSS), and works offline. It doesn’t offer a formatting step, which is disappointing, but I’m including it because it’s the best free Windows and Linux option I’ve found without any annoying upselling. Windows 和 Linux 用户应该看看 FOSS Voquill,它是完全免费的开源软件(因此得名 FOSS),并且可以离线工作。它不提供格式化步骤,这令人失望,但我把它包括在内是因为这是我发现的最好的、没有任何烦人推销的 Windows 和 Linux 免费选项。
Windows users and Mac users who don’t like the above options for any reason have one more choice: OpenWhispr. This open source tool doesn’t require an account (but you’ll have to find a tiny “Continue without an account” button). The application offers a subscription, but you can opt to set up local models and external API keys instead to avoid paying. Windows 用户以及因任何原因不喜欢上述选项的 Mac 用户还有一个选择:OpenWhispr。这个开源工具不需要账号(但你需要找到一个微小的“Continue without an account”按钮)。该应用程序提供订阅服务,但你可以选择设置本地模型和外部 API 密钥来避免付费。
Do You Really Need to Type With Your Voice?
你真的需要用语音打字吗?
Wispr Flow has its upsides. It’s easy to configure, for one thing, and has a consistent user interface. I can understand why someone might opt to pay for a subscription. But if money is tight right now, there are free options available. Wispr Flow 有其优势。首先,它易于配置,并且拥有统一的用户界面。我可以理解为什么有人会选择付费订阅。但如果手头紧,也有免费的选择。
I had fun exploring this growing field, but I’m going to stick to my keyboard. Wispr Flow, and apps like it, promise to let you write at the speed of thought, but I type faster than I think. If I can be philosophical for a second, writing is how I think. Typing a sentence, looking at it, and refining it isn’t an… 探索这个不断发展的领域很有趣,但我还是会坚持使用键盘。Wispr Flow 和类似的应用程序承诺让你以思维的速度写作,但我的打字速度比思考速度快。如果我能感性一点的话,写作就是我思考的方式。打出一个句子,审视它,然后润色它,这并不是一种……