Whisperstream vs Aqua Voice
Local. One-time. Windows.
Whisperstream is a $29-once, on-your-PC dictation app for Windows. Aqua Voice is an $8-a-month cloud app for Mac, Windows, and iOS, with a model that actively rewrites your phrasing. If you prefer paying once, keeping audio off the network, and getting back the words you said, Whisperstream is the closer fit. If you want cross-platform reach plus LLM cleanup of every sentence, Aqua stays the better answer.
Updated
At a glance
Whisperstream is a $29-once, on-your-PC dictation app for Windows. Aqua Voice is an $8-per-month cloud app for Mac, Windows, and iOS. If you want literal transcription and audio that stays on your machine, Whisperstream is the closer fit. If you want LLM-driven phrasing cleanup and cross-platform reach, Aqua stays the better answer.
- Pricing
- $29 once vs $96 a year
- Where audio goes
- Audio stays on your PC vs uploaded to Aqua's servers
- Best for
- Privacy-first Windows users who want literal transcription
Side by side
Pricing
Whisperstream costs $29 once. Aqua Pro is $8 a month, billed annually, which works out to $96 a year. Over three years that is $29 versus $288, almost ten times the cost. Whisperstream breaks even against Aqua at month four. Aqua subscription fees are non-refundable per their terms, so if you do try Aqua and prefer something else, you ride out the current year rather than recover the spend.
Privacy
Whisperstream transcribes on your PC; audio never leaves your machine. Aqua is a cloud app and a typical session passes through three privacy choices. First, audio is transmitted to Aqua's servers for transcription. Second, Aqua's "Deep Context" reads on-screen text to improve results; it is off by default but recommended in the onboarding flow. Third, transcript data may be stored on Aqua's servers to improve the product unless Privacy Mode is turned on, with a contractual Zero Data Retention SLA available only on the Enterprise plan.
All three behaviors are documented on Aqua's site. Their privacy policy states that "for users with Privacy Mode disabled, we may securely store transcript data on our servers to the extent necessary to improve the product." Their homepage Deep Context section describes the feature as "your screen is its dictionary, understands what's on your screen, from code syntax to everyday text" and confirms it is "optional and off by default." Whisperstream has no equivalent screen-reading or training-opt-in toggle because the model is local and there is no telemetry path for audio.
How we tested
We ran a 36-sample word-error-rate benchmark covering technical jargon, code-like dictation, brand names, numbers, natural sentences with embedded jargon, long sustained dictation, and easy-mode English. Single take per sample, same Windows machine, same mic. Aqua Pro on the Avalon model, Deep Context off, Model improvements off. Whisperstream in its stock shipping configuration: pure ASR via NVIDIA's Parakeet TDT v3 on CPU, no LLM cleanup layer, dictionary empty.
Aqua edges Whisperstream on bare technical terms and proper nouns, where its larger cloud model handles rare words better. Whisperstream wins on code-like dictation and long sustained passages, where Aqua's LLM cleanup compounds into semantic rewrites. Both products tie on plain English.
Both products are within about two percentage points of WER on this corpus. The wedge is the failure mode. Whisperstream's mistakes are spelling typos in rare brand names ("Tauri" came back as "Tari") that any human reader spots immediately and the dictionary tab fixes permanently. Aqua's mistakes are semantic rewrites: grammatical, plausible, and easy to miss when you are skimming.
When Aqua might still be right
Aqua is the right answer in several cases. We would rather name them upfront than pretend Whisperstream is everyone's best pick.
- You use a Mac or an iPhone. Whisperstream is Windows-only. Aqua ships native apps for macOS (Apple Silicon and Intel), Windows, and iOS.
- You want phrasing cleanup as a feature, not a bug. Aqua's Avalon model actively rewrites sentences, fixes grammar, and formats numbers in real time. For drafting Slack messages and emails that is a productivity win. Whisperstream stays literal on intent and content.
- You need 49-language coverage. Aqua supports 49 languages. Whisperstream's Parakeet TDT v3 model covers 25 European languages, so non-European languages are not in scope today.
- You need SOC 2 Type II for procurement. Aqua is SOC 2 Type II certified by Advantage Partners. Whisperstream is a single-user app and does not carry that certification.
For a broader Windows survey beyond this head-to-head, see our Wispr Flow alternatives roundup. If your other paid options were Wispr Flow or Willow Voice, the closer comparisons are Whisperstream vs Wispr Flow and Whisperstream vs Willow Voice.
If none of those apply, here is how most Aqua Pro users move over.
Switching from Aqua Voice to Whisperstream
- 01
Stop your Aqua Pro renewal
Open Aqua's billing settings and turn off auto-renew. Aqua subscription fees are non-refundable per their terms, so you cannot recover the spend, but you can ride out the current year while you settle in with Whisperstream.
- 02
Export your Custom Instructions and dictionary
Aqua Pro carries Custom Instructions and up to 800 dictionary entries. Copy both out before your subscription expires. Whisperstream has a dictionary tab that accepts the same kind of one-for-one word replacements; the Custom Instructions LLM-cleanup directives do not map one-to-one, so plan to redo any formatting rules by hand.
- 03
Install Whisperstream
Download the installer from this page and run it. The first launch downloads the speech model, about 600 MB on disk, which usually takes a few minutes. After that, everything runs offline.
- 04
Set your hotkey and paste your dictionary
Open Whisperstream's settings and pick a push-to-talk hotkey; the default is Right Shift, which most users keep. While you are in settings, paste your exported Aqua dictionary into the dictionary tab.
- 05
Test it in your real apps
Open the apps you usually dictate into, like Slack, Outlook, Word, or your editor, and try a sentence in each. If a word lands wrong, add an override and try again. Most users settle the dictionary in one sitting.
Frequently asked questions
No Typing,Just Speaking.Fully Local.
Private dictation for Windows. No cloud processing. No subscription.