Question 1

Does Whisperstream have AI cleanup?

Accepted Answer

Yes, but it is optional and off by default. Whisperstream can clean up dictation by removing filler words, fixing grammar and punctuation, and applying spoken self-corrections. The cleanup model runs locally on a capable GPU, with a free cloud fallback for older hardware. Plain transcription always runs fully on-device; you choose if and when to turn cleanup on.

Question 2

Can I create my own cleanup modes or write my own prompt?

Accepted Answer

Yes. AI cleanup comes with built-in modes and lets you create your own profiles, each with any prompt you write, so you can shape tone, formatting, and style however you like. Whisperstream applies each profile automatically to the app or website you set it for. Your prompt runs on a local model or a free Gemini key (or OpenAI or Anthropic). Plain transcription always stays fully on-device.

Question 3

Does Whisperstream work offline?

Accepted Answer

Yes. Whisperstream needs the internet only twice: once to download the speech model and once to activate your license. After that, dictation, transcription, and formatting all run with no connection at all. You can dictate on a plane, in a secure facility, or anywhere your network is down.

Question 4

Do I need a GPU to run Whisperstream?

Accepted Answer

No. Core transcription runs entirely on your CPU, so a dedicated graphics card is not required. A modern processor delivers near-instant results. A GPU is only used for the optional AI cleanup feature. Whisperstream runs on 8 GB of RAM, with 16 GB recommended.

Question 5

What languages does Whisperstream support?

Accepted Answer

Whisperstream transcribes 25 languages, including English, Spanish, French, German, Italian, Portuguese, Russian, and Polish, plus 17 more. Every language runs through the same on-device model, so multilingual dictation stays fully local with no cloud service and no per-language download.

Question 6

Which apps does Whisperstream type into?

Accepted Answer

Whisperstream pastes transcribed text into whatever window has focus, so it works the same in Outlook, Word, Slack, VS Code, your browser, or a terminal. There is no add-in to install and no per-app configuration. If your cursor can blink in a text field, Whisperstream can type into it.

Everything Whisperstream does

On-device transcription

Push-to-talk hotkey

Types into any app

Transcript history

Audio file import

25 languages

Works offline

System-volume ducking

Optional AI cleanup

Automatic profile switching

Custom dictionary

Custom style prompt

Microsoft-signed installer

Runs on CPU, no GPU required

Frequently asked questions

Own your dictation.$29 once.Fully local.