Core dictation
The everyday pipeline: capture your voice, transcribe it on your CPU, and type it into whatever app you have open.
On-device transcription
Whisperstream transcribes your speech entirely on your computer using NVIDIA's open-weight Parakeet TDT v3 model running on your CPU. Your audio is never uploaded, streamed, or stored on a server, and you do not need an account. After the one-time model download, transcription works with no internet connection at all.
Push-to-talk hotkey
Dictation is push-to-talk by default: hold your hotkey, speak, and release, so nothing is ever listening in the background. The default is Right Shift, and you can remap it to any key from the Controls tab. Prefer hands-free? Switch to toggle mode, where one press starts recording and another stops it. No admin rights are needed to register the shortcut.
Types into any app
Whisperstream pastes your transcribed text into whatever window has focus, so it works the same in Outlook, Word, Slack, VS Code, your browser, or a terminal. There is no add-in to install and no per-app configuration. If your cursor can blink in a text field, Whisperstream can type into it.
Transcript history
Whisperstream keeps a private history of everything you dictate, so you can find a past transcript, replay its original audio, and copy the result again later. The history is encrypted at rest on your own device and is never synced to a server. Search by keyword, play the recording back next to its text, and delete anything you would rather not keep.
Audio file import
Beyond live dictation, Whisperstream transcribes audio files you already have. Drop in a meeting recording, a voice memo, or an interview and it runs through the same on-device Parakeet model, with no upload and no account. The finished transcript lands in your history alongside everything you have dictated, ready to search, replay, and copy.
25 languages
Whisperstream transcribes 25 languages, including English, Spanish, French, German, Italian, Portuguese, Russian, and Polish, plus 17 more. Every language runs through the same on-device model, so multilingual dictation stays fully local with no cloud service and no per-language download.
Works offline
Whisperstream needs the internet only twice: once to download the speech model and once to activate your license. After that, dictation, transcription, and formatting all run with no connection at all. You can dictate on a plane, in a secure facility, or anywhere your network is down.
System-volume ducking
Whisperstream can lower or mute your system volume while you dictate, so music or a video does not bleed into your recording or distract you. Choose off, reduce to 20 percent, or full mute in the Audio settings. Volume returns to its previous level the moment you release the hotkey.