Does it work with my Anthropic API key?

Yes. Whisperstream pastes a transcript into Claude Code's prompt the same way you would type it. It has no involvement in how Claude Code talks to Anthropic, so any auth method works: claude.ai, raw API key, Amazon Bedrock, Google Vertex AI, Microsoft Foundry.

What's the latency vs cloud?

Whisperstream transcribes locally on-CPU, so there is no network round-trip. Anthropic does not publish a latency number for /voice. Both feel fast on a modern machine; the difference is that Whisperstream's path is bounded by your CPU rather than internet conditions. The Claude Code issue tracker also has open reports about /voice reliability that a local pipeline avoids.

Does it work with Cursor, VS Code, or Windsurf?

Yes. Whisperstream pastes into whichever window has focus. Cursor's chat, VS Code's Copilot chat or any extension chat panel, Windsurf's prompt, the terminal, all the same setup, all the same hotkey.

Is Claude Code open source?

No. Claude Code is source-available, not open source. Its LICENSE.md reads "© Anthropic PBC. All rights reserved. Use is subject to Anthropic's Commercial Terms of Service." You can read the code; it is not under an OSI-approved license.

Will Anthropic ever add API-key support to /voice?

Anthropic has not announced any planned expansion of /voice to API-key, Bedrock, Vertex, or Foundry auth. As of May 2026, the docs are firm that /voice requires a claude.ai account. If that changes, the comparison on this page changes; until then, Whisperstream is the way to dictate in Claude Code outside claude.ai auth.

Does Whisperstream work reliably in Claude Code's terminal?

Yes. Anthropic issue #51725 documents that Claude Code's TUI rejects keystroke-simulated input but accepts clipboard paste. Whisperstream uses clipboard paste by default, the workflow that issue confirms works. If you ever hit a problem in your terminal, the transcript still lands on your clipboard so you can Ctrl+V it yourself.

Voice coding

Voice Input for Claude Code on Windows

Push to talk. Audio stays on your PC. Works with any Claude Code auth method.

Claude Code shipped a native /voiceslash command in v2.1.69 (March 2026), but it works only with claude.ai accounts and streams audio to Anthropic's servers. Whisperstream is a $29 push-to-talk dictation app for Windows. It runs on your PC, types into Claude Code's prompt for you, and works with any auth method including API key, Bedrock, Vertex, and Foundry.

Updated July 2, 2026

Get Whisperstream - $29 once See the comparison

At a glance

Claude Code's native /voiceslash command requires v2.1.69 or later (March 2026), and it works only with claude.ai accounts while streaming audio to Anthropic's servers. Whisperstream is a $29 push-to-talk dictation app for Windows. It runs on your PC, types into Claude Code's prompt for you, and works with any auth method (claude.ai, API key, Bedrock, Vertex, and Foundry).

Auth methods: Whisperstream: any. /voice: claude.ai only
Where audio goes: Stays on your PC vs streamed to Anthropic
Pricing: $29 once vs free with claude.ai Pro ($20/mo)

The auth gate

/voice only runs on claude.ai accounts

Anthropic's voice-dictation docs are explicit: “The speech-to-text service is only available when you authenticate with a Claude.ai account, and is not available when Claude Code is configured to use an Anthropic API key directly, Amazon Bedrock, Google Vertex AI, or Microsoft Foundry.” If your team runs Claude Code on a raw API key, in Bedrock, in Vertex, or in Microsoft Foundry, the /voice command will not start.

Whisperstream sidesteps the restriction by working a layer below Claude Code's auth. It pastes a transcript into whatever text field has focus, the same way you would type. It does not call Anthropic's API; it does not know how Claude Code authenticates. Whichever auth path you use, Whisperstream still works.

The full picture

Side by side

Whisperstream

/voiceClaude Code /voice

Auth methods supported

Any (claude.ai, API key, Bedrock, Vertex, Foundry)

claude.ai account only1

Where audio goes

Stays on your PC

Streams to Anthropic's servers2

ASR model

NVIDIA Parakeet and Qwen3 ASR, on-CPU

Cloud (Anthropic-managed)2

Audio retention

None, audio never leaves device

Falls under Claude Code data policy: up to 5 years if opted in to model improvement, 30 days otherwise3

Pricing

$29 once

Free with claude.ai Pro ($20/mo)4

Platform

Windows 10 and 11

macOS, Windows, Linux (WSL needs WSLg)5

Works in other editors

Yes, anywhere you can paste

No, Claude Code only

Cleans up technical dictation

Code mode + per-app profile auto-applied in the terminal/IDE

Raw transcript

License

Proprietary

Proprietary, source-available6

Sources

Why the mechanism works

The TUI accepts clipboard paste

Claude Code's terminal UI is selective about how text is delivered to its prompt. The Anthropic issue tracker has a recent report (issue #51725) where another dictation tool found its keystroke-simulated input rejected by Claude Code's TUI while clipboard paste worked. The reporter's own words: “Claude Code's TUI seems to reject [keystroke simulation]. Clipboard-based paste works fine, suggesting the TUI only accepts bracketed-paste input.”

Whisperstream uses clipboard paste. Audio gets transcribed locally, the transcript lands on your clipboard, and a Ctrl+V keystroke pastes it into whichever window has focus. That is the same delivery method the issue identifies as working. The page you are reading is partly a way to point at that primary-source confirmation so you do not have to test it yourself.

Privacy

Where the audio goes

Per Anthropic's docs, “voice dictation streams your recorded audio to Anthropic's servers for transcription. Audio is not processed locally.” That audio falls under Claude Code's general consumer data retention, which the docs describe as up to five years for accounts that allow data use for model improvement and 30 days otherwise.

Whisperstream transcribes on your PC. The audio frame buffer lives in memory; the ASR model (NVIDIA Parakeet) runs on-CPU via ONNX Runtime; the transcript goes to your clipboard and gets pasted into the focused window. Nothing leaves the machine unless you decide to send it.

When the other option wins

When /voice is the right answer

There are real cases where Anthropic's native /voice is the right answer, not Whisperstream.

You authenticate Claude Code with a claude.ai Pro or Max account and only ever use Claude Code, not Cursor or Windsurf or VS Code. /voice is built in.
You work primarily on macOS or Linux. Whisperstream is Windows-only today.
You want the cloud round-trip on purpose, for example to keep an audit trail of voice prompts on a managed Anthropic account.
You are comfortable with up to five years of audio retention if your account opts in to model improvement.

For a broader Windows survey beyond Claude Code, see our Wispr Flow alternatives roundup or the offline dictation roundup. If none of the cases above apply to you, here is how setup looks in about five minutes.

Setup guide

Set up Whisperstream for Claude Code

Install Whisperstream
Download the installer from this page and run it. Whisperstream runs on Windows 10 or 11 x64 with about 4 GB of free RAM; no GPU required. The first launch downloads the speech model, about 600 MB, which takes a few minutes. After that, everything runs offline.
Set your push-to-talk hotkey
Open Whisperstream's settings and pick a hotkey. The default is Right Shift, which most users keep. There's also a separate toggle-recording shortcut, so you can start and stop hands-free for long, multi-paragraph prompts instead of holding the key down.
Open Claude Code in your terminal
Windows Terminal, PowerShell, Wezterm, Alacritty, any terminal works. Whisperstream pastes into whatever window has focus.
Put the cursor in Claude Code's prompt
The same place you would type a prompt today.
Hold the hotkey and speak, then release
Whisperstream pastes the transcript into Claude Code's prompt. Press Enter to submit, the same as if you had typed it.

The same five steps work for Cursor, Windsurf, VS Code chat, and any other editor that takes typed text. That is the point.

Works everywhere

Works with the rest of your stack

Whisperstream pastes text into whatever window has focus. Anything you can type into, you can dictate into.

Every prompt you dictate is also saved to a searchable, encrypted history on your device, with audio playback. So when you need that long, carefully worded prompt again, you can find it and re-paste it instead of re-dictating it. The history is encrypted at rest and never synced to a server. See transcript history.

Cursor: Push to talk into Cursor's chat panel. No plugin, no integration, the same hotkey.
Windsurf: Same setup, same hotkey. Windsurf's chat is a text field; Whisperstream types into it.
VS Code: Works with GitHub Copilot chat, Continue, and any extension that takes typed prompts.
Codex, Aider, OpenCode: Whisperstream is auth-agnostic and app-agnostic. If your agent reads a prompt, you can dictate into it.

See all features for the broader picture of how Whisperstream fits into a Windows dev setup.

Keep reading

Related comparisons

Frequently asked questions

Yes. Claude Code's native /voice slash command requires v2.1.69 or later (March 2026), and Anthropic's docs cover it at code.claude.com/docs/en/voice-dictation. Two practical catches: it requires a claude.ai account and streams your audio to Anthropic's servers, and some users report that /voice mis-transcribes technical terms and homophones (there are open accuracy reports on the Claude Code issue tracker). Whisperstream is the local alternative. It transcribes on your PC with NVIDIA Parakeet and works with any Claude Code auth (API key, Bedrock, Vertex, Foundry), so you can dictate in Claude Code without a claude.ai account and without sending audio to the cloud.

Three reasons. First, Whisperstream works regardless of how Claude Code authenticates, including API key, Bedrock, Vertex, and Foundry, while /voice requires a claude.ai account. Second, audio never leaves your PC because transcription runs on-device via NVIDIA Parakeet and Qwen3 ASR. Third, Whisperstream has an optional on-device AI cleanup with a Code mode tuned for code and technical dictation, which /voice does not have. It runs a local model on a capable GPU (with a free cloud fallback), needs no API key, and is off by default.

Own your dictation.$29 once.Fully local.

Free to try. No account.

Download free for Windows30-day money-back guarantee