Dictation for Citrix and RDP that actually lands in the remote session
Voice typing in locked-down remote desktops, with keystroke input
Most dictation tools move text by writing your clipboard and pasting it. Citrix, RDP, and Azure Virtual Desktop can restrict or disable clipboard redirection by policy, and on new Azure Virtual Desktop and Windows 365 deployments it is off by default, so paste has nowhere to land. Whisperstream's keystroke input mode types your dictation in as keyboard input, the channel a remote session always accepts. Turn it on with one toggle.
Updated
At a glance
Dictation that delivers text by pasting it tends to fail inside a remote desktop, because the clipboard is a channel administrators control and often lock down. Keystroke input sidesteps the whole problem by typing your dictation in as keyboard input, which a remote session always accepts.
- What breaks
- Dictation that delivers text by pasting it (clipboard, then Ctrl+V) into a remote app.
- Why
- Citrix, RDP, and Azure Virtual Desktop treat the clipboard as a redirected channel that admins can restrict or turn off, and new Azure Virtual Desktop and Windows 365 deployments turn it off by default.
- The fix
- Keystroke input types your dictation in as keyboard input, the channel a remote session always accepts, so it does not depend on the clipboard.
- Who this is for
- Anyone dictating into a Citrix, RDP, or Azure Virtual Desktop session on a managed PC: healthcare, finance, legal, customer support, and other locked-down environments. If you also rely on a screen reader, the same keystroke mode helps there too (see dictation that works with screen readers).
Why remote desktop sessions break clipboard dictation
A remote session shows you an app running on another machine. Your keyboard and mouse are sent to that machine, and a few extra conveniences (the clipboard, drive mapping, printers) ride along as separate redirected channels. The clipboard is one of those channels, and it is one administrators control. In Citrix it is the Client clipboard redirection policy. In Remote Desktop Services it is the Do not allow Clipboard redirection Group Policy. In Azure Virtual Desktop and Windows 365 it is a host-pool setting that, on new deployments, is off in both directions by default.
When that channel is restricted or off, copy and paste between your local PC and the remote session stops working. That is usually a deliberate security choice in regulated and managed environments. It also means a dictation app that delivers text by writing your clipboard and pasting it has nowhere to put the words. The text never crosses into the remote app. This is not Citrix or RDP being broken; it is the clipboard doing exactly what policy told it to do.
Keyboard input is different. Typing is how you use a remote app at all, so keystrokes are always delivered into the session. Whisperstream's keystroke input mode leans on that: instead of pasting, it types your dictation in character by character as keyboard input, so there is no clipboard step to block.
Writes your dictation to the clipboard, then sends Ctrl+V. Fast and fine locally. In a remote session it only works if your IT team leaves clipboard redirection on.
Types your dictation in as keyboard input, character by character. Works in a remote session whether or not the clipboard is redirected, because typing is always delivered.
How dictation tools get text into a remote session
There are three ways a dictation tool can get text into a Citrix or RDP session. Only one needs nothing from your IT team.
Works in a locked-down session
- Mechanism
- Types the text in as keyboard input, character by character.
- Example
- Whisperstream, with Input Style set to Keystroke.
- Setup
- A toggle in the app, optionally scoped per app.
Breaks when the clipboard is off
- Mechanism
- Writes the clipboard and sends Ctrl+V.
- Example
- Most consumer dictation tools, e.g. Wispr Flow, whose docs say it depends on the remote client's clipboard sharing settings.
- Setup
- Nothing in the app, but it depends on IT clipboard policy.
Works, but IT-deployed
- Mechanism
- A vendor extension installed into the remote-session stack.
- Example
- Dragon Medical One, via a Nuance Citrix virtual-channel extension.
- Setup
- An IT project: server and per-endpoint install, plus Citrix allow-list changes.
Examples are illustrative of each approach. Whisperstream behavior is first-party; Wispr Flow and Dragon details come from their published documentation.
Turn on Keystroke input mode
Set keystroke mode once, then dictate into the remote window the same way you would type into it. The whole change is a single toggle in Controls.
Open Whisperstream and go to Controls
Launch Whisperstream on your local PC, the same machine running your Citrix or RDP client, and open the Controls page.
Set Input Style to Keystroke
Under Input Style, choose Keystroke. Whisperstream will type your dictation in as keystrokes instead of pasting it, so it lands in the remote-session window.
Raise the keystroke delay if needed
If a slow session drops characters, increase the keystroke delay. The setting is described in the app as a pause between keystrokes for slow remote sessions that drop fast typing.
Optional: scope it to your remote-desktop client
Add a per-app profile so only your Citrix or RDP client uses Keystroke, and local apps keep the faster clipboard paste.
Put your cursor in the remote field and speak
Click into the text field in your remote app, hold the push-to-talk key (Right Shift by default), and speak. The text types into the session.
When another approach still fits
Keystroke mode is the right default for dictating into a locked-down remote session, but it is not the only answer, and a few situations point elsewhere.
- You already run Dragon in Citrix
- If your organization already runs Dragon Medical One with PowerMic across Citrix, that is a deep, capable medical-dictation workflow, and there is no reason to replace it for that use. For a wider look at Dragon on Windows, see our Dragon NaturallySpeaking alternatives roundup.
- Your clipboard is not locked down
- If your IT team leaves clipboard redirection on, clipboard-based dictation tools work fine in the session, and you may not need keystroke mode at all.
- You dictate inside the virtual desktop
- If your model is to install a dictation tool inside the virtual desktop rather than dictate from your local PC, a server-side or VDI-installed tool may suit you better. Whisperstream runs locally and types into the session from your own machine.
Frequently asked questions
Own your dictation.$29 once.Fully local.
Free to try. No account.