New in 22H2: Accessibility

Windows 11 version 22H2 includes three major new accessibility features, which is quite exciting. Unfortunately, all three are available in U.S. English only.

They are…

Live captions

This rather incredible feature will provide live, on-the-fly captioning for any content that plays audio, which includes audio and video recordings, of course, but also apps like Skype and Zoom where you may be conversing with another person. Live captions appear in a resizable Live Captions app window, which can be docked to the top or bottom of the screen or appear as a floating window.

Windows Intelligence In Your Inbox

Sign up for our new free newsletter to get three time-saving tips each Friday — and get free copies of Paul Thurrott's Windows 11 and Windows 10 Field Guides (normally $9.99) as a special welcome gift!

"*" indicates required fields

This field is for validation purposes and should be left unchanged.

Live captions uses on-device artificial intelligence (AI) to do its thing, and so it can work even if the PC is offline.

To enable Live captions, search for “live captions” with Start search. Or, open Quick settings, select Accessibility, and then choose “Live captions” from the list. (You can also type WINKEY + CTRL + L.)

The settings gear in the Live captions window displays a pop-up menu with Position, Caption options, Caption style, and Learn more links. Live captions uses the same Caption options that were available previously in Windows in Settings > Accessibility > Captions.

Voice access

Voice access builds on the other voice control capabilities in Windows—voice typing and Windows Speech Recognition—and lets you control Windows using just your voice. It’s enabled via the Settings app (navigate to Accessibility > Speech > Voice access), but you can of course configure it to run automatically when Windows starts.

As with Live captions, Voice access uses a small pane, which in this case always appears at the top of the screen. You can turn on Voice access by saying “voice access wake up.” And then from there, you use commands—of which there are hundreds—to control what happens on screen. Typical commands include “click [item name],” “double-click [item name],” “maximize window,” “switch to [app name],” “select that,” and many more.

There’s also a nice Voice access commands guide for the full list: just say “what can I say?”

If you make a voice command that could have different meanings, Voice access will display teardrop-shaped and numbered overlays over different items. For example, when I say, “click the desktop,” I’m given several choices based on what is displayed on the screen. After that, I simply need to say the number of the one I mean to select that item. But you can also say “show numbers” to put overlays on non-named items so that anything can be selected.

You can also display a grid—“show grid”—and then drill down further with commands like “grid 2” to fine-tune a selection, and dictate and edit text (“select that,” “select next word,” “delete that,” and so on).

Narrator

Windows 11 version 22H2 improves Narrator by adding a selection of more natural-sounding—i.e. less robotic-sounding) voices. These will need to be downloaded once, but they will then work offline normally. To get started, navigate to Settings > Accessibility > Narrator and select the Add button next to Add natural voices.

Once you’ve installed one or more natural voices, they will appear in the “Choose a voice” drop-down under Narrator’s voice in Narrator settings. Natural voices can be configured with speed, pitch, and volume customizations like other voices.

Final thoughts

While all three of these new features are impressive, and each meets a particular need, Live captions is a particularly interesting option because it can benefit anyone, and not just those with a hearing impairment. Thanks to our deaf son, we’ve come to prefer using captions on movies and TV shows, and while the availability of captions has improved over the years, this meets a real need because not all audio/video content has captions, especially live conversations.

Share post

Please check our Community Guidelines before commenting

Conversation 6 comments

  • bluvg

    14 June, 2022 - 3:01 pm

    <p>I’m not deaf (though some may say I’m hard of hearing), but I love love love captions. Sometimes you just miss a word or phrase, or someone doesn’t enunciate well. Sure, sometimes the captions get it wrong, but I’m frequently amazed when it gets something right that I didn’t guess myself upon first hearing. (One downside–it sometimes spoils jokes when pre-transcribed.)</p>

    • benhaube

      14 June, 2022 - 7:22 pm

      <p>The live captions on Pixel are great. I am not deaf, but I do have worse hearing than someone my age should. I have a tough time with hearing high frequencies, and I have bad tinnitus. Even if I can hear what they are generally saying, sometimes I will miss a word, so it is nice to have the captions there. The AI does an excellent job transcribing on the fly. </p>

  • dftf

    14 June, 2022 - 4:49 pm

    <p>The <strong>live-captions</strong> is great, though odd it’s taken this-long to implement, given <em>Windows</em> has had built-in voice-to-text support since, what… <em>XP</em>? Or at-least <em>Vista</em> in the updated form of "Windows Speech Recognition". Surely they could have adapted that tech to listen to an internal audio-stream, rather than only the microphone, to have done this by-now?</p><p><br></p><p>And as for the other-two: isn’t something similar to <strong>voice-access</strong> already available in the <em>Windows Speech Recognition </em>app? As for the <strong>Narrator</strong> voices, I wonder why they don’t use the same tech as the "Read Aloud" feature in <em>Edge</em>, as most of the online-voices there sound very-natural.</p><p><br></p><p>I wonder if any of these will get backported to <em>Windows 10</em>?</p>

    • bluvg

      14 June, 2022 - 6:04 pm

      <p>True points, although speech recognition today is notably <em>miles</em> ahead of where it was even just a few years ago, almost eerily so at times.</p>

    • mikegalos

      15 June, 2022 - 9:08 am

      <p>Yes. In fact I worked on the equivalent text2speech and speech2text part of Windows back in 2008 and they’ve been in Windows longer than that.</p>

  • pherbie

    Premium Member
    15 June, 2022 - 7:45 am

    <p>This is actually BIG news, finally something that differentiates Windows 11. And a good thing too.</p>

Windows Intelligence In Your Inbox

Sign up for our new free newsletter to get three time-saving tips each Friday

"*" indicates required fields

This field is for validation purposes and should be left unchanged.

Thurrott © 2024 Thurrott LLC