WCAG 1.2.2 Captions — Testing Guide for Mobile & Web Apps

WCAG 1.2.2, "Captions (Prerecorded)," is a foundational accessibility requirement focusing on providing synchronized captions for all pre-recorded audio content within media. This means any spoken dia

January 09, 2026 · 6 min read · WCAG Guides

Ensuring Captions for Pre-recorded Audio: A Practical Guide to WCAG 1.2.2

WCAG 1.2.2, "Captions (Prerecorded)," is a foundational accessibility requirement focusing on providing synchronized captions for all pre-recorded audio content within media. This means any spoken dialogue, important sound effects, or other audio cues in videos or audio-only files must have corresponding text displayed on screen. This is a Level A success criterion, meaning it's a basic requirement for making content accessible to a wide range of users.

What WCAG 1.2.2 Requires

In straightforward terms, if your application plays a video or audio file that has sound, you must provide captions. These captions should accurately represent everything that is said and any non-speech sounds that are crucial for understanding the content. This includes things like music if it's important to the narrative, or sound effects that convey an action. The captions must be synchronized with the audio, appearing on screen at the same time the sound is heard.

Why Captions Matter: Real User Impact

The necessity of captions extends far beyond a mere compliance checkbox. They are critical for individuals who are deaf or hard of hearing, enabling them to fully engage with and understand multimedia content. However, the benefits are far-reaching:

Consider the impact of a banking app's tutorial video on financial planning being inaccessible to a deaf user. They would miss crucial information, potentially leading to poor financial decisions. Similarly, a news app failing to caption a video report on a public health crisis excludes a significant portion of the population from vital information, a concern highlighted by regulations like the EU's European Accessibility Act (EAA) and the Americans with Disabilities Act (ADA) in the US.

Common Violations and Examples

Violations of WCAG 1.2.2 often occur due to oversight or a lack of understanding of its scope. Here are common scenarios:

Mobile Apps:

  1. Informational Videos with Dialogue: A mobile banking app features a short video explaining how to set up a new savings account. The video contains spoken instructions and visual cues. If no captions are present, deaf users cannot follow the setup process.
  2. Marketing Videos with Sound Effects: A retail app uses a promotional video showcasing a new product. The video includes dialogue and sound effects like a "ding" to highlight a feature. If the "ding" sound is crucial to understanding the benefit, its absence in captions creates a barrier.
  3. User-Generated Content (Less Common for strict prerecorded): While less common for *prerecorded* content, if a platform allows users to upload videos with spoken content and these are intended as part of the core app experience without moderation for captions, it can lead to violations.

Web Applications:

  1. Tutorials and Demonstrations: A web-based learning platform hosts video lectures or software demonstrations. If these videos include spoken explanations and no captions, students who are deaf or hard of hearing will struggle to learn the material.
  2. News Broadcasts or Documentaries: A news website embeds pre-recorded video segments of news reports or short documentaries. The spoken content and key sound effects must be captioned.
  3. Product Demonstrations on E-commerce Sites: An online store uses videos to showcase how a product works. If the narration or important audio cues are not captioned, potential customers with hearing impairments may not fully grasp the product's functionality.

Testing for WCAG 1.2.2 Compliance

A multi-pronged approach ensures robust compliance.

#### Manual Testing Steps

  1. Identify all pre-recorded media: Audit your application for any videos or audio files that play automatically or are initiated by user action.
  2. Play the media: Observe the audio content carefully. Listen to all spoken dialogue and significant non-speech sounds.
  3. Check for captions: Look for a caption track or a CC (Closed Captions) button.
  4. Enable captions: If captions are available, turn them on.
  5. Verify accuracy and synchronization:
  1. Test without audio: Mute your device's audio. Can you still understand the essential information conveyed by the video through the captions alone?

#### Automated Tools

While manual verification is crucial for nuance, automated tools can flag potential issues.

#### Mobile-Specific Considerations

Fixing Violations

The primary solution is to provide accurate, synchronized captions.

  1. Generate Captions:
  1. Create Caption Files: Generate caption files in standard formats like WebVTT (.vtt) for web and .srt for broader compatibility.
  2. Integrate Captions:
  1. Include Non-Speech Sounds: Do not forget to caption important sound effects or music cues that convey meaning.

How SUSA Checks for WCAG 1.2.2

SUSA (SUSATest) autonomously explores your application, mimicking diverse user behaviors, and identifies accessibility violations, including those related to WCAG 1.2.2.

By integrating SUSA into your QA process, you gain a powerful tool to proactively identify and address WCAG 1.2.2 violations, ensuring your applications are usable by a broader audience and compliant with accessibility standards.

Test Your App Autonomously

Upload your APK or URL. SUSA explores like 10 real users — finds bugs, accessibility violations, and security issues. No scripts.

Try SUSA Free