Skip to main content

Welcome to our new website! The content and organization have been heavily redone and we want to hear from you! 
Submit feedback

Video Captioning Guide

Creating accurate, synchronized captions for all video content.

🎬 Required for all UA video content

Why captions are required

Captions are required for all university video content under Title II ADA, Section 504, and UA policy. Beyond compliance, captions benefit:

  • Deaf and hard of hearing viewers — Primary accessibility need
  • Non-native English speakers — Supports language comprehension
  • Students in noisy or quiet environments — Libraries, public transit
  • Viewers with cognitive disabilities — Reinforces audio with text
  • Search and findability — Captions are indexed and searchable

Research shows: 80% of people who use captions are not deaf or hard of hearing (3Play Media, 2019). Studies show captions improve comprehension and retention for all viewers.

Types of captions

TypeDescriptionWhen to use
Closed captions (CC)Can be turned on/off by viewerStandard for most content
Open captionsAlways visible, burned into videoSocial media, signage displays
SubtitlesTranslation for different languagesMultilingual content
SDH (Subtitles for Deaf/HoH)Includes non-speech audio descriptionsWhen sound effects matter

Caption quality standards

Captions must meet WCAG 2.1 AA requirements (Success Criterion 1.2.2). Quality captions are:

Accurate

  • Verbatim speech (not paraphrased)
  • Correct spelling of names and technical terms
  • Proper grammar and punctuation
  • Speaker identification when multiple speakers
  • Target: 99% accuracy

Synchronized

  • Captions appear within 1 second of speech
  • Duration matches speech timing
  • Line breaks at natural pause points

Complete

  • All spoken content included
  • Relevant non-speech sounds described: [music], [applause], [phone rings]
  • Speaker changes indicated
  • No missing or dropped words

Readable

  • 2-3 lines maximum per caption frame
  • 32-42 characters per line
  • Sufficient display time to read (minimum 1 second per frame)
  • Proper line breaks (don't split phrases awkwardly)

Captioning workflow at UA

Option 1: Panopto (recommended for courses)

UA's video platform with integrated captioning:

  1. Upload or record video in Panopto
  2. Enable ASR: Settings → Captions → Enable automatic captions
  3. Wait for processing (usually 1-2 hours for ASR)
  4. Edit captions: Click "Captions" in the video editor
  5. Review and correct all auto-generated text
  6. Publish when captions are accurate

Full Panopto workflow guide

Option 2: YouTube Studio

  1. Upload video to YouTube
  2. Go to Subtitles in YouTube Studio
  3. Wait for auto-generated captions
  4. Click Edit to review and correct
  5. Publish captions

Option 3: Professional captioning services

For high-volume or high-stakes content, use professional services:

ServiceTurnaroundCost estimateBest for
3Play Media1-4 business days$2-4/minuteHigh accuracy, large volumes
Rev12-24 hours$1.50-2/minuteFast turnaround
VerbitSame day$2-5/minuteLive events, lectures

Contact accessibility@arizona.edu for department purchasing guidance.

Option 4: Manual captioning

For short videos or when other options aren't available:

  • Amara: amara.org — Free web-based caption editor
  • YouTube caption editor — Built into YouTube Studio
  • Subtitle Edit: Free desktop software for Windows

Caption editing best practices

Common auto-caption errors to fix

  • Names and proper nouns: "Dr. Smith" not "doctor smith"
  • Technical terms: "WCAG" not "W cag" or "double-u cag"
  • Homophones: "their/there/they're" — check context
  • Numbers: Verify dates, statistics, phone numbers
  • Acronyms: "UA" not "you a"

Formatting conventions

  • Speaker identification: Use [Professor]: or >> Professor: for multiple speakers
  • Sound effects: [music], [applause], [inaudible]
  • Emphasis: Use italics for emphasis or foreign words
  • Numbers: Spell out one through ten; use digits for 11+

Line breaking

Good:

The accessibility guidelines
require captions on all videos.

Poor:

The accessibility guidelines require
captions on all videos.

Break at natural phrase boundaries, not in the middle of thoughts.

Live event captioning

For live events, webinars, and synchronous classes:

Zoom automatic captions

  • Enable in meeting settings
  • Quality varies (70-85% accuracy)
  • Suitable for informal meetings
  • Not sufficient for formal accommodations

Professional CART (Communication Access Realtime Translation)

  • Human captioner provides 98%+ accuracy
  • Required for formal accommodations
  • Schedule through DRC for student accommodations
  • Book 2+ weeks in advance for events

Contact uadrc@arizona.edu to arrange CART services.

Transcripts

In addition to captions, provide downloadable transcripts for:

  • Audio-only content (podcasts)
  • Long-form videos
  • Content where users may want to search or reference

Transcript best practices

  • Include speaker names
  • Add timestamps for long content
  • Use headings for sections/topics
  • Note significant non-speech audio
  • Provide in accessible format (HTML or accessible Word/PDF)

Audio description

Audio description narrates important visual information for blind and low-vision viewers. Required when:

  • Visual content conveys information not in the audio track
  • Demonstrations, charts, or on-screen text
  • Videos where action is important to understanding

When audio description is needed

Content typeAudio description needed?
Talking head lectureUsually no (if slides are described verbally)
Software demoYes (describe on-screen actions)
Lab demonstrationYes (describe physical actions)
Chart or graph shownYes (describe data and trends)
InterviewUsually no (unless visual context matters)

Providing audio description

  • Integrate into script: Narrate what you're showing as you present
  • Extended description: Pause video to describe complex visuals
  • Separate track: Create alternative version with AD
  • Text description: Provide written description of visual content

Caption quality checklist

  • check_box_outline_blank All speech is captioned verbatim
  • check_box_outline_blank Speaker identification included
  • check_box_outline_blank Names and technical terms spelled correctly
  • check_box_outline_blank Non-speech sounds described [music], [applause]
  • check_box_outline_blank Captions synchronized with audio
  • check_box_outline_blank Line breaks at natural phrases
  • check_box_outline_blank 99%+ accuracy achieved
  • check_box_outline_blank Captions display long enough to read

Resources