Datasets
Audio
Sports Commentary
Recorded over live and replayed sporting events, sports commentary captures play-by-play narration, color analysis, crowd reactions, and the conversational rhythm that frames athletic action for an audience. The form spans solo announcers, two- and three-person booths, and post-game studio talk across major and niche sports.
Hours1.1M+
Sports30+
Languages20+
Training Use Cases
✓Speech recognition for live sports broadcast captioning
✓Text-to-speech voice cloning for sports narration
✓Speaker diarization and turn-taking
✓Sports event detection from audio cues
Key Highlights
✓30+ sports represented including football, basketball, baseball, soccer, MMA, tennis, golf, motorsports, and esports
✓20+ languages of commentary captured across regional and international broadcasts
✓Solo, two-person, and three-person commentary booths covered alongside studio analysis
✓Live in-game calls preserved alongside pre-game previews and post-game breakdowns
Metadata Fields
durationLength of recording in HH:MM:SS
sample_rateAudio sample rate (e.g., 44.1kHz, 48kHz)
channelsmono | stereo
languagePrimary spoken language (ISO 639-1 code)
primary_categoryDominant content category assigned to a recording
commentary_typeplay_by_play | color_commentary | studio_analysis | post_game_breakdown
sportfootball | basketball | baseball | soccer | mma | tennis | golf | motorsports | esports | other
booth_sizesolo | two_person | three_person
live_or_recordedlive | recorded