Datasets
Audio

Sports Commentary

Recorded over live and replayed sporting events, sports commentary captures play-by-play narration, color analysis, crowd reactions, and the conversational rhythm that frames athletic action for an audience. The form spans solo announcers, two- and three-person booths, and post-game studio talk across major and niche sports.

Hours1.1M+
Sports30+
Languages20+

Training Use Cases

Speech recognition for live sports broadcast captioning
Text-to-speech voice cloning for sports narration
Speaker diarization and turn-taking
Sports event detection from audio cues
Key Highlights
30+ sports represented including football, basketball, baseball, soccer, MMA, tennis, golf, motorsports, and esports
20+ languages of commentary captured across regional and international broadcasts
Solo, two-person, and three-person commentary booths covered alongside studio analysis
Live in-game calls preserved alongside pre-game previews and post-game breakdowns

Metadata Fields

durationLength of recording in HH:MM:SS
sample_rateAudio sample rate (e.g., 44.1kHz, 48kHz)
channelsmono | stereo
languagePrimary spoken language (ISO 639-1 code)
primary_categoryDominant content category assigned to a recording
commentary_typeplay_by_play | color_commentary | studio_analysis | post_game_breakdown
sportfootball | basketball | baseball | soccer | mma | tennis | golf | motorsports | esports | other
booth_sizesolo | two_person | three_person
live_or_recordedlive | recorded