Troveo AI: Training ready video data for the world's top AI labs

Browse all datasets

Multichannel Audio

Multi-Speaker Podcasts

A large-scale collection of natural, multi-speaker podcast conversations spanning diverse topics, accents, and languages. Sourced from real-world recordings to reflect authentic human dialogue.

500K+
500K+
500K+
Hours
Hours
Hours
An extensive audio corpus providing the volume and variety needed to train robust speech recognition, diarization, and language models at scale.
450K+
450K+
450K+
Conversations Available
Conversations Available
Conversations Available
Each entry is a distinct, real-world podcast conversation featuring multiple speakers in natural, unscripted dialogue.
15+
15+
15+
Languages
Languages
Languages
Spanning a broad range of languages and regional dialects, from Mandarin and Hindi to European and Latin American varieties, enabling truly multilingual model development.
30+
30+
30+
Accents & Dialects
Accents & Dialects
Accents & Dialects
Featuring a wide spectrum of regional accents and dialects within each language, ensuring models trained on this data generalize across real-world speaker variation.

Singaporean Accents

English

0:000:00

Metadata

Number of Speakers

Primary Category

Podcasts

Quality Score

Split Audio Channels

Yes

Samples

Talk to sales

Request full dataset

College Football Analysis

English

60 secs

2 speakers

German Language Dialects

German (Austrian)

60 secs

2 speakers

Formula 1 Racing

English

60 secs

2 speakers

Irish Culture

English

60 secs

4 speakers

Singaporean Accents

English

30 secs

11 speakers

Wealth And Privacy

English

60 secs

2 speakers

Chile, Travel, and Language Learning

Spanish

60 secs

2 speakers

Audio Equipment Manufacturing

Hindi

60 secs

3 speakers

Learning Chinese As A Foreigner

Chinese (Mandarin)

60 secs

2 speakers

Skyrim Gameplay

Polish

60 secs

2 speakers

Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology

Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology

Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology
Content Overview
Dataset Coverage
This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.
Region
North American
European
MENA
Global
Topic
Sports
News
Business
Technology

Content Overview

Dataset Coverage

This dataset contains multi-speaker podcast audio capturing natural, unscripted conversations across a wide range of topics, languages, and accents, enabling robust speaker diarization, accent-aware speech recognition, and conversational language modelling across real-world dialogue conditions.

Region

North American

European

MENA

Global

Topic

Sports

News

Business

Technology

Explore more datasets

Multi-Speaker Podcasts

500K+

500K+

500K+

Hours

Hours

Hours

450K+

450K+

450K+

Conversations Available

Conversations Available

Conversations Available

15+

15+

15+

Languages

Languages

Languages

30+

30+

30+

Accents & Dialects

Accents & Dialects

Accents & Dialects

Singaporean Accents

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Dataset Coverage

Animation

Media

Broadcast

Media

Core Library

All Categories

Gaming Commentary

Gaming

HDR

Specialized Formats

Multi-Speaker Podcasts

Multichannel Audio

Synchronized Multi-Camera

Specialized Formats

Talking Head

Media

Task Demonstration

Daily Living