CN-Celeb: a large-scale speaker recognition dataset collected in the wild
This is a large-scale speaker recognition dataset collected 'in the wild'. The dataset consists of two subsets,...
Common Audio
29.66G
1576
openslr
1.4G
749
Samira klaylat
DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus
TIMIT (The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus), by Texas Instruments, the Massachusetts Institute of Technology, and SRI International...
Music Analysis Audio
812.64M
1080
robot
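University of Oxford VGG Condensed Movies Dataset
Researchers in the University of Oxford's VGG group created the Condensed Movies Dataset (CMD), which consists of key scenes from more than 3K movies: each key scene is accompanied by a high-level...
Music Analysis Audio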
6.24M
897
Max Bain
Free ST American English Corpus
This corpus was recorded in a silent indoor environment using a cellphone. It has 10 speakers. Each speaker has about 350 utterances. All utte...
Music Analysis Audio
351M
852
surfing.ai
Free ST Chinese Mandarin Corpus
This corpus was recorded in a silent indoor environment using a cellphone. It has 855 speakers. Each speaker has 120 utterances. All utteranc...
Music Analysis Audio
8.2G
1130
surfing.ai
Large Javanese ASR Training Dataset
This data set contains transcribed audio data for Javanese. The data set consists of wave files and a TSV file. The fil...
Music Analysis Audio
1.1G
898
Google, Inc.
High-Quality Gujarati (Female) Multi-Speaker Speech Dataset
This data set contains transcribed high-quality audio of Gujarati sentences recorded by volunteers. The data set consists...
Music Analysis Audio
917M
817
Google, Inc.
High-Quality Gujarati (Male) Multi-Speaker Speech Dataset
This data set contains transcribed high-quality audio of Gujarati sentences recorded by volunteers. The data set consists...
Music Analysis Audio
825M
823
Google, Inc.
Dataset of Female Recordings from the English Midlands
This data set contains transcribed high-quality audio of English sentences recorded by volunteers speaking different dial...
Music Analysis Audio
103M
860
Google, Inc.
Dataset of Irish Male Recordings
This data set contains transcribed high-quality audio of English sentences recorded by volunteers speaking different dial...
Music Analysis Audio
164M
790
Google, Inc.
Deeply Korean Read Speech Corpus
About this resource: Recording environment: studio apartment (moderate reverb), dance studio (high reverb), anechoic chambe...
Music Analysis Audio
281M
909
Deeply Inc
Kazakh Speech Corpus (KSC)
A crowdsourced open-source speech corpus for the Kazakh language. The KSC contains around 332 hours of transcribed audio...
Music Analysis Audio
19G
1459
NET
Thorsten Müller (German Emotional TTS Dataset)
I contribute my personal voice as a person believing in a world where all people are equal. No matter of gender, se...
Music Analysis Audio
399M
814
Thorsten Müller
Hi-Fi Multi-Speaker English TTS Dataset (Hi-Fi TTS)
A multi-speaker English dataset for training text-to-speech models. About this resource: Hi-Fi Multi-Speaker English TTS Dataset (Hi-Fi TTS) is a m...
Music Analysis Audio
41G
849
LibriVox
Nonverbal Vocalization Dataset
About this resource: Volume (full set): ~0.6 (~57) hours, ~800 (~70,000) utterances, ~500 (~1500) speakers. Format: 16kHz, 16-b...
Music Analysis Audio
43.7M
825
Deeply Inc
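LibriSpeech ASR Corpus Speech Data
The LibriSpeech ASR corpus is a speech dataset comprising 1000 hours of English speech with corresponding text. Identifier: SLR12. Summary: large-scale (1000 hours) read English...
NLP Audio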
8.14G
1381
Vassil Panayotov
Google AudioSet Audio Dataset
AudioSet contains 632 audio classes and 2,084,320 human-labeled 10-second sound clips (drawn from YouTube videos). The audio ontology...
NLP Audio
2.41G
1800
Google
419.81M
1047
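University of Pennsylvania
THUYG-20 Uyghur Speech Data
Summary: a free Uyghur language database released by CSLT @ Tsinghua University and Xinjiang University. Category: Speech. License: Apache License v.2.0. Introduction: THUYG-20 is from the speech and lang...
NLP Audio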
6.12G
1666
Tsinghua University