用于语音克隆的英语多说话人语料库 CSTR-VCTK语料库
This CSTR VCTK Corpus includes speech data uttered by 109 native speakers of English with various accents. Each speaker...NLP,Audio Data Classification
15.22G
455
Michael Fekadu
SAVEE 数据库 用于情感识别系统的语音情感注释数据
The SAVEE database was recorded from four native English male speakers (identified as DC, JE, JK, KL), postgraduate stud...NLP,Business,Social Science Classification
162.57M
519
Tarun Sunkaraneni
VGG-Sound 大规模视听数据集
VGG-Sound是一个视听对应数据集,由从上传到YouTube的视频中提取的音频短片组成.Citation@InProceedings{Chen20,author=HonglieC...Music Analysis,Others Audio
21.7M
1625
Honglie Chen