Select Language

AI社区

公开数据集

CN-Celeb 一个室外收集的大规模说话人识别数据集

CN-Celeb 一个室外收集的大规模说话人识别数据集

29.66G
1453 浏览
0 喜欢
5 次下载
0 条讨论
Common Audio

This is a large-scale speaker recognition dataset collected 'in the wild'. The dataset consists of two subsets,......

数据结构 ? 29.66G

    README.md

    This is a large-scale speaker recognition dataset collected 'in the wild'. The dataset consists of two subsets, CN-Celeb1 and CN-Celeb2. All the audio files are coded as single channel and sampled at 16kHz with 16-bit precision. For CN-Celeb1, it contains more than 130,000 utterances from 1,000 Chinese celebrities, and covers 11 different genres in real world. For CN-Celeb2, it contains more than 520,000 utterances from 2,000 Chinese celebrities, and covers 11 different genres in real world. The data collection process was organized by the Center for Speech and Language Technologies, Tsinghua University. It was also funded by the National Natural Science Foundation of China No. 61633013, and the Postdoctoral Science Foundation of China No. 2018M640133. You can cite the data using the following BibTeX entry:

    @misc{fan2019cnceleb,
      title={CN-CELEB: a challenging Chinese speaker recognition dataset},
      author={Yue Fan and Jiawen Kang and Lantian Li and Kaicheng Li and Haolin Chen and Sitong Cheng and Pengyuan Zhang and Ziya Zhou and Yunqi Cai and Dong Wang},
      year={2019},
      eprint={1911.01799},
      archivePrefix={arXiv},
      primaryClass={eess.AS}
     }

    PEOPLE

    Dong Wang, Yue Fan, Hao Cui, Jiawen Kang, Lantian Li, Kaicheng Li, Haolin Chen, Sitong Cheng, Pengyuan Zhang, Ziya Zhou, Yunqi Cai

    ConTACTOR

    Address: ROOM 1-303, BLDG FIT, CSLT, Tsinghua University

    Homepage: http://cslt.org or http://cslt.riit.tsinghua.edu.cn

    暂无相关内容。
    暂无相关内容。
    • 分享你的想法
    去分享你的想法~~

    全部内容

      欢迎交流分享
      开始分享您的观点和意见,和大家一起交流分享.
    所需积分:30 去赚积分?
    • 1453浏览
    • 5下载
    • 0点赞
    • 收藏
    • 分享