AI Community

Public Datasets

10484.2M 194
Turian embeddings. Computer Science, Programming, NLP Classification
1246.83M 185
Vegetables (Google Word2Vec Freebase). Food, NLP Classification
4872.67M 200
Vegetables (Google Word2Vec Freebase, .tsv file)... NLP, Computer Science, Food Classification
20G 706
Tweet files for gender guessing. Online Communities, Social Science, Social Networks, NLP, Binary Classification, Gender Classification
5.74M 466
DARPA TIMIT Acoustic-Phonetic Continuous Speech. DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus. Special thanks: https://github.com/philipperemy/timit/edit/master/README.md. Download... NLP, Audio Data Classification
1198.1M 238
Vegetables (SENNA embeddings). Banner and logo photo credit: NeONBRAND on Unsplash... NLP, Earth and Nature, Computer Science Classification
129M 199
Third Eye Data: TV News Archive. Arts and Entertainment, Health, News, NLP Classification
214.53M 165
Books of Friedrich Nietzsche. NLP, Literature, Linguistics, Philosophy Classification
1.7M 197
Hearthstone. Translation of Hearthstone card names and descriptions. Languages: German, English, Spanish, French, Italian, Japanese, K... NLP, Arts and Entertainment, Video Games, Games, Comics and Animation, Card Games Classification
54.7M 488
Medium article titles. Arts and Entertainment, Computer Science, NLP Classification
0.01M 189
Wikipedia articles dataset. wikipedia fr 2008, dump of Wikipedia... NLP Classification
2.12G 304
134.5M 609
Cleaned tox bias. Cleaned-up version of the toxicity with bias data set... NLP, Data Cleaning, Health Classification
535.39M 680
Russian Telegram chats history. Data parsed from the most popular public Russian Telegram chats... NLP, Text Data, Russia Classification
11.08G 351
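Sentiment analysis in natural language processing. Dataset: This dataset was created by NowYSM. Released under Database: Open Database, Contents: Database Contents. Contents: It contains the following files:... NLP, Arts and Entertainment Classification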
2.52M 350
NLP data. Dataset: This dataset was created by AbiyuG. Released under CC BY-NC-SA 4.0. Contents: It contains the following files:... NLP, Psychology Classification
3.14M 331
Star Trek Scripts. Raw text scripts and processed lines of all Star Trek series scripts. Data scraped from http://www.chakoteya.net/StarTrek/index.html. Code here: https://github.... NLP, Movies and TV Shows, Text Data, Text Mining Classification
42.63M 322
Toxic embeddings. Universal Sentence Encoder encodings of the text from the Jigsaw Toxic Comment challenge. There's no need for everyone to encode the same text with the Universal Sentence Embedding. This data set contains the... NLP, Deep Learning, Earth and Nature Classification
610.81M 622
CoNLL003. This is an annotated dataset for the Named Entity Recognition (NER) problem. Content: This dataset is divided into train.txt, te... NLP, Arts and Entertainment, Computer Science, Text Data, Games, Text Mining Classification
4.63M 498