AI Community

Public Datasets

10484.2M 267
Turian Embeddings. Computer Science, Programming, NLP Classification
1246.83M 235
Vegetables (Google Word2Vec Freebase). Food, NLP Classification
4872.67M 258
Vegetables (Google Word2Vec Freebase, .tsv file)... NLP, Computer Science, Food Classification
20G 767
Tweet Files for Gender Guessing. Online Communities, Social Science, Social Networks, NLP, Binary Classification, Gender Classification
5.74M 605
DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus. Special thanks: https://github.com/philipperemy/timit/edit/master/README.md. Download... NLP, Audio Data Classification
1198.1M 321
Vegetables (SENNA Embeddings). Acknowledgements: banner and logo photo by NeONBRAND on Unsplash... NLP, Earth and Nature, Computer Science Classification
129M 231
Third Eye Data: TV News Archive. Arts and Entertainment, Health, News, NLP Classification
214.53M 234
Books by Friedrich Nietzsche. NLP, Literature, Linguistics, Philosophy Classification
1.7M 268
Hearthstone. Translation of Hearthstone card names and descriptions. Languages: German, English, Spanish, French, Italian, Japanese, K... NLP, Arts and Entertainment, Video Games, Games, Comics and Animation, Card Games Classification
54.7M 651
Medium Article Titles. Arts and Entertainment, Computer Science, NLP Classification
0.01M 278
Wikipedia Articles Dataset. wikipedia fr 2008, dump of wikipedia... NLP Classification
2.12G 389
134.5M 745
cleaned tox bias. Cleaned-up version of the toxicity with bias data set... NLP, Data Cleaning, Health Classification
535.39M 823
Russian Telegram chats history. Data parsed from the most popular public Russian Telegram chats... NLP, Text Data, Russia Classification
11.08G 431
Sentiment Analysis in Natural Language Processing. This dataset was created by NowYSM under Database: Open Database, Contents: Database Contents. It contains the following files:... NLP, Arts and Entertainment Classification
2.52M 440
NLP Data. This dataset was created by AbiyuG. Released under CC BY-NC-SA 4.0. It contains the following files:... NLP, Psychology Classification
3.14M 408
Star Trek Scripts. Raw text scripts and processed lines of all Star Trek series scripts. Data scraped from http://www.chakoteya.net/StarTrek/index.html. Code here: https://github.... NLP, Movies and TV Shows, Text Data, Text Mining Classification
42.63M 426
Toxic Embeddings. Universal Sentence Encoded text from the Jigsaw Toxic Comment challenge. There's no need for everyone to encode the same text with the Universal Sentence Embedding. This data set contains the... NLP, Deep Learning, Earth and Nature Classification
610.81M 824
CoNLL003. This is an annotated dataset for the Named Entity Recognition (NER) problem. Content: This dataset is divided into train.txt, te... NLP, Arts and Entertainment, Computer Science, Text Data, Games, Text Mining Classification
4.63M 657