Select Language

AI社区

公开数据集

中国古代文字(文言文) Business,NLP,Text Data,Text Mining Classification
1572.11M 704
亚马逊Alexa评论 NLP Classification
0.49M 535
arxiv数据集,过去18个月的存档数据集 arxiv dataset arxiv dataset for the past 18 months...NLP Classification
94.28M 581
专利摘要 Computer Science,Law,NLP,Deep Learning,LSTM,RNN Classification
3.2M 288
假新闻检测 News,NLP,Text Data Classification
118.08M 499
智能非索引字表 NLP,Text Mining Classification
0M 235
塞思·戈丁的博客数据集 Business,Internet,Online Communities,NLP,Literature,Text Data,Text Mining,Marketing Classification
16.49M 271
恢复数据集 Computer Science,NLP,Text Data Classification
11.81M 220
巴西新闻 News,Finance,NLP,Brazil Classification
1102.35M 536
CORD-19完整索引,在完整CORD-19数据集上嵌入索引 Sentence embeddings index over full CORD-19 dataset. Includes both COVID-19 and non-COVID-19 tagged literature on Corona...NLP,Computer Science,Coronavirus Classification
7.61G 226
推特预测灾难 NLP,Classification,Text Data,Geospatial Analysis,Binary Classification Classification
1.34M 515
零售交易[于2020年7月17日发布] Online Communities,Retail and Shopping,NLP,Data Visualization,Tabular Data,Data Cleaning Classification
1.3M 236
OSCAR尼泊尔语语料库,尼泊尔语文本语料库,用于训练NLP的无监督语言模型 The files are from [OSCAR Corpus](https://oscar-corpus.com/). Please visit their site for more information.The dataset i...NLP,Computer Science,Movies and TV Shows,Text Data,Languages Classification
3.1G 326
乌克兰语词汇描述 Earth and Nature,Education,NLP,Classification,Text Data Classification
0.04M 521
矛盾的,我亲爱的华生翻译的文本 NLP,Classification,Text Data Classification
9.47M 218
JigSaw有毒评论分类清理数据,竖锯评论,带感情,评论长度和翻译文本 I've been working on the JigSaw Multilingual Toxic Comment classification competition and found that the data requir...NLP,Deep Learning,Feature Engineering,Text Data Classification
263.44M 347
COVID19法语推特 Coronavirus,NLP,Text Mining Classification
47.18M 236
古腾堡档案 Education,NLP,Literature Classification
6.76M 175
罗伯塔大 Computer Science,Exercise,NLP Classification
1361.18M 223
Reddit印度NLP数据集,数据集包括2017-2020年从R/India子版块的帖子 [![forthebadge made-with-python](http://ForTheBadge.com/images/badges/made-with-python.svg)](https://www.python.org/) [!...NLP,Classification,Multiclass Classification,India Classification
117.86M 266