AI Community

Public Datasets

Janatahack: Independence Day Earth and Nature, Internet, Education, Sports, NLP, Beginner Classification
451.85M 261
Product Classification NLP, Multiclass Classification Classification
1.04M 593
Rich Data NLP, Text Mining Classification
25.32M 286
1 Million Submissions Internet, Online Communities, Social Networks, NLP, Popular Culture Classification
700.01M 246
Classic English Literature Corpus with Metadata: classic English books and their authors. A dataset of classic readings in English, in some cases translated into English from other languages. Dickens, Plato, Sha... NLP, Arts and Entertainment, Literature Classification
431.55M 313
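WikiText Long Term Dependency Language Modeling Dataset: The WikiText language modeling dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia. Compared with the Penn Treebank (PTB)... NLP, Deep Learning, Text Data Classification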
1.11G 256
Ancient Chinese Text (Classical Chinese) Business, NLP, Text Data, Text Mining Classification
1572.11M 740
Amazon Alexa Reviews NLP Classification
0.49M 560
arXiv Dataset: arXiv dataset for the past 18 months... NLP Classification
94.28M 604
Patent Abstracts Computer Science, Law, NLP, Deep Learning, LSTM, RNN Classification
3.2M 294
Fake News Detection News, NLP, Text Data Classification
118.08M 526
SMART Stop Word List NLP, Text Mining Classification
0M 246
Seth Godin's Blog Dataset Business, Internet, Online Communities, NLP, Literature, Text Data, Text Mining, Marketing Classification
16.49M 282
Resume Dataset Computer Science, NLP, Text Data Classification
11.81M 226
Brazilian News News, Finance, NLP, Brazil Classification
1102.35M 565
CORD-19 Full Index: Sentence embeddings index over the full CORD-19 dataset. Includes both COVID-19 and non-COVID-19 tagged literature on Corona... NLP, Computer Science, Coronavirus Classification
7.61G 229
Disaster Prediction from Tweets NLP, Classification, Text Data, Geospatial Analysis, Binary Classification Classification
1.34M 540
Retail Transactions [released July 17, 2020] Online Communities, Retail and Shopping, NLP, Data Visualization, Tabular Data, Data Cleaning Classification
1.3M 250
OSCAR Nepali Corpus: Nepali text corpus for training unsupervised language models for NLP. The files are from [OSCAR Corpus](https://oscar-corpus.com/). Please visit their site for more information. The dataset i... NLP, Computer Science, Movies and TV Shows, Text Data, Languages Classification
3.1G 339
Ukrainian Vocabulary Descriptions Earth and Nature, Education, NLP, Classification, Text Data Classification
0.04M 547