Select Language

AI社区

公开数据集

名称实体识别数据集 The label annotation mistakes by human annotators brings up two challenges to NER:mistakes in the test set can interfere...NLP Classification
5.64M 262
罗伯特·弗罗斯特系列 Arts and Entertainment,Education,NLP,Literature,Text Data,Transformers Classification
0.22M 565
BERT英语无冠词双冠词,BERT英语无上限训练数据的双谱图频率 Is BERT the right model to fine tune your data on? Or do you need to pretrain from scratch?Know your model's trainin...NLP,Music Classification
1.99G 257
阿拉伯文圣训九册 NLP,Multiclass Classification,Clustering Classification
94.48M 272
客户服务中的关系策略,来自四个来源的旅行相关客户服务数据集 Relational Strategies in Customer Service (RSiCS) DatasetHuman-computer data from three live customer service Intelligen...NLP,Business,Text Data Classification
57.78M 311
Virgool数据集,这是一套从virgool.io收集的波斯文章数据 This could be a nice tool for Persian writers or bloggers to automatically pick the suggested hashtag or even subject fo...NLP,Education,Software,Literature Classification
58.89M 312
荷兰新闻文章 Internet,News,NLP,Text Data,Exploratory Data Analysis,Text Mining Classification
351.62M 237
在大流行期间担任总统 Earth and Nature,Education,News,NLP,Data Visualization,Psychology Classification
0.25M 229
所有NeurIPS(NIPS)文件 Computer Science,Sports,NLP,Deep Learning,Artificial Intelligence,Neural Networks Classification
310.53M 212
所有英文停止字(700+;) Computer Science,Education,NLP,Feature Engineering,Python Classification
0.01M 224
Tanglish情绪分析推文,使用了4个标签来描述推特的情绪 So it all started when I was looking for Abusive Tamil tweets in the Roman Script to use for a project and instead of fi...NLP,Deep Learning,Online Communities,People Classification
0.85M 278
汉语停止词 Earth and Nature,NLP Classification
0.03M 218
GENIA生物医学事件数据集 ContextBio-medical texts have a lot of information which can be used for developments in the medical field. Traditionall...NLP,Biology,Text Mining,Medicine Classification
2.67M 539
COVID19相关常见问题,此数据包含与新冠肺炎相关的问答集19 What is this?This data contains collection of question and answers related to COVID19.Where does this come from?Thi...NLP,Health,Coronavirus,Psychology,Diseases Classification
0.1M 273
科研论文主题建模 Business,Earth and Nature,Education,NLP,Psychology Classification
21.96M 255
疾病 Health,Health Conditions,NLP,Russia Classification
4.47M 194
COVID假新闻数据集 Health,News,Coronavirus,NLP Classification
1.06M 229
NLP:报告和新闻分类 Social Science,Investing,NLP,Literature,Environment,Binary Classification,Multilabel Classification,Water Bodies Classification
0.03M 220
俄罗斯有毒评论 Internet,Social Networks,NLP,Text Data Classification
37.45M 206
测试用例数据集,软件测试中使用的样本数据集的集合 There are lots of datasets available for different machine learning tasks like NLP, Computer vision etc. However I could...NLP,Deep Learning,Earth and Nature Classification
1.3M 350