Select Language

AI社区

公开数据集

相关搜索
您是不是在找?
今日排行
本周排行
本月排行
SComedy Earth and Nature,NLP,Text Data,Text Mining Classification
2.99M 367
大量波斯诗歌 NLP Classification
61.49M 333
NLTK路透社新闻文件,NLTK路透社料库中的所有文件 This dataset contains the ID, categories, and raw text from each file in NLTK's Reuters corpus.ContentEach file (row...NLP,Computer Science,News,Text Data,Text Mining Classification
3.3M 561
reddit自我发布分类任务,包含1000 多个精心挑选的类别 Welcome to the Reddit Self-Post Classification Task (RSPCT)!The aim of this dataset was to create an interesting, large...NLP,Classification,Computer Science,Multiclass Classification Classification
839.37M 267
医学成绩单,从mtsamples获取的医学转录数据 Medical data is extremely hard to find due to HIPAA privacy regulations. This dataset offers a solution by providing med...NLP,Health,Medicine Classification
16.22M 237
FakeNewsNet 假新闻研究数据收集,假新闻、虚假信息、数据挖掘 This is a repository for an ongoing data collection project for fake news research at ASU. We describe and compare FakeN...NLP,News,Social Science,Social Networks Classification
72.61M 939
reddit向量数据集,用于训练 sence2vec模型 Sence2vec word embeddings model works better than word2vec , since it utilises contextual information from words.This re...NLP,Computer Science,Text Data,spaCy Classification
635.76M 346
Strongbad邮件 Business,NLP,Text Data Classification
0.11M 231
科学流行评论删除 Business,NLP,Text Data,Binary Classification,Bigquery Classification
74.17M 227
Medium Articles 包含标记为AI、机器学习、数据科学或人工智能的帖子,以及用户信息 Medium taps into the brains of the world’s most insightful writers, thinkers, and storytellers to bring you the smartes...NLP,Text Data,Literature Classification
1.8G 294
实体提取从Pitchfork评论 Business,Arts and Entertainment,Music,Retail and Shopping,NLP,Popular Culture Classification
14.49M 548
Stack Overflow 2018 问题数据集 In this dataset, we explore StackOverflow questions and try to use unsupervised algorithms to extract tags, then train c...NLP,Earth and Nature,Computer Science,Multiclass Classification Classification
230.27M 330
圣诞节的食谱 Religion and Belief Systems,NLP,Cooking and Recipes,Holidays and Cultural Events Classification
2.51M 481
数以千计的关于爱情的问题,该数据集包含来自QA服务的爱情类问题和答案 ContextRUSSIAN LANGUAGEThis dataset collected from real answers to questions of the mail.ru service: https://otvet.mail....NLP,Education,Text Data,Languages Classification
176.23M 244
普莱诺斯总督 埃里总统2018年 NLP,Brazil Classification
16.5M 449
ACL论文选集,论文数据来自ACL选集 The Accepted paper's data from ACL Anthology. An abstract of a paper is extracted from arXiv if it exists.The data i...NLP,Education,Literature Classification
1.14M 233
电子邮件垃圾邮件 ContextSome emails from [Spam Assassin][1] to create models that can differentiate between spam and ham (non - spam) ema...NLP,Classification,Software,Email and Messaging Classification
12.08M 245
curationCorpus 策展料库 策展语料库汇集了 40,000 篇专业撰写的新闻文章摘要,并附有文章本身的链接。这个存储库提供了一个抓取工具来访问它们。如果您对...NLP Text
123.13M 428
ScanRefer数据:使用自然言在RGB-D扫描中进行3D对象定位 IntroductionWe introduce the task of 3D object localization in RGB-D scans using natural language descriptions.As input,...Others Classification
58.59M 567
阿拉伯手写数字 Please visit the official website of this dataset for details...OCR/Text Detection Text
0M 451