Public Datasets

Plain-text Wikipedia; each file contains a collection of Wikipedia articles. Wikipedia dumps contain a tremendous amount of markup. WikiMedia Text is a hybrid of markdown and HTML, making it very d... NLP,Computer Science,Text Data,Text Mining Classification
23.71G 263
German news dataset Computer Science,Internet,Education,Software,News,NLP Classification
726.72M 261
Wine reviews written by wine tasters, using text classification to identify the reviewer of each review. Thinking of Natural Language Processing as a beginner!! The dataset has been about the wine comments or reviews that has... NLP,Business,News,Text Data,Multiclass Classification,Alcohol Classification
50.35M 575
Arabic news articles from Aljazeera.net Business,Education,News,NLP,Text Data,Psychology,Text Mining Classification
111.89M 521
Online food delivery preferences in the Bangalore region Business,Food,NLP,Text Data,Geospatial Analysis,Jobs and Career Classification
0.23M 233
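Multimodal hate speech: 150,000 tweets with text and images, for hate detection. Existing hate speech datasets contain only text data. We created a new manually annotated multimodal hate speech dataset consisting of 150,000 tweets, each... NLP,Online Communities,Image Data,Multiclass Classification,Social Networks Classification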
6.55G 608
Collection of tags from 202 StackExchange sites. This data is extracted from StackExchange for over 200+ websites under the Umbrella. This data consists of all possible... NLP,Business,Online Communities,Text Data Classification
16.75M 244
Names Email and Messaging,NLP,Deep Learning,LSTM Classification
0.16M 203
Short story corpus: the collected short stories of Edgar Allan Poe. Content: The present data set includes the full corpus of 69 Edgar Allan Poe's short stories in tabular format. In add... NLP,Text Data,Literature,Text Mining Classification
1.86M 550
Email classification NLP Business,Computer Science,Internet,Email and Messaging,NLP Classification
0.1M 223
Arabic RT news headlines 20200419 News,NLP,Text Data,Languages Classification
88.17M 214
Huge Reddit data Online Communities,Social Networks,NLP,Basketball Classification
38.72M 464
CC-100 Kannada dataset: a monolingual dataset of 13 million sentences from web crawl data. This monolingual dataset includes roughly 13 million uncleaned Kannada sentences crawled from numerous websites.... NLP,Text Data,Languages Classification
3.51G 314
Emotions in text: text data in which each sentence expresses a primary emotion. I was looking for a well labeled dataset to perform a multiclass classification. I wanted to do something more than just... NLP,Earth and Nature,Text Data,Multiclass Classification Classification
2.15M 282
Quaternary 2.0 NLP,Deep Learning,Brazil Classification
74.9M 253
Kennedy university speech NLP,Text Data,Websites Classification
7.5M 190
OZON product categories Business,NLP,Text Data,Multiclass Classification,Marketing Classification
181.16M 229
Chatbot corpus for intent recognition, from AskUbuntu. Context: 190 questions and answers from https://askubuntu.com. Content: What's inside is more than just rows and columns... NLP,Artificial Intelligence Classification
0.23M 497
Siyam Rupali Bangla font NLP,International Relations Classification
0.38M 231
Team dataset (csv) Transportation,NLP Classification
72.43M 243