Select Language

AI社区

公开数据集

相关搜索
您是不是在找?
今日排行
本周排行
本月排行
WMT 18翻译任务 Deep Learning,Computer Science Classification
15743.1M 208
你好worldgit Others Classification
8.92M 36
亚马逊w2Vec Others Classification
3475.47M 60
12241.4M 235
Twitter评论:在线食品订购服务 Business,Linguistics Classification
7.77M 257
层压板wilsonart Others Classification
10.07M 233
CPE的Twitter数据 Email and Messaging Classification
0.14M 238
Erowid经验报告word2vec向量 Online Communities,NLP,Psychology,Clustering Classification
19.95M 279
WebMD药物评论数据集,各种药物的用户评论数据集 The dataset provides user reviews on specific drugs along with related conditions, side effects, age, sex, and ratings r...NLP,Computer Science,Education,Tabular Data,Drugs and Medications Classification
168.58M 323
CC-100 卡纳达语单语言数据集:来自Web爬网数据的1300万条单语言数据集 This monolingual dataset includes roughly 13 million uncleaned Kannada sentences crawled from numerous websites....NLP,Text Data,Languages Classification
3.51G 348
covid19 西班牙语 es py tweets 早 2020年4月底 Earth and Nature,Health,Social Networks,Coronavirus,NLP,Text Data Classification
805.29M 650
来自wallstreetbets等的Subreddit数据,用于后验量化交易算法的情绪分析 All of the submissions to each of the r/wallstreetbets, r/investing, r/options, and r/SecurityAnalysis subreddits since...NLP,Online Communities,Investing Classification
1.49G 301
泰米尔二进制分类1K tweets标签V1 NLP,Classification Classification
0.38M 293
Stackoverflow问题分类挑战 ContextAsking questions is a part of learning. There's no shame in not knowing something and coming to others for he...NLP Classification
6.37M 674
有毒评论检测多语言[扩展],Jigsaw有毒通信网络分类竞赛的补充 This is a compilation of all the toxic comment databases out there. I made this for ease of use during the Jigsaw Toxic...NLP,Deep Learning,Classification,Binary Classification Classification
117.55M 298
JigSaw有毒评论分类清理数据,竖锯评论,带感情,评论长度和翻译文本 I've been working on the JigSaw Multilingual Toxic Comment classification competition and found that the data requir...NLP,Deep Learning,Feature Engineering,Text Data Classification
263.44M 394
冠状病毒tweets NLP 文本分类 Earth and Nature,Health,NLP,Text Data Classification
10.97M 560
WikiText长期依赖性语言建模数据集 WikiText语言建模数据集是从维基百科上一组经过验证的好文章和特色文章中提取的超过1亿个令牌的集合。与宾夕法尼亚树库(PTB)的...NLP,Deep Learning,Text Data Classification
1.11G 287
神经链接Tweets Business,Online Communities,News,NLP,Artificial Intelligence Classification
1.08M 278
Whatsapp状态的情感数据 Email and Messaging,NLP,Deep Learning,spaCy Classification
2.49M 332