Public Datasets

WMT 18 translation task Deep Learning,Computer Science Classification
15743.1M 175
Hello worldgit Others Classification
8.92M 32
Amazon w2Vec Others Classification
3475.47M 41
12241.4M 191
Twitter comments: online food ordering services Business,Linguistics Classification
7.77M 219
Laminate wilsonart Others Classification
10.07M 213
CPE Twitter data Email and Messaging Classification
0.14M 211
Erowid experience reports word2vec vectors Online Communities,NLP,Psychology,Clustering Classification
19.95M 248
WebMD drug reviews dataset: user reviews of various drugs The dataset provides user reviews on specific drugs along with related conditions, side effects, age, sex, and ratings r... NLP,Computer Science,Education,Tabular Data,Drugs and Medications Classification
168.58M 276
CC-100 Kannada monolingual dataset: 13 million monolingual sentences from web crawl data This monolingual dataset includes roughly 13 million uncleaned Kannada sentences crawled from numerous websites.... NLP,Text Data,Languages Classification
3.51G 316
covid19 Spanish es py tweets, early to end of April 2020 Earth and Nature,Health,Social Networks,Coronavirus,NLP,Text Data Classification
805.29M 540
Subreddit data from wallstreetbets and others, for sentiment analysis in backtesting quantitative trading algorithms All of the submissions to each of the r/wallstreetbets, r/investing, r/options, and r/SecurityAnalysis subreddits since... NLP,Online Communities,Investing Classification
1.49G 264
Tamil binary classification 1K tweets labeled V1 NLP,Classification Classification
0.38M 267
Stackoverflow question classification challenge Context: Asking questions is a part of learning. There's no shame in not knowing something and coming to others for he... NLP Classification
6.37M 588
Toxic comment detection, multilingual [extended]: a supplement to the Jigsaw toxic comment classification competition This is a compilation of all the toxic comment databases out there. I made this for ease of use during the Jigsaw Toxic... NLP,Deep Learning,Classification,Binary Classification Classification
117.55M 266
JigSaw toxic comment classification cleaned data: Jigsaw comments with sentiment, comment length, and translated text I've been working on the JigSaw Multilingual Toxic Comment classification competition and found that the data requir... NLP,Deep Learning,Feature Engineering,Text Data Classification
263.44M 350
Coronavirus tweets NLP text classification Earth and Nature,Health,NLP,Text Data Classification
10.97M 475
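WikiText long-term dependency language modeling dataset The WikiText language modeling dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia. Compared with the Penn Treebank (PTB)... NLP,Deep Learning,Text Data Classification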
1.11G 245
Neuralink Tweets Business,Online Communities,News,NLP,Artificial Intelligence Classification
1.08M 237
Sentiment data from Whatsapp statuses Email and Messaging,NLP,Deep Learning,spaCy Classification
2.49M 288