Public Datasets

WMT 18 Translation Task Deep Learning, Computer Science Classification
15743.1M 157
Hello worldgit Others Classification
8.92M 24
Amazon w2Vec Others Classification
3475.47M 33
12241.4M 176
Twitter Comments: Online Food Ordering Services Business, Linguistics Classification
7.77M 202
Wilsonart Laminates Others Classification
10.07M 198
CPE Twitter Data Email and Messaging Classification
0.14M 196
Erowid Experience Reports word2vec Vectors Online Communities, NLP, Psychology, Clustering Classification
19.95M 229
WebMD Drug Reviews Dataset: user reviews of various drugs. The dataset provides user reviews on specific drugs along with related conditions, side effects, age, sex, and ratings r... NLP, Computer Science, Education, Tabular Data, Drugs and Medications Classification
168.58M 250
CC-100 Kannada Monolingual Dataset: 13 million monolingual sentences from web-crawled data. This monolingual dataset includes roughly 13 million uncleaned Kannada sentences crawled from numerous websites.... NLP, Text Data, Languages Classification
3.51G 296
covid19 Spanish es py tweets, early to end of April 2020 Earth and Nature, Health, Social Networks, Coronavirus, NLP, Text Data Classification
805.29M 485
Subreddit data from wallstreetbets and others, for sentiment analysis to backtest quantitative trading algorithms. All of the submissions to each of the r/wallstreetbets, r/investing, r/options, and r/SecurityAnalysis subreddits since... NLP, Online Communities, Investing Classification
1.49G 250
Tamil Binary Classification 1K Tweets Labeled V1 NLP, Classification Classification
0.38M 251
Stackoverflow Question Classification Challenge Context: Asking questions is a part of learning. There's no shame in not knowing something and coming to others for he... NLP Classification
6.37M 539
Toxic Comment Detection Multilingual [Extended]: a supplement for the Jigsaw toxic comment classification competition. This is a compilation of all the toxic comment databases out there. I made this for ease of use during the Jigsaw Toxic... NLP, Deep Learning, Classification, Binary Classification Classification
117.55M 245
JigSaw Toxic Comment Classification Cleaned Data: Jigsaw comments with sentiment, comment length, and translated text. I've been working on the JigSaw Multilingual Toxic Comment classification competition and found that the data requir... NLP, Deep Learning, Feature Engineering, Text Data Classification
263.44M 326
Coronavirus tweets NLP: Text Classification Earth and Nature, Health, NLP, Text Data Classification
10.97M 419
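WikiText Long-Term Dependency Language Modeling Dataset. The WikiText language modeling dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia. Compared to the Penn Treebank (PTB)... NLP, Deep Learning, Text Data Classification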
1.11G 226
Neuralink Tweets Business, Online Communities, News, NLP, Artificial Intelligence Classification
1.08M 219
Emotion Data from Whatsapp Statuses Email and Messaging, NLP, Deep Learning, spaCy Classification
2.49M 269