Select Language

AI社区

公开数据集

相关搜索
您是不是在找?
今日排行
本周排行
本月排行
医生费预测 Exploratory Data Analysis,Regression Classification
0.31M 173
腌制爬虫 300D 2M于果仁比赛 Computer Science,Food Classification
2433.63M 510
于学习数据挖掘的病人入院数据集 Education,Health Classification
0.27M 337
自行车 Washington Classification
0.05M 266
电气足球俱乐部新闻 News,Sports,NLP,Football,Brazil Classification
269.65M 237
WebMD药物评论数据集,各种药物的户评论数据集 The dataset provides user reviews on specific drugs along with related conditions, side effects, age, sex, and ratings r...NLP,Computer Science,Education,Tabular Data,Drugs and Medications Classification
168.58M 243
品酒师给出的葡萄酒评论数据 ,使文本分类来对评论中的评论者进行分类 Thinking of Natural Language Processing as a beginner!!The dataset has been about the wine comments or reviews that has...NLP,Business,News,Text Data,Multiclass Classification,Alcohol Classification
50.35M 513
多模态仇恨言语,150000条带有文本和图像的推特,于仇恨检测 现有的仇恨语音数据集仅包含文本数据。我们创建了一个新的手动注释的多模态仇恨语音数据集,该数据集由150000条推文组成,每条推...NLP,Online Communities,Image Data,Multiclass Classification,Social Networks Classification
6.55G 534
Bash.im公司引 Internet,NLP,Text Data,Text Mining,Russia Classification
38.65M 246
带有语言标签的文本数据。它可以于语言检测。 Language Detection Dataset Text data with language labels. It can be used for language detection....NLP,Classification,Computer Science,Multiclass Classification,Languages Classification
31.7M 507
测试例数据集,软件测试中使的样本数据集的集合 There are lots of datasets available for different machine learning tasks like NLP, Computer vision etc. However I could...NLP,Deep Learning,Earth and Nature Classification
1.3M 300
Tanglish情绪分析推文,使了4个标签来描述推特的情绪 So it all started when I was looking for Abusive Tamil tweets in the Roman Script to use for a project and instead of fi...NLP,Deep Learning,Online Communities,People Classification
0.85M 246
户评级为10M的Goodreads图书数据集 Arts and Entertainment,Social Science,NLP,Literature,Recommender Systems Classification
1128.5M 425
来自wallstreetbets等的Subreddit数据,于后验量化交易算法的情绪分析 All of the submissions to each of the r/wallstreetbets, r/investing, r/options, and r/SecurityAnalysis subreddits since...NLP,Online Communities,Investing Classification
1.49G 243
ELI5记分器训练数据原型816000例,于创建评分模型 ELI5 means Explain like I am 5 . It's originally a long and free form Question-Answering scraping from reddit eli5 s...NLP,Earth and Nature,Arts and Entertainment,Education,Social Science,Sports,Regression,Transformers Classification
672.61M 250
IMBD情绪分类数据集,spacy标记并以JSON格式存储 ContextIMDB sentiment classification dataset from derived from torchtext, tokenized using spacy and then stored as JSON...NLP,Beginner,Earth and Nature,Movies and TV Shows,Text Data,Binary Classification,spaCy Classification
104.31M 236
于Sarcasm检测的新闻标题数据集,于讽刺和假新闻检测任务的高质量数据集 Past studies in Sarcasm Detection mostly make use of Twitter datasets collected using hashtag based supervision but such...NLP,Deep Learning,Classification,Earth and Nature,Computer Science,Programming Classification
11.13M 256
OSCAR尼泊尔语语料库,尼泊尔语文本语料库,于训练NLP的无监督语言模型 The files are from [OSCAR Corpus](https://oscar-corpus.com/). Please visit their site for more information.The dataset i...NLP,Computer Science,Movies and TV Shows,Text Data,Languages Classification
3.1G 302
堆栈溢出户情景 Computer Science,Programming,NLP,Text Data Classification
42.27M 218
于语音克隆的英语多说话人语料库 CSTR-VCTK语料库 This CSTR VCTK Corpus includes speech data uttered by 109 native speakers of English with various accents. Each speaker...NLP,Audio Data Classification
15.22G 381