Medical Transcriptions: medical transcription data obtained from mtsamples
Medical data is extremely hard to find due to HIPAA privacy regulations. This dataset offers a solution by providing med...
NLP, Health, Medicine | Classification
16.22M
250
Tara Boyle
FakeNewsNet: data collection for fake news research (fake news, misinformation, data mining)
This is a repository for an ongoing data collection project for fake news research at ASU. We describe and compare FakeN...
NLP, News, Social Science, Social Networks | Classification
72.61M
955
Deepak Mahudeswaran
Reddit vectors dataset for training sense2vec models
The sense2vec word embedding model works better than word2vec, since it utilises contextual information from words. This re...
NLP, Computer Science, Text Data, spaCy | Classification
635.76M
351
Poonam Ligade
Medium Articles: posts tagged AI, machine learning, data science, or artificial intelligence, plus user information
Medium taps into the brains of the world’s most insightful writers, thinkers, and storytellers to bring you the smartes...
NLP, Text Data, Literature | Classification
1.8G
299
AiswaryaRamachandran
Entity extraction from Pitchfork reviews
Business, Arts and Entertainment, Music, Retail and Shopping, NLP, Popular Culture | Classification
14.49M
561
Justin K
Stack Overflow 2018 questions dataset
In this dataset, we explore StackOverflow questions and try to use unsupervised algorithms to extract tags, then train c...
NLP, Earth and Nature, Computer Science, Multiclass Classification | Classification
230.27M
336
Réda
Christmas recipes
Religion and Belief Systems, NLP, Cooking and Recipes, Holidays and Cultural Events | Classification
2.51M
494
Gary Broughton
Thousands of questions about love: questions and answers in the love category from a Q&A service
Context: Russian language. This dataset was collected from real answers to questions on the mail.ru service: https://otvet.mail....
NLP, Education, Text Data, Languages | Classification
176.23M
249
Boris Zubarev
ACL Anthology papers: paper data from the ACL Anthology
Data on accepted papers from the ACL Anthology. A paper's abstract is extracted from arXiv if it exists. The data i...
NLP, Education, Literature | Classification
1.14M
239
Takahiro Kubo
Email spam
Context: Some emails from [Spam Assassin][1] to create models that can differentiate between spam and ham (non-spam) ema...
NLP, Classification, Software, Email and Messaging | Classification
12.08M
251
Wessel van Lit
123.13M
432
Henry Dashwood
0M
460
Mohamed Loey
MJSynth Synthetic Word Dataset
This is a synthetically generated dataset which we found sufficient for training text recognition on real-world images. This...
NLP | Classification
9.95G
1484
Visual Geometry Group
ICDAR 2013 dataset
1. 150 images written in Greek and English, as well as 50 images written in Indian Bangla. 2. BlackWhite ha...
NLP | Text
172.61M
1364
Computational Intelligence Laboratory, Institute of Informatics and Telecommunications, National Center for Scientific Research