Select Language

AI社区

公开数据集

相关搜索
您是不是在找?
今日排行
本周排行
本月排行
wallstreetbets等的Subreddit数据,用于后验量化交易算法的情绪分析 All of the submissions to each of the r/wallstreetbets, r/investing, r/options, and r/SecurityAnalysis subreddits since...NLP,Online Communities,Investing Classification
1.49G 240
IMDB摘要 Arts and Entertainment,Movies and TV Shows,NLP,Text Data Classification
93.03M 256
日语-英语字幕语料库(JESC)[CLEANED],由280万个句子组成的大型语料库 This dataset is cleaned version of JESC by handling misplelled English words and doing word segmentation using:English=...NLP,Business,Computer Science,Languages Classification
220.08M 273
古腾堡 Education,Software,NLP,Text Data Classification
14.25M 198
ELI5记分器训练数据原型816000例,用于创建评分模型 ELI5 means Explain like I am 5 . It's originally a long and free form Question-Answering scraping from reddit eli5 s...NLP,Earth and Nature,Arts and Entertainment,Education,Social Science,Sports,Regression,Transformers Classification
672.61M 246
NERu数据集 NLP,Text Data,LSTM Classification
14.5M 181
海得拉巴Zomato餐厅 NLP,Ratings and Reviews,Cooking and Recipes,spaCy Classification
3.44M 477
泰米尔二进制分类1K tweets标签V1 NLP,Classification Classification
0.38M 242
标记化器 NLP Classification
14.88M 243
印度快报的新闻文章数据集 Business,Arts and Entertainment,News,NLP,Classification,Deep Learning,Linguistics,Recommender Systems Classification
63.24M 211
1994 2009年Arxiv量子物理学论文 Education,NLP,Physics Classification
92.19M 264
多种语言 Software,NLP,Deep Learning Classification
0.23M 523
大型影评数据集 Original: https://ai.stanford.edu/~amaas/data/sentiment/Это удобное изложение датасета отз...NLP,Arts and Entertainment,Movies and TV Shows Classification
63.18M 219
Stackoverflow问题分类挑战 ContextAsking questions is a part of learning. There's no shame in not knowing something and coming to others for he...NLP Classification
6.37M 513
芬兰动词的补语格 NLP Classification
1.56M 182
线缆 19 QA Coronavirus,NLP Classification
23.62M 229
Zeki MFC;任15E;ark131;SF6;zleri |歌词 Music,NLP,Artificial Intelligence,LSTM Classification
0.33M 220
IMBD情绪分类数据集,用spacy标记并以JSON格式存储 ContextIMDB sentiment classification dataset from derived from torchtext, tokenized using spacy and then stored as JSON...NLP,Beginner,Earth and Nature,Movies and TV Shows,Text Data,Binary Classification,spaCy Classification
104.31M 232
HuggingFace变压器库的Lonformer基础模型 allenai-longformer-base-4096 Lonformer-base model for HuggingFace Transformers library...NLP,Arts and Entertainment,Transfer Learning Classification
568.34M 227
新闻点击诱饵数据集 Business,Online Communities,News,NLP,Classification,Deep Learning,Text Data Classification
3.41M 295