日语-英语字幕语料库(JESC)[CLEANED],由280万个句子组成的大型语料库
This dataset is cleaned version of JESC by handling misplelled English words and doing word segmentation using:English=...NLP,Business,Computer Science,Languages Classification
220.08M
288
Wahyu Setianto
ELI5记分器训练数据原型816000例,用于创建评分模型
ELI5 means Explain like I am 5 . It's originally a long and free form Question-Answering scraping from reddit eli5 s...NLP,Earth and Nature,Arts and Entertainment,Education,Social Science,Sports,Regression,Transformers Classification
672.61M
262
Neuron Engineer
来自印度快报的新闻文章数据集
Business,Arts and Entertainment,News,NLP,Classification,Deep Learning,Linguistics,Recommender Systems Classification
63.24M
222
Pulkit Komal
大型影评数据集
Original: https://ai.stanford.edu/~amaas/data/sentiment/Это удобное изложение датасета отз...NLP,Arts and Entertainment,Movies and TV Shows Classification
63.18M
224
George
Stackoverflow问题分类挑战
ContextAsking questions is a part of learning. There's no shame in not knowing something and coming to others for he...NLP Classification
6.37M
550
Nasser Boan
Zeki MFC;任15E;ark131;SF6;zleri |歌词
Music,NLP,Artificial Intelligence,LSTM Classification
0.33M
233
ferhatmetin34
IMBD情绪分类数据集,用spacy标记并以JSON格式存储
ContextIMDB sentiment classification dataset from derived from torchtext, tokenized using spacy and then stored as JSON...NLP,Beginner,Earth and Nature,Movies and TV Shows,Text Data,Binary Classification,spaCy Classification
104.31M
244
Manoj Patra
HuggingFace变压器库的Lonformer基础模型
allenai-longformer-base-4096 Lonformer-base model for HuggingFace Transformers library...NLP,Arts and Entertainment,Transfer Learning Classification
568.34M
241
Akim Tsvigun
新闻点击诱饵数据集
Business,Online Communities,News,NLP,Classification,Deep Learning,Text Data Classification
3.41M
314
Vikas Singh
判例法数据集(伊利诺伊州)
Business,Social Science,Law,Universities and Colleges,NLP,Text Data,History Classification
928.44M
173
Caselaw Access Project