Steam官方网站的大约 80000 款游戏数据集
这是一个数据集,包含任何可抓取的信息,关于来自 Steam 官方网站的大约 80000 款游戏。大多数列包含有价值的信息,可以让您更好...Video Games,Games Classification
98.8M
1004
Deepan.N
28种语言中的停止词,自然语言处理中的文本预处理
Stopwords are the words in any language which does not add much meaning to a sentence. They can safely be ignored withou...NLP,Computer Science,Text Data,Languages Classification
0.09M
710
Heeral Dedhia
日语-英语字幕语料库(JESC)[CLEANED],由280万个句子组成的大型语料库
This dataset is cleaned version of JESC by handling misplelled English words and doing word segmentation using:English=...NLP,Business,Computer Science,Languages Classification
220.08M
370
Wahyu Setianto
ELI5记分器训练数据原型816000例,用于创建评分模型
ELI5 means Explain like I am 5 . It's originally a long and free form Question-Answering scraping from reddit eli5 s...NLP,Earth and Nature,Arts and Entertainment,Education,Social Science,Sports,Regression,Transformers Classification
672.61M
341
Neuron Engineer