基于Reddit评论的单词表示法的全局矢量数据集
GloVe Reddit Comments Global Vectors for Word Representation based on Reddit comments...NLP Classification
19.1G
350
Leigh
整个月的Reddit聊天评论数据
Reddit chat comments of an entire month...NLP,Online Communities,Text Data,Text Mining,Languages Classification
1.65G
520
SAI HARISH REDDY GUNDA
Reddit 评论分数预测,使用 NLP 预测评论分数
The idea behind this dataset is to try to predict whether a particular comment would be highly up-voted or down-voted gi...NLP,Computer Science,Social Science Classification
1.8G
262
Evan Hallmark
reddit自我发布分类任务,包含1000 多个精心挑选的类别
Welcome to the Reddit Self-Post Classification Task (RSPCT)!The aim of this dataset was to create an interesting, large...NLP,Classification,Computer Science,Multiclass Classification Classification
839.37M
272
Mike Swarbrick Jones
reddit向量数据集,用于训练 sence2vec模型
Sence2vec word embeddings model works better than word2vec , since it utilises contextual information from words.This re...NLP,Computer Science,Text Data,spaCy Classification
635.76M
352
Poonam Ligade