万篇德国新闻文章数据集,10kGNAD基于一百万篇文章语料库
10kGNAD数据集旨在作为第一个德国主题分类数据集解决部分问题。它由一家奥地利在线报纸的10273篇德语新闻文章组成,分为9个主题...NLP,Classification,Computer Science,Programming,News,Social Science Classification
51.81M
255
Timo Block
纯文本维基百科,每个文件都包含维基百科文章的集合
Wikipedia dumps contain a tremendous amount of markup. WikiMedia Text is a hybrid of markdown and HTML, making it very d...NLP,Computer Science,Text Data,Text Mining Classification
23.71G
246
DavidShapiro
阿拉伯新闻文章半岛电视台.net
Business,Education,News,NLP,Text Data,Psychology,Text Mining Classification
111.89M
487
Abdelkader Rhouati
荷兰新闻文章
Internet,News,NLP,Text Data,Exploratory Data Analysis,Text Mining Classification
351.62M
212
Max Scheijen
Virgool数据集,这是一套从virgool.io收集的波斯文章数据
This could be a nice tool for Persian writers or bloggers to automatically pick the suggested hashtag or even subject fo...NLP,Education,Software,Literature Classification
58.89M
293
Mehran Rafiee
媒体文章集2020版
Arts and Entertainment,Computer Science,Education,NLP Classification
1.63M
315
KAUSTUBH1828
来自印度快报的新闻文章数据集
Business,Arts and Entertainment,News,NLP,Classification,Deep Learning,Linguistics,Recommender Systems Classification
63.24M
223
Pulkit Komal