文字袋数据集 可用于了解路透社的哪些文章是关于“企业收购”的
Data Set Information:The original data were formatted by Thorsten Joachims in the “bag-of-words” representation. There...NLP Classification
1.21M
767
University of California, Irvine
文字袋(包含文本和单词)数据集
Data Set Information:For each text collection, D is the number of documents, W is the number of words in the vocabulary,...NLP Classification
1.77G
733
University of California, Irvine
大型室外中文字符OCR标注数据集,包含3850个独特字符的约100 万个汉字
In this paper, we introduce a very large Chinese text dataset in the wild. While optical character recognition (OCR) in...Action/Event Detection Classification
36.23G
342
清华大学-腾讯联合实验室
玩转文字分类广告,文本分类广告的分类数据集
ContextWe need to categorize ads automatically sinse many times it is not provided by external data sources.Contentcatid...Business,Real Estate Classification
250.73M
510
CristhianBoujon