用户分析和滥用语言检测数据集数据集
Abubakar Umar, abubakar.pg715673 '@' st.futminna.edu.ng, Federal University of Technology Minna, Nigeria,Data Se...Computer Classification
3.2M
672
University of California, Irvine
UJI笔字符(第2版)数据集
F. Prat(*), M. J. Castro(+), D. Llorens(*), A. Marzal(*), and J. M. Vilar(*)* Departamento de Lenguajes y Sistemas Infor...Computer Classification
2.03M
739
University of California, Irvine
TTC-3600:土耳其文本分类数据集的基准数据集
Assist.Prof.Dr. Deniz KILIN??, Faculty of Technology, Celal Bayar University, Turkeydrdenizkilinc'@'gmail.comDat...Computer Classification
2.5M
825
University of California, Irvine
句子分类数据集
America Chambers, ahollowa '@' ics.uci.edu, University of California, IrvineData Set Information:Please see the...N/A Classification
2.9M
667
University of California, Irvine
Semeion手写数字数据集
The dataset was created by Tactile Srl, Brescia, Italy (http://www.tattile.it) and donated in 1994 to Semeion Research C...Computer Classification
2.8M
1226
University of California, Irvine
路透社21578文本分类收集数据集
David D. LewisATT Labs - Research lewis '@' research.att.comdocuments came from Reuters newswire in 1987.Data Se...Agriculture Classification
7.8M
2630
University of California, Irvine
手写数字数据集的笔式识别
Data Set Information:我们从44位作者那里收集了250个样本,创建了一个数字数据库。由30位作者编写的样本用于培训、交叉验证和作...NLP Classification
1.58M
709
University of California, Irvine
54个不同的文档分类数据集
Data Set Information:The 5473 examples comes from 54 distinct documents. Each observation concerns one block. All attrib...Computer Classification
103K
580
University of California, Irvine
NYSK数据集,用于文本挖掘任务中的主题提取、情绪分析
Data Set Information:Documents are first obtained via a Web search using AMIEI: an integrated platform for delivering en...NLP,Social Clustering
17.5M
546
University of California, Irvine
荷兰实用地图集合中提取的手写数字(`0'--`9')的特征数据集
Data Set Information:该数据集包含从荷兰实用地图集合中提取的手写数字(`0'--`9')的特征。每类200个图案(总共2000个...Computer Classification
4.93M
989
University of California, Irvine
26个大写字母识别数据集
Data Set Information:The objective is to identify each of a large number of black-and-white rectangular pixel displays a...NLP Classification
366K
849
University of California, Irvine
KDC-4007文本数据集(体育、宗教、艺术、经济、教育、社会、风格和健康)
Data Set Information:该数据集最重要的特点是其使用简单且有良好的文档记录,可广泛用于有关库尔德索拉尼新闻和文章的各种文本...Computer Classification
853K
571
University of California, Irvine
语法面部表情数据集,A-Z语法面部表情的名称和规格(目标或数据点)
Data Set Information:面部表情的自动分析已广泛应用于不同的研究领域,如生物特征识别或情感分析。手语领域特别重视面部表情,...NLP Classification
19.2M
591
University of California, Irvine
OCR图像数据集,可用于OCR系统分类算法的基准测试
Data Set Information:Data Type: GrayScale Image The image dataset can be used to benchmark classification algorithm for...NLP Classification
76.7M
705
University of California, Irvine
Nettalk语料库数据集 包括20008个带重音的对齐字母和语音表示法
该数据集是由Terry Sejnowski,现在在索尔克研究所和加利福尼亚大学圣地亚哥的基准集合。该数据集是与普林斯顿大学的查尔斯·罗...N/A N/A
175K
697
University of California, Irvine
facens nlp语料库分类 用于FACENS DS 课程的第一个 NLP 活动的基础数据集
Passado como dataset para atividade 01 de NLP do curso Ciência de Dados da FACENS.Mantido integralmente no Kaggle, mesm...NLP,Health,Multiclass Classification Classification
0.89M
545
Renan Renger
3.64G
433
ymtoo
词袋模型的描述数据
词袋模型模型下,像是句子或是文件这样的文字可以用一个袋子装着这些词的方式表现,这种表现方式不考虑文法以及词的顺序。最近词...NLP,Business Classification
208.03M
433
VictorHBD
预训练的 word2vec 模型,英语维基百科数据
# DatasetThis dataset was created by RabidBear# ContentsIt contains the following files:...Others Classification
3.2G
427
RabidBear