DBpedia语义网应用范例,提供42782篇维基百科文章提供了分类、分层类别
DBpedia (from DB for database) is a project aiming to extract structured content from the information created in Wikiped...Education,Text Data,Multiclass Classification,Text Mining Classification
443.28M
362
Dan Ofer
中国机器翻译研讨会,语料数据集
# DatasetThis dataset was created by Liling TanReleased under Other (specified in description)# ContentsIt contains the...Deep Learning,Computer Science Classification
6.6G
436
Warmth
星际迷航脚本,所有《星际迷航》系列脚本的原始文本脚本和处理行
Star Trek Scripts TextData scraped from data from http://www.chakoteya.net/StarTrek/index.htmlCode here: https://github....NLP,Movies and TV Shows,Text Data,Text Mining Classification
42.63M
320
Gary Broughton
机器人先生中的单词事件,了解F-Society最喜欢的行话
Mr. Robot is all about data whether it's corrupting it, encrypting it, or deleting it. I wanted to dig up some data...Arts and Entertainment,Games Classification
0.31M
356
Emma
NLP 数据
# DatasetThis dataset was created by AbiyuGReleased under CC BY-NC-SA 4.0# ContentsIt contains the following files:...NLP,Psychology Classification
3.14M
331
AbiyuG
手写数学符号数据集,超过10万个图像样本
Dataset consists of jpg files(45x45)DISCLAIMER: dataset does not contain Hebrew alphabet at all. It includes basic Greek...NLP,Computer Science,Law,Email and Messaging Classification
410.19M
562
Xai Nano
自然语言处理中的情感分析
#数据集此数据集由NowYSM在Database:Open Database,Contents:Database Contents#Contents下创建。它包含以下文件:...NLP,Arts and Entertainment Classification
2.52M
349
NowYSM
俄罗斯电报聊天记录,公开俄罗斯电报聊天中解析的数据
Russian Telegram chats history Data parsed from must popular public Russian Telegram chats...NLP,Text Data,Russia Classification
11.08G
351
Nick
带有偏差数据集的毒性清理版本
cleaned tox bias cleaned up version of toxicity with bias data set...NLP,Data Cleaning,Health Classification
535.39M
679
Ilya Evenbach
134.5M
609
utsav
MNIST 类似字母的数据集(A-Z)
Consist 28x28 handwritten Alphabet imagesContentThere are total 785 columns, each row consists an image of alphabets. Th...NLP,MNIST,CNN Classification
665.89M
663
Ashish Gupta
谷歌Word2Vec模型,包括 300 万个单词和短语的词汇表的单词向量
It’s 1.5GB! It includes word vectors for a vocabulary of 3 million words and phrases that they trained on roughly 100 b...Computer Science,Programming Classification
3.64G
362
Umberto
短信垃圾邮件收集,自然语言处理预测短信是否是垃圾邮件
ContextThe SMS Spam Collection.ContentBase on the text on SMS message, we should predict it is spam or not spam....Classification,Text Data,Email and Messaging,Text Mining Classification
50K
307
Truoc Pham
中国字符 20950,utf-8中的所有汉字及其发音数据集
中国字符 20950,utf-8中的所有汉字及其发音数据集...NLP,Comics and Animation Classification
1.2M
349
-T.K.-
斯坦福大学的GloVe预训练的单词向量
GloVe is an unsupervised learning algorithm for obtaining vector representations for words. Training is performed on agg...Computer Science,Sports,Retail and Shopping Classification
989.9M
405
Mohamed Atef Khalaf
Hearthstone Hearthstone卡名称和描述的翻译数据
Translation of Hearthstone card names and descriptions.Languages: German, English, Spanish, French, Italian, Japanese, K...NLP,Arts and Entertainment,Video Games,Games,Comics and Animation,Card Games Classification
54.7M
487
Liling Tan