DBpedia语义网应用范例,提供42782篇维基百科文章提供了分类、分层类别
DBpedia (from DB for database) is a project aiming to extract structured content from the information created in Wikiped...Education,Text Data,Multiclass Classification,Text Mining Classification
443.28M
443
Dan Ofer![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
中国机器翻译研讨会,语料数据集
# DatasetThis dataset was created by Liling TanReleased under Other (specified in description)# ContentsIt contains the...Deep Learning,Computer Science Classification
6.6G
502
Warmth![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
星际迷航脚本,所有《星际迷航》系列脚本的原始文本脚本和处理行
Star Trek Scripts TextData scraped from data from http://www.chakoteya.net/StarTrek/index.htmlCode here: https://github....NLP,Movies and TV Shows,Text Data,Text Mining Classification
42.63M
366
Gary Broughton![](https://www.payititi.com/api/avatar/show.php?username=xiaochengxu&size=large)
机器人先生中的单词事件,了解F-Society最喜欢的行话
Mr. Robot is all about data whether it's corrupting it, encrypting it, or deleting it. I wanted to dig up some data...Arts and Entertainment,Games Classification
0.31M
426
Emma![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
NLP 数据
# DatasetThis dataset was created by AbiyuGReleased under CC BY-NC-SA 4.0# ContentsIt contains the following files:...NLP,Psychology Classification
3.14M
371
AbiyuG![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
手写数学符号数据集,超过10万个图像样本
Dataset consists of jpg files(45x45)DISCLAIMER: dataset does not contain Hebrew alphabet at all. It includes basic Greek...NLP,Computer Science,Law,Email and Messaging Classification
410.19M
626
Xai Nano![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
自然语言处理中的情感分析
#数据集此数据集由NowYSM在Database:Open Database,Contents:Database Contents#Contents下创建。它包含以下文件:...NLP,Arts and Entertainment Classification
2.52M
386
NowYSM![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
俄罗斯电报聊天记录,公开俄罗斯电报聊天中解析的数据
Russian Telegram chats history Data parsed from must popular public Russian Telegram chats...NLP,Text Data,Russia Classification
11.08G
389
Nick![](https://www.payititi.com/api/avatar/show.php?username=xiaochengxu&size=large)
带有偏差数据集的毒性清理版本
cleaned tox bias cleaned up version of toxicity with bias data set...NLP,Data Cleaning,Health Classification
535.39M
752
Ilya Evenbach![](https://www.payititi.com/api/avatar/show.php?username=xiaochengxu&size=large)
134.5M
682
utsav![](https://www.payititi.com/api/avatar/show.php?username=xiaochengxu&size=large)
MNIST 类似字母的数据集(A-Z)
Consist 28x28 handwritten Alphabet imagesContentThere are total 785 columns, each row consists an image of alphabets. Th...NLP,MNIST,CNN Classification
665.89M
781
Ashish Gupta![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
谷歌Word2Vec模型,包括 300 万个单词和短语的词汇表的单词向量
It’s 1.5GB! It includes word vectors for a vocabulary of 3 million words and phrases that they trained on roughly 100 b...Computer Science,Programming Classification
3.64G
419
Umberto![](https://www.payititi.com/api/avatar/show.php?username=xiaochengxu&size=large)
短信垃圾邮件收集,自然语言处理预测短信是否是垃圾邮件
ContextThe SMS Spam Collection.ContentBase on the text on SMS message, we should predict it is spam or not spam....Classification,Text Data,Email and Messaging,Text Mining Classification
50K
360
Truoc Pham![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
中国字符 20950,utf-8中的所有汉字及其发音数据集
中国字符 20950,utf-8中的所有汉字及其发音数据集...NLP,Comics and Animation Classification
1.2M
405
-T.K.-![](https://www.payititi.com/api/avatar/show.php?username=ceshishuju009&size=large)
斯坦福大学的GloVe预训练的单词向量
GloVe is an unsupervised learning algorithm for obtaining vector representations for words. Training is performed on agg...Computer Science,Sports,Retail and Shopping Classification
989.9M
466
Mohamed Atef Khalaf![](https://www.payititi.com/api/avatar/show.php?username=stanford&size=large)
Hearthstone Hearthstone卡名称和描述的翻译数据
Translation of Hearthstone card names and descriptions.Languages: German, English, Spanish, French, Italian, Japanese, K...NLP,Arts and Entertainment,Video Games,Games,Comics and Animation,Card Games Classification
54.7M
568
Liling Tan![](https://www.payititi.com/api/avatar/show.php?username=xiaochengxu&size=large)