相关搜索
您是不是在找?
今日排行
本周排行
本月排行
中国字符 20950,utf-8中的所有汉字及其发音数据集
中国字符 20950,utf-8中的所有汉字及其发音数据集...NLP,Comics and Animation Classification
1.2M
311
-T.K.-
斯坦福大学的GloVe预训练的单词向量
GloVe is an unsupervised learning algorithm for obtaining vector representations for words. Training is performed on agg...Computer Science,Sports,Retail and Shopping Classification
989.9M
370
Mohamed Atef Khalaf
文本语料数据集,包含所有《宋飞》剧集脚本的文本语料库
ContextSeinfeld is my favorite TV show. I wrote a script to scrape the scripts of all Seinfeld episodes from the site se...Text Data Classification
1.41M
454
LuongNguyen
万篇德国新闻文章数据集,10kGNAD基于一百万篇文章语料库
10kGNAD数据集旨在作为第一个德国主题分类数据集解决部分问题。它由一家奥地利在线报纸的10273篇德语新闻文章组成,分为9个主题...NLP,Classification,Computer Science,Programming,News,Social Science Classification
51.81M
250
Timo Block
短篇小说语料库,埃德加·爱伦·坡的短篇小说集
ContentThe present data set includes the full corpus of 69 Edgar Allan Poe's short stories in tabular format. In add...NLP,Text Data,Literature,Text Mining Classification
1.86M
503
Leandro
来自AskUbuntu的意图识别聊天机器人语料库
Context190 questions and answers from https://askubuntu.com. ContentWhat's inside is more than just rows and columns...NLP,Artificial Intelligence Classification
0.23M
443
Elvin Aghammadzada
日语-英语字幕语料库(JESC)[CLEANED],由280万个句子组成的大型语料库
This dataset is cleaned version of JESC by handling misplelled English words and doing word segmentation using:English=...NLP,Business,Computer Science,Languages Classification
220.08M
284
Wahyu Setianto
OSCAR尼泊尔语语料库,尼泊尔语文本语料库,用于训练NLP的无监督语言模型
The files are from [OSCAR Corpus](https://oscar-corpus.com/). Please visit their site for more information.The dataset i...NLP,Computer Science,Movies and TV Shows,Text Data,Languages Classification
3.1G
307
Prabesh Dhakal
经典英语文学语料库与元数据,经典英语书籍及其作者
This is a dataset about classic readings in English, some cases other language translated to English.Dickens, Plato, Sha...NLP,Arts and Entertainment,Literature Classification
431.55M
283
Raynard Jon
波斯维基百科数据集,波斯语(波斯语)维基百科语料库
Persian(Farsi) Wikipedia Dataset | دیتاست ویکی پدیا فارسی شامل تمامی مقالات فارسی...NLP,Deep Learning,Text Data,Data Analytics Classification
804.48M
254
Milad Farzalizadeh
Cal多音节语料库
Education,Universities and Colleges,NLP,Text Data,Text Mining,spaCy Classification
15.26M
368
Michael Fekadu
KcBERT训练前语料库(韩国新闻评论)
Computer Science,Education,News,NLP,Text Data Classification
11899.2M
247
Beomi
泰米尔语语料库
Business,NLP,Classification,India,Artificial Intelligence Classification
11750.7M
240
Praveen