Related data categories
Vegetables (Google Word2Vec Freebase, .tsv file)
Vegetables (Google Word2Vec Freebase, .tsv file)...NLP,Computer Science,Food Classification
20G
706
Liling Tan
Tweet files for gender guessing
Online Communities,Social Science,Social Networks,NLP,Binary Classification,Gender Classification
5.74M
466
Andy Harless
DARPA TIMIT Acoustic-Phonetic Continuous Speech
# DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus - **Special thanks**: **https://github.com/philipperemy/timit/edit/master/README.md** - **Download...NLP,Audio Data Classification
1198.1M
238
Michael Fekadu
129M
199
Liling Tan
Hearthstone: translation data for Hearthstone card names and descriptions
Translation of Hearthstone card names and descriptions. Languages: German, English, Spanish, French, Italian, Japanese, K...NLP,Arts and Entertainment,Video Games,Games,Comics and Animation,Card Games Classification
54.7M
488
Liling Tan
134.5M
609
utsav
Cleaned-up version of the toxicity-with-bias dataset
cleaned tox bias cleaned up version of toxicity with bias data set...NLP,Data Cleaning,Health Classification
535.39M
680
Ilya Evenbach
Russian Telegram chat history: data parsed from public Russian Telegram chats
Russian Telegram chats history. Data parsed from the most popular public Russian Telegram chats...NLP,Text Data,Russia Classification
11.08G
351
Nick
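Sentiment analysis in natural language processing
# Dataset This dataset was created by NowYSM. Released under Database: Open Database, Contents: Database Contents. # Contents It contains the following files:...NLP,Arts and Entertainment Classification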
2.52M
350
NowYSM
NLP data
# Dataset This dataset was created by AbiyuG. Released under CC BY-NC-SA 4.0. # Contents It contains the following files:...NLP,Psychology Classification
3.14M
331
AbiyuG
Star Trek scripts: raw text scripts and processed lines for all Star Trek series
Star Trek Scripts Text. Data scraped from http://www.chakoteya.net/StarTrek/index.html Code here: https://github....NLP,Movies and TV Shows,Text Data,Text Mining Classification
42.63M
322
Gary Broughton
Toxic embeddings: Universal Sentence Encoder encodings of the text from the Jigsaw Toxic Comment challenge
There's no need for everyone to encode the same text with the Universal Sentence Embedding. This data set contains the...NLP,Deep Learning,Earth and Nature Classification
610.81M
622
Liling Tan
CoNLL003: annotated dataset for the Named Entity Recognition (NER) problem
This is an annotated dataset for the Named Entity Recognition (NER) problem. Content: This dataset is divided into train.txt, te...NLP,Arts and Entertainment,Computer Science,Text Data,Games,Text Mining Classification
4.63M
498
AlaaKhaled