相关搜索
您是不是在找?
今日排行
本周排行
本月排行
CoNLL003 命名实体识别(NER)问题的注释数据集
This is an annotated dataset for Named Entity Recognition (NER) problemContentThis dataset is divided into train.txt, te...NLP,Arts and Entertainment,Computer Science,Text Data,Games,Text Mining Classification
4.63M
456
AlaaKhaled
有毒嵌入物,拼图有毒评论挑战中的通用句子编码文本
There's no need for everyone to encode the same text with the Universal Sentence EmbeddingThis data set contains the...NLP,Deep Learning,Earth and Nature Classification
610.81M
561
Liling Tan
英语词频,⅓ 百万网络上最常见的英语单词
This dataset contains the counts of the 333,333 most commonly-used single words on the English language web, as derived...Languages Classification
4.73M
535
Rachael Tatman
斯坦福自然语言推理 (SNLI) 语料库的 Jsonl 格式
这是斯坦福大学自然语言推理(snLI)语料库的1.0版本。如果你使用这个语料库,请引用这篇论文: http://nlp.Stanford.edu/pubs/snli...Languages Classification
483.45M
407
John S. Hudzina
2.46M
379
NLTK Data
Mac Morpho,带有词性标签的巴西葡萄牙语新闻文本
The canonical metadata on NLTK:packageid=mac_morphoname=MAC-MORPHO:BrazilianPortuguesenewstextwithpart-of-speechtagswebp...Earth and Nature Classification
10.43M
664
NLTK Data
DBpedia语义网应用范例,提供42782篇维基百科文章提供了分类、分层类别
DBpedia (from DB for database) is a project aiming to extract structured content from the information created in Wikiped...Education,Text Data,Multiclass Classification,Text Mining Classification
443.28M
327
Dan Ofer
SVHN 预处理片段
Arts and Entertainment,Online Communities,Image Data,Classification,Deep Learning,Multiclass Classification Classification
1206.46M
529
Olga Belitskaya
中国机器翻译研讨会,语料数据集
# DatasetThis dataset was created by Liling TanReleased under Other (specified in description)# ContentsIt contains the...Deep Learning,Computer Science Classification
6.6G
402
Warmth
星际迷航脚本,所有《星际迷航》系列脚本的原始文本脚本和处理行
Star Trek Scripts TextData scraped from data from http://www.chakoteya.net/StarTrek/index.htmlCode here: https://github....NLP,Movies and TV Shows,Text Data,Text Mining Classification
42.63M
300
Gary Broughton
机器人先生中的单词事件,了解F-Society最喜欢的行话
Mr. Robot is all about data whether it's corrupting it, encrypting it, or deleting it. I wanted to dig up some data...Arts and Entertainment,Games Classification
0.31M
332
Emma
NLP 数据
# DatasetThis dataset was created by AbiyuGReleased under CC BY-NC-SA 4.0# ContentsIt contains the following files:...NLP,Psychology Classification
3.14M
316
AbiyuG
手写数学符号数据集,超过10万个图像样本
Dataset consists of jpg files(45x45)DISCLAIMER: dataset does not contain Hebrew alphabet at all. It includes basic Greek...NLP,Computer Science,Law,Email and Messaging Classification
410.19M
532
Xai Nano
TMDB 预处理数据集
Please, see [kernel](https://www.kaggle.com/samusram/train-test-comparison-eda-feature-engineering-ffm) for more details...Others Classification
675.4M
300
Raman
自然语言处理中的情感分析
#数据集此数据集由NowYSM在Database:Open Database,Contents:Database Contents#Contents下创建。它包含以下文件:...NLP,Arts and Entertainment Classification
2.52M
334
NowYSM