Select Language

AI社区

公开数据集

相关搜索
您是不是在找?
今日排行
本周排行
本月排行
变压器 2.8.0 Computer Science,Movies and TV Shows,NLP Classification
7.71M 540
夜愿歌词(1996 2020) Music,NLP,Text Data,Clustering Classification
0.36M 229
罗伯塔基地小队2配置数据 Arts and Entertainment,Software,NLP Classification
476.83M 224
拼图10长度120时代 NLP Classification
8215.28M 222
TOI 2018新闻文章 Computer Science,Internet,News,Social Science,NLP,Deep Learning Classification
267.42M 526
NLP阿尔伯特模型微调为2.0班 NLP,Transfer Learning Classification
175.27M 416
拥抱变形金刚2.3.0 Movies and TV Shows,NLP,Transfer Learning Classification
16.76M 211
所有美国国情咨文演讲(1790 2019) Government,NLP,Text Data,United States,Text Mining Classification
10.16M 229
语言生成数据集:2亿个样本,用于语言生成的已处理Amazon Review数据集 Amazon Customer Reviews Dataset is a dataset of user-generated product reviews on the shopping website Amazon. It contai...NLP,Business,Deep Learning,Classification,Artificial Intelligence Classification
20.51G 294
4.33M 425
CONLL2003杂项词重新标记 Earth and Nature,NLP,Text Data,Text Mining,spaCy Classification
0.01M 263
2020年美国总统大选 Politics,NLP,Text Data Classification
30.88M 219
NLP Word2Vec 现有的word2vec嵌入,包括手套和谷歌新闻,用于被训练来重建单词的语言上下文 Word2vec is a group of related models that are used to produce word embeddings. These models are shallow, two-layer neur...NLP,Computer Science Classification
5.89G 300
阿拉伯语Wiki数据转储2018 NLP,Languages,Neural Networks Classification
0M 198
手套6B 200d预训练词向量 Business,NLP Classification
661.31M 266
用土耳其语编写的数据,可以训练word2vec或n-gram模型 This data contains each document written in Turkish and contains wiki document id. You can train word2vec or n-gram mode...NLP,Text Data,Text Mining Classification
463.02M 433
纽约时报评论,对《纽约时报》发表文章的评论,超过200万条评论 New York Times has a wide audience and plays a prominent role in shaping people's opinion and outlook on current aff...NLP,Computer Science,Programming,News Classification
1.55G 284
Word2vec在维基百科上训练数据(单字母+双字母),以捕捉unigram和bigram 这是一个单词嵌入模型,创建于维基百科+各种来源的评论。与从基于短语的方法(不考虑相邻词的短语/双词上下文)创建双词不同,这...NLP,Computer Science,Software,Programming,Neural Networks Classification
8.62G 299
律政司2009 2018年新闻稿 Earth and Nature,Politics,NLP,Crime,Text Data Classification
52.47M 247
Facebook 发布的300维预训练,在 Common Crawl 上训练的200万个词向量 300-dimensional pretrained FastText English word vectors released by Facebook.The first line of the file contains the nu...NLP,Arts and Entertainment Classification
650M 360