2_Public Datasets - 帕依提提: Open Platform for High-Quality AI Datasets

Transformers 2.8.0 Computer Science,Movies and TV Shows,NLP Classification

7.71M 724

HyeongChan Kim

Nightwish Lyrics (1996-2020) Music,NLP,Text Data,Clustering Classification

0.36M 313

Yuan Meng

RoBERTa Base SQuAD2 Config Data Arts and Entertainment,Software,NLP Classification

476.83M 293

Zenify

Jigsaw 10 Length 120 Epochs NLP Classification

8215.28M 315

Matt Yates

TOI 2018 News Articles Computer Science,Internet,News,Social Science,NLP,Deep Learning Classification

267.42M 721

PulpAI

NLP ALBERT Model Fine-Tuned for SQuAD 2.0 NLP,Transfer Learning Classification

175.27M 606

Zhuole Lu

Hugging Face Transformers 2.3.0 Movies and TV Shows,NLP,Transfer Learning Classification

16.76M 319

yuval reina

All US State of the Union Addresses (1790-2019) Government,NLP,Text Data,United States,Text Mining Classification

10.16M 313

jyron

Language generation dataset: 200 million samples, a processed Amazon Review dataset for language generation. Amazon Customer Reviews Dataset is a dataset of user-generated product reviews on the shopping website Amazon. It contai... NLP,Business,Deep Learning,Classification,Artificial Intelligence Classification

20.51G 485

Abhishek Chatterjee

Common Vulnerabilities and Exposures (2019) NLP Classification

4.33M 627

Awad Saleh

CONLL2003 MISC Words Relabeled Earth and Nature,NLP,Text Data,Text Mining,spaCy Classification

0.01M 369

chongyixiang

2020 US Presidential Election Politics,NLP,Text Data Classification

30.88M 321

Radu Stoicescu

NLP Word2Vec: existing word2vec embeddings, including GloVe and Google News, trained to reconstruct the linguistic contexts of words. Word2vec is a group of related models that are used to produce word embeddings. These models are shallow, two-layer neur... NLP,Computer Science Classification

5.89G 449

pkugoodspeed

Arabic Wiki Data Dump 2018 NLP,Languages,Neural Networks Classification

0M 278

Abed Khooli

GloVe 6B 200d Pretrained Word Vectors Business,NLP Classification

661.31M 420

HypersportR8

Data written in Turkish, usable for training word2vec or n-gram models. This data contains each document written in Turkish and contains wiki document id. You can train word2vec or n-gram mode... NLP,Text Data,Text Mining Classification

463.02M 628

MustafaKeskin

New York Times Comments: more than 2 million comments on articles published in The New York Times. New York Times has a wide audience and plays a prominent role in shaping people's opinion and outlook on current aff... NLP,Computer Science,Programming,News Classification

1.55G 487

Aashita Kesarwani

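Word2vec trained on Wikipedia data (unigrams + bigrams), capturing both unigrams and bigrams. This is a word embedding model built from Wikipedia plus reviews from various sources. Unlike creating bigrams with a phrase-based approach (which does not consider the phrase/bigram context of neighboring words), this... NLP,Computer Science,Software,Programming,Neural Networks Classification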

8.62G 502

aintnosunshine

Department of Justice 2009-2018 Press Releases Earth and Nature,Politics,NLP,Crime,Text Data Classification

52.47M 324

John B

300-dimensional pretrained vectors released by Facebook: 2 million word vectors trained on Common Crawl. 300-dimensional pretrained FastText English word vectors released by Facebook. The first line of the file contains the nu... NLP,Arts and Entertainment Classification

650M 497

Manish Maharjan
