相关搜索
您是不是在找?
今日排行
本周排行
本月排行
28种语言中的停止词,自然语言处理中的文本预处理
Stopwords are the words in any language which does not add much meaning to a sentence. They can safely be ignored withou...NLP,Computer Science,Text Data,Languages Classification
0.09M
518
Heeral Dedhia
媒体文章集2020版
Arts and Entertainment,Computer Science,Education,NLP Classification
1.63M
314
KAUSTUBH1828
日语-英语字幕语料库(JESC)[CLEANED],由280万个句子组成的大型语料库
This dataset is cleaned version of JESC by handling misplelled English words and doing word segmentation using:English=...NLP,Business,Computer Science,Languages Classification
220.08M
288
Wahyu Setianto
Reddit印度NLP数据集,数据集包括2017-2020年从R/India子版块的帖子
[![forthebadge made-with-python](http://ForTheBadge.com/images/badges/made-with-python.svg)](https://www.python.org/) [!...NLP,Classification,Multiclass Classification,India Classification
117.86M
245
Pranav Hari
零售交易[于2020年7月17日发布]
Online Communities,Retail and Shopping,NLP,Data Visualization,Tabular Data,Data Cleaning Classification
1.3M
223
Jahnic Beck-Joseph
1914 2019年电影摄影中的所有伤害
Arts and Entertainment,Movies and TV Shows,NLP,Crime,Text Data,Exploratory Data Analysis,Text Mining Classification
0.09M
264
Thomas Konstantin
DetikNews标题(每2015 2020年1月15日至10月15日)
Internet,Online Communities,News,NLP Classification
1.17M
292
Fakhruzzaman
乔·拜登推特(2007 2020)
Internet,Social Networks,Email and Messaging,Politics,NLP,Text Data Classification
1.73M
324
Vopani
斯坦福情感树银行v2(SST2)
Arts and Entertainment,Education,Movies and TV Shows,NLP,Text Data Classification
46.5M
539
Atul Anand {Jha}