CC-100 Kannada Monolingual Dataset: 13 Million Monolingual Sentences from Web Crawl Data
This monolingual dataset includes roughly 13 million uncleaned Kannada sentences crawled from numerous websites....
NLP, Text Data, Languages
Classification
3.51G
348
Darshan
Beat Bobby Flay: Results from 300 Episodes
Movies and TV Shows, Food, NLP, Classification, Cooking and Recipes
Classification
0.06M
274
Jeffrey Braun
300-Dimensional Pretrained FastText English Word Vectors Released by Facebook
300-dimensional pretrained FastText English word vectors released by Facebook. The first line of the file contains the nu...
NLP, Arts and Entertainment, Games
Classification
4.52G
381
Vladimir Demidov
2 Million 300-Dimensional Pretrained Word Vectors Released by Facebook, Trained on Common Crawl
300-dimensional pretrained FastText English word vectors released by Facebook. The first line of the file contains the nu...
NLP, Arts and Entertainment
Classification
650M
445
Manish Maharjan