公开数据集
数据结构 ? 5.8M
Data Structure ?
* 以上分析是由系统提取分析形成的结果,具体实际数据为准。
README.md
Jun Liu(liukeen '@' mail.xjtu.cn), Hao Chen(lechenhao '@' gmail.com) , Mengting Zhan, Jianhong Mi,Yanzhang Lv
MOEKLINNS Lab, Department of Computer Science ,Xi'an Jiaotong University, China
Data Set Information:
Our dataset is used by us to explore spammers in microblog and you can access our demo system at
[Web link]
Please add :8080 after the domain name as port. The repository webpage fails to parse the weblink when it's added in the source. (under inspection)
Attribute Information:
weibo_user.csv has the following attributes:
-user_id: account ID in sina weibo;
-user_name: account nickname???
-gender:account registration gender including male??? female and other???
-class:account level given by sina weibo;
-message:account registration location or other personal information;
-post_num: the number of posts of this account up to now;
-follower_num: the number of followers of this account;
-followee_num: the number of followee of this account;
-follow ratio: followee_num/follower_num;
-is_spammer: manually annotated label, 1 means spammer and -1 means non-spammer;
user_post.csv has the following attributes:
-post_id:user post ID given by sina weibo;
-post_time:the time when a post is posted;
-poster_id: the user ID who posted this post;
-repost_num:the number of retweet by others;
-commnet_num: the number of comment by others;
followe-followee.csv has the following attributes:
-follower: the nickname of follower;
-follower_id: the user ID of follower;
-followee: the nickname of followee;
-followee_id: the user ID of followee;
post.csv is almost the as user_post.csv and the post in it are retrievalled by a certain key word related to a topic;
-content: the post text(mostly in Chinese, please set your Microsoft Office to make it readable)
Relevant Papers:
N/A
Citation Request:
Thanks to MOEKLINNS Lab[[Web link]] especially Spammer Detection Group for opening its data
帕依提提提温馨提示
该数据集正在整理中,为您准备了其他渠道,请您使用
- 分享你的想法
全部内容
数据使用声明:
- 1、该数据来自于互联网数据采集或服务商的提供,本平台为用户提供数据集的展示与浏览。
- 2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
- 3、数据集基本信息来自数据原地址或数据提供方提供的信息,如数据集描述中有描述差异,请以数据原地址或服务商原地址为准。
- 1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。
- 1、如您需要转载本站数据,请保留原数据地址及相关版权声明。
- 1、如本站中的部分数据涉及侵权展示,请及时联系本站,我们会安排进行数据下线。