Data structure: 14.36M
README.md
Context
Discovering Trump's character traits from his most retweeted tweets using HumanFirst (www.humanfirst.ai)
I used HumanFirst Studio to explore Trump's most retweeted tweets, which allowed me to create character-trait-based classifiers. HumanFirst's active learning pipeline (which involves zero-shot classification) made it very efficient to explore the data, discover the various character traits and label his tweets accordingly.
I started with higher level traits and decided to split his tweets between negative and positive.
I spent more time on the negative side. After a few minutes of manually exploring tweets that clearly illustrated negative character traits, I could use HF's active learning workflows and similarity search to sort the rest of his negative tweets efficiently into their appropriate labels. I then went more granular and developed a hierarchy of character traits under the negative label using HumanFirst's refactoring/splitting of training data, which gave me traits like egocentric, accusive, angry, etc. I then realized that most of these granular traits can be grouped under one very general trait that illustrates Donald Trump's character: narcissist.
Here's the split out of ~320 classified tweets:
71% pointing towards negative character traits
29% pointing towards neutral or positive character traits
Of the negative traits:
88% pointing towards narcissism
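As a rough check of what these percentages mean in absolute terms, here is a minimal sketch that converts them into approximate tweet counts. The total of 320 is only the approximate figure quoted above, so the results are estimates, not counts taken from the dataset; the function name is hypothetical.

```python
# Figures quoted in the README; the total is approximate ("~320").
TOTAL_TWEETS = 320
NEGATIVE_SHARE = 0.71
NARCISSISM_SHARE_OF_NEGATIVE = 0.88


def approximate_counts(total, negative_share, narcissism_share):
    """Turn the reported percentages into rounded, approximate counts."""
    negative = total * negative_share
    narcissism = negative * narcissism_share
    return {
        "negative": round(negative),
        "neutral_or_positive": round(total - negative),
        "narcissism": round(narcissism),
        # Narcissism as a fraction of all classified tweets.
        "narcissism_share_of_all": narcissism / total,
    }


counts = approximate_counts(
    TOTAL_TWEETS, NEGATIVE_SHARE, NARCISSISM_SHARE_OF_NEGATIVE
)
print(counts)
```

Under these assumptions, roughly 227 tweets fall under negative traits, about 200 of those under narcissism, which is a little over 62% of all classified tweets.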
Content
trumptweets-output.csv shows the classified tweets under their respective labels; hierarchy levels are separated by "/" in the right column
trumptweets-input.txt is a flat list of Donald Trump's tweets that I imported to HumanFirst
trumptweets-source file.csv is the file containing all his tweets (found on Kaggle)
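The "/"-separated label paths in the output file can be tallied with a few lines of Python. The sketch below is illustrative only: it assumes the label path sits in the last column of each row, and the sample rows and label names are hypothetical placeholders, not rows from the actual file.

```python
import csv
import io
from collections import Counter


def count_labels(rows, depth=1):
    """Count rows by the first `depth` levels of their "/"-separated label path.

    Assumption: the label path is in the last (right-most) column.
    """
    counts = Counter()
    for row in rows:
        if not row:
            continue  # skip blank lines
        parts = [p.strip() for p in row[-1].split("/") if p.strip()]
        if parts:
            counts["/".join(parts[:depth])] += 1
    return counts


# Hypothetical sample in the assumed layout: tweet text, then label path.
SAMPLE = """\
placeholder tweet one,negative/narcissist/egocentric
placeholder tweet two,negative/narcissist/angry
placeholder tweet three,positive
"""

rows = list(csv.reader(io.StringIO(SAMPLE)))
top_level = count_labels(rows, depth=1)
second_level = count_labels(rows, depth=2)
print(top_level)
print(second_level)
```

To run this against trumptweets-output.csv, pass `csv.reader(f)` for the opened file instead of the sample. If the file has a header row, skip it first, since the function would otherwise count the header as a label.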
Acknowledgements
Thanks @austinreese for the source file