公开数据集
数据结构 ? 177.51M
Data Structure ?
* 以上分析是由系统提取分析形成的结果,具体实际数据为准。
README.md
Charlottesville, Virgina
========================
Charlottesville is home to a statue of Robert E. Lee which is slated to be removed. (For those unfamiliar with American history, Robert E. Lee was a US Army general who defected to the Confederacy during the American Civil War and was considered to be one of their best military leaders.) While many Americans support the move, believing the main purpose of the Confederacy was to defend the institution of slavery, many others do not share this view. Furthermore, believing Confederate symbols to be merely an expression of Southern pride, many have not taken its planned removal lightly.
As a result, many people--including white nationalists and neo-Nazis--have descended to Charlottesville to protest its removal. This in turn attracted many counter-protestors. Tragically, one of the counter-protestors--Heather Heyer--was killed and many others injured after a man intentionally rammed his car into them. In response, President Trump blamed "both sides" for the chaos in Charlottesville, leading many Americans to denounce him for what they see as a soft-handed approach to what some have called an act of "domestic terrorism."
This dataset below captures the discussion--and copious amounts of anger--revolving around this past week's events.
The Data
========
Description
-----------
This data set consists of a random sample of 50,000 tweets per day (in accordance with the Twitter Developer Agreement) of tweets mentioning Charlottesville or containing "#charlottesville" extracted via the Twitter Streaming API, starting on August 15. The files were copied from a large Postgres database containing--currently--over 2 million tweets. Finally, a table of tweet counts per timestamp was created using the whole database (not just the Kaggle sample). The data description PDF provides a full summary of the attributes found in the CSV files.
Note: While the tweet timestamps are in UTC, the cutoffs were based on Eastern Standard Time, so the August 16 file will have timestamps ranging from `2017-08-16 4:00:00` UTC to `2017-08-17 4:00:00` UTC.
Format
------
The dataset is available as either separate CSV files or a single SQLite database.
License
=======
I'm releasing the dataset under the CC BY-SA 4.0 license. Furthermore, because this data was extracted via the Twitter Streaming API, its use must abide by the [Twitter Developer Agreement][1]. Most notably, the display of individual tweets should satisfy [these requirements][2]. More information can be found in the data description file, or on Twitter's website.
Acknowledgements
============
Obviously, I would like to thank Twitter for providing a fast and reliable streaming service. I'd also like to thank the developers of the Python programming language, psycopg2, and Postgres for creating amazing software with which this data set would not exist.
Image Credit
------------------
The banner above is a personal modification of these images:
- Evan Nesterak: [Image Source][3] [Image License][4]
- Wikipedia user Cville Dog [Image Source][5]
- The Associated Press [Image Source][6]
Inspiration
==========
I almost removed the header "inspiration" from this section, because this is a rather sad and dark data set. However, this is preciously why this is an important data set to analyze. Good history books have never shied away from unpleasant events, and never should we.
This data set provides a rich opportunity for many types of research, including:
- Natural language processing
- Sentiment analysis
- Data visualization
Furthermore, given the political nature of this dataset, there are a lot of social science questions that can potentially be answered, or at least piqued, by this data.
[1]: https://dev.twitter.com/overview/terms/
[2]: https://dev.twitter.com/overview/terms/display-requirements
[3]: https://www.flickr.com/photos/153804281@N02/36421659232/ "Image Source"
[4]: https://en.wikipedia.org/wiki/Unite_the_Right_rally#/media/File:White_supremacists_clash_with_police_(36421659232).jpg "Image License"
[5]: https://en.wikipedia.org/wiki/Unite_the_Right_rally#/media/File:Lee_Park,_Charlottesville,_VA.jpg "Image Source"
[6]: https://www.voanews.com/a/rallies-in-aftermath-of-charlottesville-violence/3983710.html "Image Source"
×
帕依提提提温馨提示
该数据集正在整理中,为您准备了其他渠道,请您使用
注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
暂无相关内容。
暂无相关内容。
- 分享你的想法
去分享你的想法~~
全部内容
欢迎交流分享
开始分享您的观点和意见,和大家一起交流分享.
数据使用声明:
- 1、该数据来自于互联网数据采集或服务商的提供,本平台为用户提供数据集的展示与浏览。
- 2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
- 3、数据集基本信息来自数据原地址或数据提供方提供的信息,如数据集描述中有描述差异,请以数据原地址或服务商原地址为准。
- 1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。
- 1、如您需要转载本站数据,请保留原数据地址及相关版权声明。
- 1、如本站中的部分数据涉及侵权展示,请及时联系本站,我们会安排进行数据下线。