French Presidential Election

2763.36M
Business,Computer Science,Programming,Politics Classification


    README.md

# Context

This dataset was born from a test of the Twitter streaming API: filtering and collecting data from the stream on a specific topic, in this case the [French election][1]. The script used for the data collection is available in this [GitHub repository][2].

Since 18 March, the [French election][3] has been in the final stretch before the first round of voting on 23 April 2017. The candidates are:

- M. Nicolas DUPONT-AIGNAN
- Mme Marine LE PEN
- M. Emmanuel MACRON
- M. Benoît HAMON
- Mme Nathalie ARTHAUD
- M. Philippe POUTOU
- M. Jacques CHEMINADE
- M. Jean LASSALLE
- M. Jean-Luc MÉLENCHON
- M. François ASSELINEAU
- M. François FILLON

The idea was to collect data from the Twitter API periodically. The acquisition process evolved as follows:

- Versions 1, 2 and 3: every hour, a Python script listened to the Twitter API stream for 10 minutes, over a period of 3 weeks.
- Version 4+: the new versions are based on a new data structure and start after the French Constitutional Council validated the candidates on 18 March 2017.

The data are stored in SQLite database files (in the database_number of the week_number_block_weekday.sqlite format) and will be updated as often as I can (at least every week).
After the first round (version 18+), I had to readjust the number of files per week, and Kaggle's 20-file limit pushed me to reduce the number of files to upload (for local analysis you can still combine version 17 with versions 18+).

**Example: number of mentions of the different candidates**

![Illustration of the mentions of the different candidates][4]

I have added to these databases a SQLite database that contains information from Google Trends about the top 5 candidates. This database contains:

- A table with the overall interest by region
- A table with the interest by region for each candidate
- A table with the top 25 associated queries for each candidate, in the top and rising rankings

# Content

In each SQLite file, you will find a data table that contains, for every row:

**Common**

- the index of the line
- the language of the tweet
- for each candidate, `mention_candidatename`: whether the candidate or their associated account was mentioned (0 or 1)
- the tweet
- the timestamp in milliseconds

**Version 4+**

- the day
- the hour (London timezone)
- the username of the user who posted the tweet
- the user's location (as given in their profile)
- whether the tweet is a retweet or a quote (0 or 1)
- the username that was retweeted
- the original tweet (the one retweeted or quoted)

# Acknowledgements

This election is gonna be intense.

# Inspiration

The first version of the dataset was just a test to collect the data and see the first pieces of work the community could do with it. The new versions are (I think and hope) suited to deep text analytics.
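
As a starting point for that kind of analysis, the per-candidate mention flags described under Content can be totalled with Python's standard `sqlite3` module. This is only a minimal sketch: the table name `data`, the column names `mention_Macron`, `mention_Fillon`, `lang`, `tweet` and `timestamp_ms`, and the demo rows are assumptions made for illustration, so check the actual schema of the file you download before relying on them.

```python
import sqlite3


def count_mentions(conn, table="data", prefix="mention_"):
    """Sum every 0/1 mention column in `table` and return {candidate: total}.

    The table name and the column prefix are assumptions based on the
    README's description; adjust them to the real schema.
    """
    # PRAGMA table_info returns one row per column; index 1 is the column name.
    columns = [row[1] for row in conn.execute(f'PRAGMA table_info("{table}")')]
    mention_cols = [c for c in columns if c.startswith(prefix)]
    if not mention_cols:
        return {}
    sums = ", ".join(f'COALESCE(SUM("{c}"), 0)' for c in mention_cols)
    totals = conn.execute(f'SELECT {sums} FROM "{table}"').fetchone()
    return {c[len(prefix):]: int(t) for c, t in zip(mention_cols, totals)}


def build_demo_db():
    """Create a tiny in-memory database with a hypothetical schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        'CREATE TABLE data ("index" INTEGER, lang TEXT, '
        "mention_Macron INTEGER, mention_Fillon INTEGER, "
        "tweet TEXT, timestamp_ms INTEGER)"
    )
    # Made-up placeholder rows, not taken from the dataset.
    rows = [
        (0, "fr", 1, 0, "placeholder tweet A", 1490000000000),
        (1, "fr", 0, 1, "placeholder tweet B", 1490000060000),
        (2, "en", 1, 0, "placeholder tweet C", 1490000120000),
    ]
    conn.executemany("INSERT INTO data VALUES (?, ?, ?, ?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    demo = build_demo_db()
    print(count_mentions(demo))
```

To run the same count on a downloaded file, replace `build_demo_db()` with `sqlite3.connect(path)`, where `path` points to one of the `.sqlite` files. Discovering the mention columns through `PRAGMA table_info` avoids hard-coding the candidate list, which may differ between dataset versions.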

[1]: https://en.wikipedia.org/wiki/French_presidential_election,_2017
[2]: https://github.com/jeanmidevacc/french-presidential-election-twitter-pov.git
[3]: https://en.wikipedia.org/wiki/French_presidential_election,_2017
[4]: https://www.kaggle.io/svf/1039441/a6d419b95420d7c28bd3e9a4437a6370/__results___files/__results___8_2.png
[5]: https://trends.google.co.uk/trends/explore?date=today%201-m&geo=FR&q=%2Fm%2F011ncr8c,%2Fm%2F0fqmlm,%2Fm%2F02rdgs,%2Fm%2F04zzm99,%2Fm%2F0551nw
[6]: https://www.kaggle.io/svf/1049376/e54172fcff84c97fff6583431eaf9567/__results___files/__results___16_1.png