Select Language

AI社区

公开数据集

数据泄露 2004-2017 (EN)

数据泄露 2004-2017 (EN)

0.09M
147 浏览
0 喜欢
0 次下载
0 条讨论
Internet,Data Visualization Classification

数据结构 ? 0.09M

    Data Structure ?

    * 以上分析是由系统提取分析形成的结果,具体实际数据为准。

    README.md

    Context Data breaches. Incidents in the world, that compromised more than 30000 records, between 2004 and 2017. English version. I wanted to visualize the data including the possibility to compare numbers between variable levels. I did some improvements in levels of variables as well as data, and I did a visualization. I also uploaded [this version of the dataset in Spanish][1]. I did the visualization with Tableau software. In this post in my blog, you can read more about it: [Spanish version][2] and [English version][3]. You can also see the visualization in this link: [Spanish version][4] and [English version][5]. Content The dataset has 270 observations and 11 variables. Most of them, are categorical variables. Incidents happened between 2004 and 2017. Last updated: February 2018. Format: CSV2. Variables (columns) [EN]: - Entity: name of the organization (public or private) that had the breach. String - Alternative Name: other known names of the entity. String - Story: tells a summary of what happened. String - Year: year of the breach. Date - Records Lost: number of records that the breach compromised.Integer - Sector: organization's main sector (or field of business). String - Method of Leak: main cause of the breach. String - 1st source (link): 1st. url with more info about the breach. String - 2nd source (link): 2nd. url with more info about the breach. String - 3rd source (link): 3rd. url with more info about the breach. String - Source name: name of the source of news, official reports, blog, etc. included. Note that some of them have changed after I replaced some previous broken links that the original dataset had. String Acknowledgements [Informationisbeautiful.net][6]. Before the improvements, a first dataset was downloaded from this site, by the end of 2017. Inspiration The main question to be answered with the data visualization was "What quantities of records were compromised by important data breaches, in organizations and sectors, between 2004 and 2017, and what was the reason?". I wanted to have a visual answer that allows to compare numbers between year, sector, and method of leak. It would be great to improve the dataset adding new variables for data mining in the future. Achieving a complete and exhaustive "Data Breaches 2004-2017" dataset, would help to an in-depth analysis of incidents in this period. 2017 has been the worst year in the history. [1]: https://www.kaggle.com/estratic/brechas-datos-2004-2017-es-20180218 [2]: https://www.estratic.com/2018/02/09/visualizando-brechas-de-datos-2004-2017/ [3]: https://www.estratic.com/2018/02/09/visualizing-data-breaches-2004-2017-2/ [4]: https://www.estratic.com/brechas-de-datos/ [5]: https://www.estratic.com/data-breaches-viz/ [6]: http://www.informationisbeautiful.net/visualizations/worlds-biggest-data-breaches-hacks/
    ×

    帕依提提提温馨提示

    该数据集正在整理中,为您准备了其他渠道,请您使用

    注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
    暂无相关内容。
    暂无相关内容。
    • 分享你的想法
    去分享你的想法~~

    全部内容

      欢迎交流分享
      开始分享您的观点和意见,和大家一起交流分享.
    所需积分:0 去赚积分?
    • 147浏览
    • 0下载
    • 0点赞
    • 收藏
    • 分享