公开数据集
数据结构 ? 0.97M
Data Structure ?
* 以上分析是由系统提取分析形成的结果,具体实际数据为准。
README.md
Many variables are included so that algorithms that select or learn weights for
attributes could be tested. However, clearly unrelated attributes were not included;
attributes were picked if there was any plausible connection to crime (N=122), plus
the attribute to be predicted (Per Capita Violent Crimes). The variables included in
the dataset involve the community, such as the percent of the population considered
urban, and the median family income, and involving law enforcement, such as per capita
number of police officers, and percent of officers assigned to drug units.
The per capita violent crimes variable was calculated using population and the sum of
crime variables considered violent crimes in the United States: murder, rape, robbery,
and assault. There was apparently some controversy in some states concerning the
counting of rapes. These resulted in missing values for rape, which resulted in
incorrect values for per capita violent crime. These cities are not included in the
dataset. Many of these omitted communities were from the midwestern USA.
Data is described below based on original values. All numeric data was normalized into
the decimal range 0.00-1.00 using an Unsupervised, equal-interval binning method.
Attributes retain their distribution and skew (hence for example the population
attribute has a mean value of 0.06 because most communities are small). E.g. An
attribute described as 'mean people per household' is actually the normalized (0-1)
version of that value.
The normalization preserves rough ratios of values WITHIN an attribute (e.g. double
the value for double the population within the available precision - except for
extreme values (all values more than 3 SD above the mean are normalized to 1.00; all
values more than 3 SD below the mean are nromalized to 0.00)).
However, the normalization does not preserve relationships between values BETWEEN
attributes (e.g. it would not be meaningful to compare the value for whitePerCap with
the value for blackPerCap for a community)
A limitation was that the LEMAS survey was of the police departments with at least 100
officers, plus a random sample of smaller departments. For our purposes, communities
not found in both census and crime datasets were omitted. Many communities are missing
LEMAS data.
×
帕依提提提温馨提示
该数据集正在整理中,为您准备了其他渠道,请您使用
注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。
暂无相关内容。
暂无相关内容。
- 分享你的想法
去分享你的想法~~
全部内容
欢迎交流分享
开始分享您的观点和意见,和大家一起交流分享.
数据使用声明:
- 1、该数据来自于互联网数据采集或服务商的提供,本平台为用户提供数据集的展示与浏览。
- 2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
- 3、数据集基本信息来自数据原地址或数据提供方提供的信息,如数据集描述中有描述差异,请以数据原地址或服务商原地址为准。
- 1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。
- 1、如您需要转载本站数据,请保留原数据地址及相关版权声明。
- 1、如本站中的部分数据涉及侵权展示,请及时联系本站,我们会安排进行数据下线。