# Context
## Survey data (2014, 2016, 2017 and 2018)
The aim of this dataset is to provide access to the raw survey data from the 2016, 2017 and 2018 OSMI mental health in technology surveys, in order to facilitate analysis such as [my kernel fusing the OSMI surveys across time periods](https://www.kaggle.com/ekwiecinska96/dataset-creation-fusing-surveys-from-2014-2018).
This is because the popular 2014 dataset uploaded to Kaggle has already been pre-processed and cleaned (and the only other 2016 upload does not play nice with kernels). Whilst this is useful, many columns were renamed into simple attributes, e.g. 'Are you self-employed?' is standardised to 'self_employed'. As none of the surveys from the following years have had this treatment, it was difficult to reverse-engineer the processing steps to make the attributes match. Also, it's great to have all the data in one place.
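To illustrate the kind of renaming described above, here is a minimal sketch of how a raw question could be mapped to a simple attribute name. It is not the processing used for the 2014 upload (those steps are not documented here). The `MANUAL_NAMES` mapping, the `standardise` helper and the second example question are all hypothetical.

```python
import re

# Hypothetical hand-written overrides: raw question text -> short attribute name.
# Only the first pair comes from the README; extend as needed.
MANUAL_NAMES = {
    "Are you self-employed?": "self_employed",
}


def standardise(column):
    """Return a short snake_case attribute name for a raw survey question."""
    if column in MANUAL_NAMES:
        return MANUAL_NAMES[column]
    # Fallback: lower-case the text and collapse anything that is not a
    # letter or digit into a single underscore.
    return re.sub(r"[^a-z0-9]+", "_", column.lower()).strip("_")


# "What is your age?" is an illustrative question, not a confirmed column name.
raw_columns = ["Are you self-employed?", "What is your age?"]
renamed = {column: standardise(column) for column in raw_columns}
```

With pandas, a mapping like `renamed` could then be passed to `DataFrame.rename(columns=renamed)` so that the same attribute names are used across survey years.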
## Similarity matrix
The associated similarity matrix, stored as a numpy-readable file (.npy), is a supplementary file for the previously mentioned kernel. It was uploaded because, unfortunately, any [spaCy models](https://spacy.io/usage/models) that contain word vectors (i.e. any model other than *sm*) are not supported by Kaggle at the time of writing (Jun 2019). Please see the associated kernel for more information on how this matrix was created.
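As a rough sketch of how a `.npy` file like this can be read, the example below saves a toy matrix and loads it back with `numpy.load`. The file name, the toy values and the `most_similar` helper are assumptions for illustration only. The sketch also assumes a square matrix whose rows and columns refer to the same items, which the associated kernel should confirm.

```python
import os
import tempfile

import numpy as np

# Toy stand-in for the real matrix: pairwise similarity scores for 3 items.
toy = np.array([
    [1.0, 0.8, 0.1],
    [0.8, 1.0, 0.3],
    [0.1, 0.3, 1.0],
])

# Hypothetical file name; the dataset's actual .npy file may be named differently.
path = os.path.join(tempfile.mkdtemp(), "similarity_matrix.npy")
np.save(path, toy)

# Loading works the same way for the real file once its path is known.
similarity = np.load(path)


def most_similar(matrix, row):
    """Return the index of the item most similar to `row`, excluding itself."""
    scores = matrix[row].astype(float).copy()
    scores[row] = -np.inf  # ignore the trivial self-match on the diagonal
    return int(np.argmax(scores))


best_match = most_similar(similarity, 0)
```

Masking the diagonal before taking the maximum avoids every item simply matching itself.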
## Acknowledgements
The original data collection and hosting were provided by [Open-Sourcing Mental Illness (OSMI)](https://osmihelp.org). You can find all of the datasets (including 2016, 2017 and 2018) [here](https://osmihelp.org/research).
# Inspiration
These datasets were uploaded to give Kaggle users such as myself greater control over the pre-processing and standardisation of attributes.