公开数据集

具有复制声学特征数据集的Parkinson数据集

121K

726 浏览

0 喜欢

2 次下载

0 条讨论

Life Classification

Data Set Information:Important remarks before using this dataset: 1. Each row can not be used independently, because is......

数据介绍
文件预览
相关论文
Code
分享讨论(0)
使用声明

启动Notebook开发

数据结构 ? 121K

* 以上分析是由系统提取分析形成的结果，具体实际数据为准。

README.md

Data Set Information:

important remarks before using this dataset:

1. Each row can not be used independently, because is one of the three replications of one individual. Nature of data is dependent for each subject, but independent from one to another subject. So, traditional technique from machine learning can not be applied to this dataset, because those techniques are based on the independent nature of the instances. There are 240 instances but for only 80 subjects, so they are not independent. Techniques as those presented in Naranjo et al. (2016), Naranjo et al. (2017) or other specifically designed can be used.

2. The concept of replication considered here does not match the classical concept of statistical repeated measurements. The term 'replications' refers to the collection of features extracted from voice recordings belonging to the same subject. Since, in this context, features are extracted from multiple consecutive voice recordings from the same subject, in principle, the features should be identical. The imperfections in technology and the own biological variability result in non-identical replicated features that are more similar to one another than features from different subjects.

3. All information about how the dataset was generated is presented in Naranjo et al. (2016).

Attribute Information:

1. ID: Subjects's identifier.
2. Recording: Number of the recording.
3. Status: 0=Healthy; 1=PD
4. Gender: 0=Man; 1=Woman
5. Pitch local perturbation measures: relative jitter (Jitter_rel), absolute jitter (Jitter_abs), relative average perturbation (Jitter_RAP), and pitch perturbation quotient (Jitter_PPQ).
6. Amplitude perturbation measures: local shimmer (Shim_loc), shimmer in dB (Shim_dB), 3-point amplitude perturbation quotient (Shim_APQ3), 5-point amplitude perturbation quotient (Shim_APQ5), and 11-point amplitude perturbation quotient (Shim_APQ11).
7. Harmonic-to-noise ratio measures: harmonic-to-noise ratio in the frequency band 0-500 Hz (HNR05), in 0-1500 Hz (HNR15), in 0-2500 Hz (HNR25), in 0-3500 Hz (HNR35), and in 0-3800 Hz (HNR38).
8. Mel frequency cepstral coefficient-based spectral measures of order 0 to 12 (MFCC0, MFCC1,..., MFCC12) and their derivatives (Delta0, Delta1,..., Delta12).
9. Recurrence period density entropy (RPDE).
10. Detrended fluctuation analysis (DFA).
11. Pitch period entropy (PPE).
12. Glottal-to-noise excitation ratio (GNE).

Relevant Papers:

Naranjo, L., P??rez, C.J., Campos-Roca, Y., Mart?-n, J.: Addressing voice recording replications for Parkinsona€?s disease detection. Expert Systems With Applications 46, 286-292 (2016)

Naranjo, L., P??rez, C.J., Mart?-n, J., Campos-Roca, Y.: A two-stage variable selection and classification approach for Parkinsona€?s disease detection by using voice recording replications. Computer Methods and Programs in Biomedicine 142, 147-156 (2017)

Citation Request:

Naranjo, L., P??rez, C.J., Campos-Roca, Y., Mart?-n, J.: Addressing voice recording replications for Parkinsona€?s disease detection. Expert Systems With Applications 46, 286-292 (2016)

Carlos J. P??rez
Departamento de Matem??ticas, Universidad de Extremadura, C??ceres (Spain)
Email: carper '@' unex.es

暂无相关内容。

分享你的想法

去分享你的想法~~

全部内容

欢迎交流分享

开始分享您的观点和意见，和大家一起交流分享.

数据使用声明：

一、数据来源与展示说明：

1、该数据来自于互联网数据采集或服务商的提供，本平台为用户提供数据集的展示与浏览。
2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
3、数据集基本信息来自数据原地址或数据提供方提供的信息，如数据集描述中有描述差异，请以数据原地址或服务商原地址为准。

二、所有权说明：

1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。

三、数据转载说明：

1、如您需要转载本站数据，请保留原数据地址及相关版权声明。

四、侵权与处理说明：

1、如本站中的部分数据涉及侵权展示，请及时联系本站，我们会安排进行数据下线。

所需积分：

10 去赚积分？

726浏览
2下载
0点赞
收藏
分享

Select Language

AI社区

今日排行

本月搜索

Dataset Category