公开数据集
数据结构 ? 16.31M
Data Structure ?
* 以上分析是由系统提取分析形成的结果,具体实际数据为准。
README.md
Data Set Information:
The original image collection was obtained from Corel at [Web link]. There are 68,040 photo images from various categories.
Each set of features is stored in a separate file. For each file, a line corresponds to a single image. The first value in a line is is the image ID and the subsequent values are the feature vector (e.g. color histogram, etc.) of the image. The same image has the same ID in all files but the image ID is not the same as the image filename.
Attribute Information:
From each image four sets of features were extracted:
- Color Histogram
- Color Histogram Layout
- Color Moments
- Co-occurrence Texture
Color Histogram: 32 dimensions (8 x 4 = H x S)
- HSV color space is divided into 32 subspaces (32 colors : 8 ranges of H and 4 ranges of S).
- the value in each dimension in a ColorHistogram of an image is the density of each color in the entire image.
- Histogram intersection (overlap area between ColorHistograms of two images) can be used to measure the similarity between two images.
Color Histogram Layout: 32 dimensions (4 x 2 x 4 = H x S x sub-images)
- each image is divided into 4 sub-images (one horizontal split and one vertical split).
- 4x2 Color Histogram for each sub-image is computed.
- Histogram Intersection can be used to measure the similarity between two images.
Color Moments: 9 dimensions (3 x 3)
- the 9 values are: (one for each of H,S, and V in HSV color space)
-- mean,
-- standard deviation, and
-- skewness.
- Euclidean distance between Color Moments of two images can be used to represent the dis-similarity (distance) between two images.
Co-occurrence Texture: 16 dimensions (4 x 4)
- images are converted to 16 gray-scale images.
- co-ocurrence in 4 directions is computed (horizontal, vertical, and two diagonal directions). the 16 values are: (one for each direction).
-- Second Angular Moment,
-- Contrast,
I -- nverse Difference Moment, and
-- Entropy.
-Euclidean distance between ColorMoments of two images can be used to measure the dis-similarity (distance) between two images.
Relevant Papers:
Michael Ortega, Yong Rui, Kaushik Chakrabarti, Kriengkrai Porkaew, Sharad Mehrotra, and Thomas S. Huang, Supporting Ranked Boolean Similarity Queries in MARS, IEEE Transaction on Knowledge and Data Engineering, Vol. 10, No. 6, Pages 905-925, December 1998.
[Web link]
Kaushik Chakrabarti, and Sharad Mehrotra, The Hybrid Tree: An Index Structure for High Dimensional Feature Spaces, 1999 IEEE International Conference on Data Engineering (ICDE), Pages 440-447, February, 1999.
[Web link]
Kriengkrai Porkaew, Kaushik Chakrabarti, and Sharad Mehrotra, Query Refinement for Multimedia Retrieval and its evaluation Techniques in MARS, 1999 ACM International Multimedia Conference, Orlando, Florida, Oct 30 - Nov 4, 1999.
[Web link]
Kaushik Chakrabarti, Kriengkrai Porkaew, and Sharad Mehrotra, Efficient Query Refinement in Multimedia Databases, ICDE, 2000
[Web link]
Papers That Cite This Data Set1:
Thomas T. Osugi and M. S. EXPLORATION-baseD ACTIVE MACHINE LEARNING. Faculty of The Graduate College at the University of Nebraska In Partial Fulfillment of Requirements. [View Context].
Citation Request:
This data may be used for non-commercial purposes only.
Original Owner:
Michael Ortega-Binderberger
Information and Computer Science
University of California at Irvine
Irvine, CA 92697-3425
USA
miki '@' ics.uci.edu
Donor:
Kriengkrai Porkaew and Sharad Mehrotra
Information and Computer Science
University of California at Irvine
Irvine, CA 92697-3425
USA
nid '@' ics.uci.edu,sharad '@' ics.uci.edu
帕依提提提温馨提示
该数据集正在整理中,为您准备了其他渠道,请您使用
- 分享你的想法
全部内容
数据使用声明:
- 1、该数据来自于互联网数据采集或服务商的提供,本平台为用户提供数据集的展示与浏览。
- 2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
- 3、数据集基本信息来自数据原地址或数据提供方提供的信息,如数据集描述中有描述差异,请以数据原地址或服务商原地址为准。
- 1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。
- 1、如您需要转载本站数据,请保留原数据地址及相关版权声明。
- 1、如本站中的部分数据涉及侵权展示,请及时联系本站,我们会安排进行数据下线。