For Authors_For Subscribers_For Librarians_For SocietiesFor Advertisers

Home | About Us | Contact Us | Site Map | FAQs

journal home
 
Services for Readers
Services for authors
Customer Services


December 2003, Volume 2, Number 4, Pages 232-246
Table of contents   Previous  Full text  Next   PDF
Original Article
Coordinating computational and visual approaches for interactive feature selection and multivariate clustering
Diansheng Guo1

1GeoVISTA Center & Department of Geography, The Pennsylvania State University, University Park, PA, U.S.A.

Correspondence to: Diansheng Guo, GeoVISTA Center & Department of Geography, The Pennsylvania State University, 302 Walker Building, University Park, PA 16802, U.S.A. Tel: +1 814 865 3433; fax: +1 814 863 7943; E-mail: dguo@psu.edu

Abstract

Unknown (and unexpected) multivariate patterns lurking in high-dimensional datasets are often very hard to find. This paper describes a human-centered exploration environment, which incorporates a coordinated suite of computational and visualization methods to explore high-dimensional data for uncovering patterns in multivariate spaces. Specifically, it includes: (1) an interactive feature selection method for identifying potentially interesting, multidimensional subspaces from a high-dimensional data space, (2) an interactive, hierarchical clustering method for searching multivariate clusters of arbitrary shape, and (3) a suite of coordinated visualization and computational components centered around the above two methods to facilitate a human-led exploration. The implemented system is used to analyze a cancer dataset and shows that it is efficient and effective for discovering unknown and unexpected multivariate patterns from high-dimensional data.

Information Visualization (2003) 2, 232-246. doi:10.1057/palgrave.ivs.9500053

Keywords

data mining and knowledge discovery; feature selection; mutual information; entropy; interactive visualization; hierarchical clustering

Received 5 August 2003; revised 15 September 2003; accepted 17 September 2003
Table of contents   Previous  Full text  Next   PDF