Special Issue Paper
Journal of the Operational Research Society (2009) 60, 1069–1084. doi:10.1057/jors.2008.195; published online 8 April 2009
An optimization approach to partitional data clustering
J Kim1, J Yang2 and S Ólafsson3
- 1Korea Small Business Institute(KOSBI), Seoul, Korea
- 2Chonbuk National University, Jeonju, Jeonbuck, South Korea
- 3Iowa State University, Ames, IA, USA
Correspondence: J Yang, Department of Industrial and Information Systems Engineering, Chonbuk National University, 418 Engineering Bldg #6, Duckjin-Dong Duckjin-Gu, Jeonju, Jeonbuck 561 756, South Korea. E-mail: jkyang@chonbuk.ac.uk
Received September 2007; Accepted December 2008; Published online 8 April 2009.
Abstract
Scalability of clustering algorithms is a critical issue facing the data mining community. One method to handle this issue is to use only a subset of all instances. This paper develops an optimization-based approach to the partitional clustering problem using an algorithm specifically designed for noisy performance, which is a problem that arises when using a subset of instances. Numerical results show that computation time can be dramatically reduced by using a partial set of instances without sacrificing solution quality. In addition, these results are more persuasive as the size of the problem is larger.
Keywords:
optimization-based partitional clustering, scalability, partitioning
MORE ARTICLES LIKE THIS
These links to content published by Palgrave Macmillan are automatically generated.
RESEARCH
An optimization approach to partitional data clusteringJournal of the Operational Research Society Special Feature
Near-optimal feature selection for large databasesJournal of the Operational Research Society Special Feature
Visually driven analysis of movement data by progressive clusteringInformation Visualization Article




