Special Issue Paper

Journal of the Operational Research Society (2009) 60, 1069–1084. doi:10.1057/jors.2008.195; published online 8 April 2009

An optimization approach to partitional data clustering

J Kim1, J Yang2 and S Ólafsson3

  1. 1Korea Small Business Institute(KOSBI), Seoul, Korea
  2. 2Chonbuk National University, Jeonju, Jeonbuck, South Korea
  3. 3Iowa State University, Ames, IA, USA

Correspondence: J Yang, Department of Industrial and Information Systems Engineering, Chonbuk National University, 418 Engineering Bldg #6, Duckjin-Dong Duckjin-Gu, Jeonju, Jeonbuck 561 756, South Korea. E-mail: jkyang@chonbuk.ac.uk

Received September 2007; Accepted December 2008; Published online 8 April 2009.

Top

Abstract

Scalability of clustering algorithms is a critical issue facing the data mining community. One method to handle this issue is to use only a subset of all instances. This paper develops an optimization-based approach to the partitional clustering problem using an algorithm specifically designed for noisy performance, which is a problem that arises when using a subset of instances. Numerical results show that computation time can be dramatically reduced by using a partial set of instances without sacrificing solution quality. In addition, these results are more persuasive as the size of the problem is larger.

Keywords:

optimization-based partitional clustering, scalability, partitioning

MORE ARTICLES LIKE THIS

These links to content published by Palgrave Macmillan are automatically generated.

RESEARCH

An optimization approach to partitional data clustering

Journal of the Operational Research Society Special Feature

Near-optimal feature selection for large databases

Journal of the Operational Research Society Special Feature

Visually driven analysis of movement data by progressive clustering

Information Visualization Article

Extra navigation

.

Society resources

ADVERTISEMENT
JORS-Link to full archive