Abstract
We propose two multi-class classification methods based on a signomial function. Each method constructs a multi-class classifier directly by solving a single optimization problem. Because the number of possible signomial terms is extremely large, we propose a column generation method that iteratively generates good signomial terms. Both methods achieve classification accuracies better than or comparable to those of existing methods, while producing sparser classifiers.
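To make the idea concrete, the sketch below shows what a signomial discriminant looks like and how a multi-class decision is made by taking the class with the largest discriminant value. This is an illustrative toy, not the paper's formulation: the term exponents and coefficients are hand-picked here, whereas in the paper they are selected by column generation within a single optimization problem; inputs are assumed strictly positive so that real-valued exponents are well defined.

```python
import numpy as np

def signomial(x, coeffs, exponents):
    """Evaluate f(x) = sum_t c_t * prod_j x_j ** a_tj.

    A signomial allows real-valued exponents and coefficients of either
    sign, so x is assumed to be strictly positive componentwise.
    """
    return sum(c * np.prod(x ** a) for c, a in zip(coeffs, exponents))

def predict(x, classifiers):
    """Assign x to the class whose signomial discriminant is largest."""
    scores = {k: signomial(x, c, e) for k, (c, e) in classifiers.items()}
    return max(scores, key=scores.get)

# Toy two-class example with hypothetical, hand-picked terms.
# Class "A": f_A(x) = 1.0 * x1 - 0.5 * x2**2
# Class "B": f_B(x) = 0.8 * x1**0.5 * x2**0.5
clfs = {
    "A": ([1.0, -0.5], [np.array([1.0, 0.0]), np.array([0.0, 2.0])]),
    "B": ([0.8],       [np.array([0.5, 0.5])]),
}

x = np.array([2.0, 1.0])
print(predict(x, clfs))  # f_A = 1.5, f_B ≈ 1.13, so "A"
```

In the paper's setting, the set of candidate terms (the exponent vectors) is far too large to enumerate, which is why a column generation scheme that prices in promising terms one at a time is needed; this sketch only fixes a tiny term set by hand.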
Acknowledgements
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (2012–006351).
Cite this article
Hwang, K., Lee, K., Lee, C. et al. Multi-class classification using a signomial function. J Oper Res Soc 66, 434–449 (2015). https://doi.org/10.1057/jors.2013.180