International Journal of
Physical Sciences

  • Abbreviation: Int. J. Phys. Sci.
  • Language: English
  • ISSN: 1992-1950
  • DOI: 10.5897/IJPS
  • Start Year: 2006
  • Published Articles: 2568

Full Length Research Paper

A subspace ensemble based data dependent binary classification model

Muhammad A. Khan1* and Anwar M. Mirza2
  1Department of Computer Science, National University of Computer and Emerging Sciences. A. K Brohi road, H-11/4, Islamabad 44000, Pakistan. 2College of Computer and Information Sciences, King Saud University, Saudi Arabia.
Email: [email protected]

  •  Accepted: 24 June 2011
  •  Published: 09 December 2011

Abstract

 

Classifying patterns into two classes is a typical problem of binary classification in pattern recognition. Binary classification is an industrial problem in many fields like medicine, search mechanism, diagnostic of disease in humans, security and many other aspects. In this paper, we have proposed a random subspace based ensemble data dependent classification model for the binary classification problem. The proposed method makes use of the information about the structure of given data and the availability of the training instances, in selection of the classification model. A subspace ensemble for a set of one class and two-class classifier are generated, trained and tested on the given data. The proposed method is evaluated on receiver operating curve (ROC), cross validation accuracy and Q-statistics. From the empirical results obtained, we have reached the following conclusions that the overall performance of the two class ensemble was better because of the ability of the ensemble to make use of the knowledge of both the positive and negative samples and thus constructs better class boundaries. The one class ensemble makes use of positive samples only and gives better performance when (i) training patterns are sparse and (ii) outlier detection is required.

 

Key words: One class classifier, two-class classifier, binary classification, classification model, receiver operating curve (ROC), evaluation, Q- statistics.