A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction

Cagatay Catal

Download from

dx.doi.org

A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction

Cagatay Catal

Journal of Intelligent Systems 23 (1):75-82 (2014) Copy BIBT_EX

Abstract

Predicting the defect-prone modules when the previous defect labels of modules are limited is a challenging problem encountered in the software industry. Supervised classification approaches cannot build high-performance prediction models with few defect data, leading to the need for new methods, techniques, and tools. One solution is to combine labeled data points with unlabeled data points during learning phase. Semi-supervised classification methods use not only labeled data points but also unlabeled ones to improve the generalization capability. In this study, we evaluated four semi-supervised classification methods for semi-supervised defect prediction. Low-density separation, support vector machine, expectation-maximization, and class mass normalization methods have been investigated on NASA data sets, which are CM1, KC1, KC2, and PC1. Experimental results showed that SVM and LDS algorithms outperform CMN and EM-SEMI algorithms. In addition, LDS algorithm performs much better than SVM when the data set is large. In this study, the LDS-based prediction approach is suggested for software defect prediction when there are limited fault data.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

Keywords

Add keywords

Reprint years

DOI

10.1515/jisys-2013-0030

My notes

Similar books and articles

Human Semi-Supervised Learning.Bryan R. Gibson, Timothy T. Rogers & Xiaojin Zhu - 2013 - Topics in Cognitive Science 5 (1):132-172.

Instance Based Classification for Decision Making in Network Data.Amarjit Singh, Parag Kulkarni & Shankar Lal - 2012 - Journal of Intelligent Systems 21 (2):167-193.

Can semi-supervised learning explain incorrect beliefs about categories?Charles W. Kalish, Timothy T. Rogers, Jonathan Lang & Xiaojin Zhu - 2011 - Cognition 120 (1):106-118.

Between the fundamental and the phenomenological: The challenge of 'semi-empirical' methods.Jeffry L. Ramsey - 1997 - Philosophy of Science 64 (4):627-653.

Semi-supervised ensemble learning of data streams in the presence of concept drift.Zahra Ahmadi & Hamid Beigy - 2012 - In Emilio Corchado, Vaclav Snasel, Ajith Abraham, Michał Woźniak, Manuel Grana & Sung-Bae Cho (eds.), Hybrid Artificial Intelligent Systems. Springer. pp. 526--537.

A Hybrid Approach To Learn With Imbalanced Classes Using Evolutionary Algorithms.Claudia Milaré, Gustavo Batista & André Carvalho - 2011 - Logic Journal of the IGPL 19 (2):293-303.

CLEAR: Class Level Software Refactoring Using Evolutionary Algorithms.Chenxiang Yuan, Bo Jiang, Weifeng Pan & Muchou Wang - 2015 - Journal of Intelligent Systems 24 (1):85-97.

A novel network framework using similar-to-different learning strategy.Bhanu Prakash Battula & R. Satya Prasad - 2015 - AI and Society 30 (1):129-138.

Reflex Fuzzy Min Max Neural Network for Semi-supervised Learning.A. V. Nandedkar & P.Κ Biswas - 2008 - Journal of Intelligent Systems 17 (1-3):5-18.

Knowledge Supervised Text Classification with No Labeled Documents.Congle Zhang, Gui-Rong Xue & Yong Yu - 2008 - In Tu-Bao Ho & Zhi-Hua Zhou (eds.), Pricai 2008: Trends in Artificial Intelligence. Springer. pp. 509--520.

Backend Framework and Software Approach to Compute Earthquake Parameters from Signals Recorded by Seismic Instrumentation System.Raman K. Attri - manuscript

One of these greebles is not like the others: Semi-supervised models for similarity structures.Rachel G. Stephens & Daniel J. Navarro - 2008 - In B. C. Love, K. McRae & V. M. Sloutsky (eds.), Proceedings of the 30th Annual Conference of the Cognitive Science Society. Cognitive Science Society. pp. 1996--2001.

Active learning approach to concept drift problem.Bartosz Kurlej & Michal Wozniak - 2012 - Logic Journal of the IGPL 20 (3):550-559.

On the Theoretical Limits to Reliable Causal Inference.Benoit Desjardins - 1999 - Dissertation, University of Pittsburgh

Statistical Learning Theory: A Tutorial.Sanjeev R. Kulkarni & Gilbert Harman - 2011 - Wiley Interdisciplinary Reviews: Computational Statistics 3 (6):543-556.

Analytics

Added to PP
2017-01-12

Downloads
13 (#978,482)

6 months
6 (#431,022)

Historical graph of downloads

How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

A Comparison of Semi-Supervised Classification Approaches for Software Defect Prediction

Abstract

Categories

Keywords

Reprint years

DOI

Links

PhilArchive

External links

Through your library

My notes

Similar books and articles

Analytics

Citations of this work

References found in this work