Investigating Tree Family Machine Learning Techniques for a Predictive System to Unveil Software Defects

Complexity 2020:1-21 (2020)
  Copy   BIBTEX

Abstract

Software defects prediction at the initial period of the software development life cycle remains a critical and important assignment. Defect prediction and correctness leads to the assurance of the quality of software systems and has remained integral to study in the previous years. The quick forecast of imperfect or defective modules in software development can serve the development squad to use the existing assets competently and effectively to provide remarkable software products in a given short timeline. Hitherto, several researchers have industrialized defect prediction models by utilizing statistical and machine learning techniques that are operative and effective approaches to pinpoint the defective modules. Tree family machine learning techniques are well-thought-out to be one of the finest and ordinarily used supervised learning methods. In this study, different tree family machine learning techniques are employed for software defect prediction using ten benchmark datasets. These techniques include Credal Decision Tree, Cost-Sensitive Decision Forest, Decision Stump, Forest by Penalizing Attributes, Hoeffding Tree, Decision Tree, Logistic Model Tree, Random Forest, Random Tree, and REP-Tree. Performance of each technique is evaluated using different measures, i.e., mean absolute error, relative absolute error, root mean squared error, root relative squared error, specificity, precision, recall, F-measure, G-measure, Matthew’s correlation coefficient, and accuracy. The overall outcomes of this paper suggested RF technique by producing best results in terms of reducing error rates as well as increasing accuracy on five datasets, i.e., AR3, PC1, PC2, PC3, and PC4. The average accuracy achieved by RF is 90.2238%. The comprehensive outcomes of this study can be used as a reference point for other researchers. Any assertion concerning the enhancement in prediction through any new model, technique, or framework can be benchmarked and verified.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,752

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Machine Learning and Job Posting Classification: A Comparative Study.Ibrahim M. Nasser & Amjad H. Alzaanin - 2020 - International Journal of Engineering and Information Systems (IJEAIS) 4 (9):06-14.
Machine Learning Application to Predict The Quality of Watermelon Using JustNN.Ibrahim M. Nasser - 2019 - International Journal of Engineering and Information Systems (IJEAIS) 3 (10):1-8.
Software quality and group performance.Yuk Kuen Wong - 2009 - AI and Society 23 (4):559-573.
The Trouble With Thinking Like Arena: Learning to Use Simulation Software.Reinaldo J. Moraga & Diane M. Rodgers - 2011 - Bulletin of Science, Technology and Society 31 (2):144-152.
Software Intensive Science.John Symons & Jack Horner - 2014 - Philosophy and Technology 27 (3):461-477.

Analytics

Added to PP
2020-12-22

Downloads
71 (#230,328)

6 months
67 (#71,995)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Ahmad Arshad
York University

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references