Cascading k-means with Ensemble Learning: Enhanced Categorization of Diabetic Data

Journal of Intelligent Systems 21 (3):237-253 (2012)

Abstract

This paper illustrates the application of various ensemble methods to improve classification accuracy, using the Pima Indian Diabetic Dataset as the case in point. The computational model comprises two stages. In the first stage, k-means clustering is employed to identify and eliminate wrongly classified instances. In the second stage, the classification is fine-tuned: ensemble methods (AdaBoost, bagging, dagging, stacking, decorate, rotation forest, random subspace, MultiBoost and grading) are invoked along with five chosen base classifiers, namely support vector machine (SVM), radial basis function (RBF) network, decision tree J48, naïve Bayes and Bayesian network. The k-fold cross-validation technique is adopted. Computational experiments with the proposed method showed an improvement of 16.14% to 22.49% in classification accuracy compared with results reported in the literature. Among the ensemble methods tried, the MultiBoost ensemble with the SVM classifier and the grading ensemble with naïve Bayes showed the best performance, followed by the MultiBoost, stacking and grading ensembles with the Bayesian classifier; the rotation forest ensemble with RBF; and the grading and rotation forest ensembles with J48. This investigation conclusively proves the significance of cascading k-means clustering with ensemble methods for enhanced accuracy in the categorization of the diabetic dataset.
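
The two-stage cascade described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the data are synthetic (not the Pima dataset), the filtering rule (drop a record when its label disagrees with the majority label of its cluster) is one plausible reading of "wrongly classified instances", the farthest-point centroid initialisation is chosen only to keep the sketch deterministic, and a simple nearest-centroid learner stands in for the base classifiers named in the abstract. Bagging is used as the ensemble because it is the simplest of the listed methods. All function names are hypothetical.

```python
import random
from collections import Counter

def dist2(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean(points):
    return [sum(col) / len(points) for col in zip(*points)]

def kmeans(X, k, iters=50):
    """Lloyd's k-means with farthest-point initialisation (an assumption)."""
    centroids = [X[0]]
    while len(centroids) < k:
        centroids.append(max(X, key=lambda x: min(dist2(x, c) for c in centroids)))
    assign = None
    for _ in range(iters):
        new_assign = [min(range(k), key=lambda c: dist2(x, centroids[c])) for x in X]
        if new_assign == assign:
            break  # assignments stopped changing
        assign = new_assign
        for c in range(k):
            members = [x for x, a in zip(X, assign) if a == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = mean(members)
    return assign

def filter_by_cluster(X, y, k=2):
    """Stage 1: drop rows whose label disagrees with their cluster's majority label."""
    assign = kmeans(X, k)
    majority = {c: Counter(l for l, a in zip(y, assign) if a == c).most_common(1)[0][0]
                for c in set(assign)}
    keep = [i for i, (l, a) in enumerate(zip(y, assign)) if majority[a] == l]
    return [X[i] for i in keep], [y[i] for i in keep]

def fit_centroids(X, y):
    """Hypothetical base learner: one centroid per class label."""
    return {lbl: mean([x for x, l in zip(X, y) if l == lbl]) for lbl in set(y)}

def predict_centroids(model, x):
    return min(model, key=lambda lbl: dist2(x, model[lbl]))

def fit_bagging(X, y, n_models=11, seed=0):
    """Stage 2: train each base learner on a bootstrap resample."""
    rng = random.Random(seed)
    models = []
    for _ in range(n_models):
        idx = [rng.randrange(len(X)) for _ in X]
        models.append(fit_centroids([X[i] for i in idx], [y[i] for i in idx]))
    return models

def predict_bagging(models, x):
    """Majority vote over the base learners."""
    return Counter(predict_centroids(m, x) for m in models).most_common(1)[0][0]

def cross_val_accuracy(X, y, folds=10, seed=0):
    """k-fold cross-validation accuracy of the bagged ensemble."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    correct = 0
    for f in range(folds):
        test = idx[f::folds]
        held_out = set(test)
        train = [i for i in idx if i not in held_out]
        models = fit_bagging([X[i] for i in train], [y[i] for i in train])
        correct += sum(predict_bagging(models, X[i]) == y[i] for i in test)
    return correct / len(X)

def make_toy_data(n=200, noise=0.15, seed=1):
    """Synthetic two-class data with some flipped labels (not the Pima dataset)."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        label = rng.randrange(2)
        center = 0.0 if label == 0 else 6.0
        X.append([rng.gauss(center, 1.0), rng.gauss(center, 1.0)])
        y.append(1 - label if rng.random() < noise else label)
    return X, y

X, y = make_toy_data()
Xf, yf = filter_by_cluster(X, y)
acc_raw = cross_val_accuracy(X, y)
acc_cascade = cross_val_accuracy(Xf, yf)
print(f"kept {len(Xf)} of {len(X)} rows; accuracy raw={acc_raw:.3f}, cascade={acc_cascade:.3f}")
```

Note that in this sketch the cascaded accuracy is measured on the filtered data only, so part of any gain comes from removing hard or mislabelled records from the evaluation set as well as from the training set. The abstract does not say how the paper handles this, so the toy numbers should not be compared with the reported 16.14% to 22.49% improvement.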



Analytics

Added to PP
2017-01-11
