Analysis of the Use of Background Distribution for Naive Bayes Classifiers

Journal of Intelligent Systems 28 (2):259-273 (2019)
  Copy   BIBTEX

Abstract

The naive Bayes classifier is a popular classifier, as it is easy to train, requires no cross-validation for parameter tuning, and can be easily extended due to its generative model. Moreover, recently it was shown that the word probabilities estimated from large unlabeled corpora could be used to improve the parameter estimation of naive Bayes. However, previous methods do not explicitly allow to control how much the background distribution can influence the estimation of naive Bayes parameters. In contrast, we investigate an extension of the graphical model of naive Bayes such that a word is either generated from a background distribution or from a class-specific word distribution. We theoretically analyze this model and show the connection to Jelinek-Mercer smoothing. Experiments using four standard text classification data sets show that the proposed method can statistically significantly outperform previous methods that use the same background distribution.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,897

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Bayes's theorem. [REVIEW]Massimo Pigliucci - 2005 - Quarterly Review of Biology 80 (1):93-95.
Bayes, Hume, Price, and Miracles.John Earman - 2002 - In Richard Swinburne (ed.), Bayes’s Theorem. Oxford University Press. pp. 91--110.
A Logic for Trial and Error Classifiers.Martin Kaså - 2015 - Journal of Logic, Language and Information 24 (3):307-322.
Bayes' Theorem.Richard Swinburne - 2004 - Revue Philosophique de la France Et de l'Etranger 194 (2):250-251.
Individual-denoting classifiers.Mana Kobuchi-Philip - 2007 - Natural Language Semantics 15 (2):95-130.

Analytics

Added to PP
2017-12-14

Downloads
13 (#1,036,774)

6 months
3 (#976,478)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references