Mining meaning from Wikipedia

International Journal of Human-Computer Interactions 67 (9):716-754 (2009)
  Copy   BIBTEX

Abstract

Wikipedia is a goldmine of information; not just for its many readers, but also for the growing community of researchers who recognize it as a resource of exceptional scale and utility. It represents a vast investment of manual effort and judgment: a huge, constantly evolving tapestry of concepts and relations that is being applied to a host of tasks. This article provides a comprehensive description of this work. It focuses on research that extracts and makes use of the concepts, relations, facts and descriptions found in Wikipedia, and organizes the work into four broad categories: applying Wikipedia to natural language processing; using it to facilitate information retrieval and information extraction; and as a resource for ontology building. The article addresses how Wikipedia is being used as is, how it is being improved and adapted, and how it is being combined with other structures to create entirely new resources. We identify the research groups and individuals involved, and how their work has developed in the last few years. We provide a comprehensive list of the open-source software they have produced.

Links

PhilArchive

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Toward an epistemology of Wikipedia.Don Fallis - 2008 - Journal of the American Society for Information Science and Technology 59 (10):1662--1674.
Integrating Cyc and Wikipedia: Folksonomy meets rigorously defined common-sense.Olena Medelyan & Catherine Legg - 2008 - Proceedings of Wikipedia and AI Workshop at the AAAI-08 Conference. Chicago, US, July 12 2008.
“All you can eat” ontology-building: Feeding Wikipedia to Cyc.Samuel Sarjant, Catherine Legg, Olena Medelyan & Michael Robinson - 2009 - IEEE/WIC/ACM International Conference on Web Intelligence (WI-09), 15 – 18 September 2009 Università Degli Studi di Milano Bicocca, Milano, Italy.
Transparency and social responsibility issues for wikipedia.Adele Santana & Donna J. Wood - 2009 - Ethics and Information Technology 11 (2):133-144.
Bill Gates is not a parking meter: Philosophical quality control in automated ontology building.Catherine Legg & Samuel Sarjant - 2012 - Proceedings of the Symposium on Computational Philosophy, AISB/IACAP World Congress 2012 (Birmingham, England, July 2-6).
Ontologies on the Semantic Web.Catherine Legg - 2007 - Annual Review of Information Science and Technology 41:407-451.
Epistemology and the Wikipedia.P. D. Magnus - 2006 - North American Computing and Philosophy Conference.
Sense and Reference on the Web.Harry Halpin - 2011 - Minds and Machines 21 (2):153-178.

Analytics

Added to PP
2010-12-22

Downloads
401 (#47,751)

6 months
55 (#76,961)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Cathy Legg
Deakin University

Citations of this work

No citations found.

Add more citations

References found in this work

Word and Object.Willard Van Orman Quine - 1960 - Cambridge, MA, USA: MIT Press.
Word and Object.Willard Van Orman Quine - 1960 - Les Etudes Philosophiques 17 (2):278-279.
The Fixation of Belief.C. S. Peirce - 1877 - Popular Science Monthly 12 (1):1-15.
The ontology of the Gene Ontology.Barry Smith, Jennifer Williams & Steffen Schulze-Kremer - 2003 - In Smith Barry, Williams Jennifer & Schulze-Kremer Steffen (eds.), AMIA 2003 Symposium Proceedings. AMIA. pp. 609-613.

View all 10 references / Add more references