Big Data, epistemology and causality: Knowledge in and knowledge out in EXPOsOMICS

Big Data and Society 3 (2) (2016)
  Copy   BIBTEX

Abstract

Recently, it has been argued that the use of Big Data transforms the sciences, making data-driven research possible and studying causality redundant. In this paper, I focus on the claim on causal knowledge by examining the Big Data project EXPOsOMICS, whose research is funded by the European Commission and considered capable of improving our understanding of the relation between exposure and disease. While EXPOsOMICS may seem the perfect exemplification of the data-driven view, I show how causal knowledge is necessary for the project, both as a source for handling complexity and as an output for meeting the project’s goals. Consequently, I argue that data-driven claims about causality are fundamentally flawed and causal knowledge should be considered a necessary aspect of Big Data science. In addition, I present the consequences of this result on other data-driven claims, concerning the role of theoretical considerations. I argue that the importance of causal knowledge and other kinds of theoretical engagement in EXPOsOMICS undermine theory-free accounts and suggest alternative ways of framing science based on Big Data.

Links

PhilArchive

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

The Causal Nature of Modeling with Big Data.Wolfgang Pietsch - 2016 - Philosophy and Technology 29 (2):137-171.
Aspects of Theory-Ladenness in Data-Intensive Science.Wolfgang Pietsch - 2015 - Philosophy of Science 82 (5):905-916.
What Counts as Scientific Data? A Relational Framework.Sabina Leonelli - 2015 - Philosophy of Science 82 (5):810-821.
Causal criteria and the problem of complex causation.Andrew Ward - 2009 - Medicine, Health Care and Philosophy 12 (3):333-343.
Data fusion with probabilistic conditional logic.Jens Fisseler & Imre Fehér - 2010 - Logic Journal of the IGPL 18 (4):488-507.
Data models and the acquisition and manipulation of data.Todd Harris - 2003 - Philosophy of Science 70 (5):1508-1517.
Classificatory Theory in Data-intensive Science: The Case of Open Biomedical Ontologies.Sabina Leonelli - 2012 - International Studies in the Philosophy of Science 26 (1):47 - 65.
Data Interpretation in the Digital Age.Sabina Leonelli - 2014 - Perspectives on Science 22 (3):397-417.
Knowledge-driven versus data-driven logics.Didier Dubois, Petr Hájek & Henri Prade - 2000 - Journal of Logic, Language and Information 9 (1):65--89.

Analytics

Added to PP
2016-10-05

Downloads
645 (#25,336)

6 months
93 (#44,040)

Historical graph of downloads
How can I increase my downloads?

Author's Profile

Stefano Canali
Politecnico di Milano

References found in this work

Causality: Models, Reasoning and Inference.Judea Pearl - 2000 - New York: Cambridge University Press.
Nature's capacities and their measurement.Nancy Cartwright - 1989 - New York: Oxford University Press.
Causality: Models, Reasoning and Inference.Judea Pearl - 2000 - Tijdschrift Voor Filosofie 64 (1):201-202.
Unsimple Truths: Science, Complexity, and Policy.Sandra D. Mitchell - 2009 - London: University of Chicago Press.

View all 23 references / Add more references