What makes Big Data, Big Data? Exploring the ontological characteristics of 26 datasets

Big Data and Society 3 (1) (2016)
  Copy   BIBTEX

Abstract

Big Data has been variously defined in the literature. In the main, definitions suggest that Big Data possess a suite of key traits: volume, velocity and variety, but also exhaustivity, resolution, indexicality, relationality, extensionality and scalability. However, these definitions lack ontological clarity, with the term acting as an amorphous, catch-all label for a wide selection of data. In this paper, we consider the question ‘what makes Big Data, Big Data?’, applying Kitchin’s taxonomy of seven Big Data traits to 26 datasets drawn from seven domains, each of which is considered in the literature to constitute Big Data. The results demonstrate that only a handful of datasets possess all seven traits, and some do not possess either volume and/or variety. Instead, there are multiple forms of Big Data. Our analysis reveals that the key definitional boundary markers are the traits of velocity and exhaustivity. We contend that Big Data as an analytical category needs to be unpacked, with the genus of Big Data further delineated and its various species identified. It is only through such ontological work that we will gain conceptual clarity about what constitutes Big Data, formulate how best to make sense of it, and identify how it might be best used to make sense of the world.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,349

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Bodies of Data: Genomic Data and Bioscience Data Sharing.Pilar Ossorio - 2011 - Social Research: An International Quarterly 78 (4):907-932.
Bodies of data: genomic data and bioscience data sharing.Pilar N. Ossorio - 2011 - Social Research: An International Quarterly 78 (3):907-932.
The Personal Data Is Political.Athina Tzovara & Bastian Greshake Tzovaras - 2019 - In Peter Dabrock, Matthias Braun & Patrik Hummel (eds.), The Ethics of Medical Data Donation. Springer Verlag.
Data models and the acquisition and manipulation of data.Todd Harris - 2003 - Philosophy of Science 70 (5):1508-1517.
Data Collection from the Web for Informetric Purposes.Judit Bar-Ilan - 2019 - In Wolfgang Glänzel, Henk F. Moed, Ulrich Schmoch & Mike Thelwall (eds.), Springer Handbook of Science and Technology Indicators. Springer Verlag. pp. 781-800.
Towards a Taxonomy of the Model-Ladenness of Data.Alisa Bokulich - forthcoming - PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association.
Data Interpretation in the Digital Age.Sabina Leonelli - 2014 - Perspectives on Science 22 (3):397-417.

Analytics

Added to PP
2020-11-24

Downloads
36 (#431,270)

6 months
14 (#170,850)

Historical graph of downloads
How can I increase my downloads?