The data archive as factory: Alienation and resistance of data processors

Big Data and Society 8 (1) (2021)
  Copy   BIBTEX

Abstract

Archival data processing consists of cleaning and formatting data between the moment a dataset is deposited and its publication on the archive’s website. In this article, I approach data processing by combining scholarship on invisible labor in knowledge infrastructures with a Marxian framework and show the relevance of considering data processing as factory labor. Using this perspective to analyze ethnographic data collected during a six-month participatory observation at a U.S. data archive, I generate a taxonomy of the forms of alienation that data processing generates, but also the types of resistance that processors develop, across four categories: routine, speed, skill, and meaning. This synthetic approach demonstrates, first, that data processing reproduces typical forms of factory worker’s alienation: processors are asked to work along a strict standardized pipeline, at a fast pace, without acquiring substantive skills or having a meaningful involvement in their work. It reveals, second, how data processors resist the alienating nature of this workflow by developing multiple tactics along the same four categories. Seen through this dual lens, data processors are therefore not only invisible workers, but also factory workers who follow and subvert a workflow organized as an assembly line. I conclude by proposing a four-step framework to better value the social contribution of data workers beyond the archive.

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 91,853

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Data models and the acquisition and manipulation of data.Todd Harris - 2003 - Philosophy of Science 70 (5):1508-1517.
Data as oil, infrastructure or asset? Three metaphors of data as economic value.Jan Michael Nolin - 2019 - Journal of Information, Communication and Ethics in Society 18 (1):28-43.
Towards a Taxonomy of the Model-Ladenness of Data.Alisa Bokulich - forthcoming - PSA: Proceedings of the Biennial Meeting of the Philosophy of Science Association.
Ethics of Big Data.Kord Davis - 2012 - O'reilly. Edited by Doug Patterson.
Data fusion with probabilistic conditional logic.Jens Fisseler & Imre Fehér - 2010 - Logic Journal of the IGPL 18 (4):488-507.

Analytics

Added to PP
2021-07-01

Downloads
6 (#1,461,013)

6 months
2 (#1,198,779)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

Data-Centric Biology: A Philosophical Study.Sabina Leonelli - 2016 - London: University of Chicago Press.
Sorting Things out: Classification and Its Consequences.Geoffrey C. Bowker & Susan Leigh Star - 2001 - Journal of the History of Biology 34 (1):212-214.
La pensée sauvage.Claude Lévi-Strauss - 1964 - Revue Philosophique de la France Et de l'Etranger 154:508-511.
La pensée sauvage.Claude Levi-Strauss - 1963 - Les Etudes Philosophiques 18 (1):104-105.

View all 9 references / Add more references