Abstract
This article presents Epistemic Spatialization as a new framework for investigating the interconnected patterns of bias that arise when convolutional neural networks identify objects. It draws upon Foucault’s notion of spatialized knowledge to guide its method of enquiry. We argue that the decisions involved in creating these algorithms, together with the labeling, ordering, presentation, and commercial prioritization of objects, produce a distorted “nomination of the visible”: they harden the visibility of some objects, render others excessively visible, and consign still others to permanent or haphazard invisibility. Our approach differs from work that focuses on high-stakes misidentifications, such as errors tied to structural racism. Examining the far more prevalent series of low-stakes mistakes reveals the full scope of these errors, destabilizing the goal of image content identification and carrying considerable societal impact. We explore these issues through a close examination of the demonstration video of a popular convnet, which reveals an interlocking series of biases undermining the content identification process. The picture we paint is crucial for a better understanding of the errors that will result as these convnets become further embedded in everyday products. The framework is valuable for critical work on computer vision, AI studies, and large-scale visual analysis.