Explaining Explanations in AI

Brent Mittelstadt

FAT* 2019 Proceedings 1 (forthcoming)

Abstract

Recent work on interpretability in machine learning and AI has focused on the building of simplified models that approximate the true criteria used to make decisions. These models are a useful pedagogical device for teaching trained professionals how to predict what decisions will be made by the complex system, and most importantly how the system might break. However, when considering any such model it’s important to remember Box’s maxim that "All models are wrong but some are useful." We focus on the distinction between these models and explanations in philosophy and sociology. These models can be understood as a "do-it-yourself kit" for explanations, allowing a practitioner to directly answer "what-if" questions or generate contrastive explanations without external assistance. Although a valuable ability, giving these models as explanations appears more difficult than necessary, and other forms of explanation may not have the same trade-offs. We contrast the different schools of thought on what makes an explanation, and suggest that machine learning might benefit from viewing the problem more broadly.

Author's Profile

Brent Mittelstadt

University of Oxford

Keywords

interpretability, explanations, accountability, philosophy of science, data ethics, machine learning, artificial intelligence, automated decision-making

Analytics

Added to PP: 2018-11-04

Downloads: 1,370 (#8,878)

Last 6 months: 179 (#20,781)

Citations of this work

The Ethics of AI Ethics: An Evaluation of Guidelines. Thilo Hagendorff - 2020 - Minds and Machines 30 (1):99-120.

The Pragmatic Turn in Explainable Artificial Intelligence (XAI). Andrés Páez - 2019 - Minds and Machines 29 (3):441-459.

What do we want from Explainable Artificial Intelligence (XAI)? – A stakeholder perspective on XAI and a conceptual model guiding interdisciplinary XAI research. Markus Langer, Daniel Oster, Timo Speith, Lena Kästner, Kevin Baum, Holger Hermanns, Eva Schmidt & Andreas Sesing - 2021 - Artificial Intelligence 296 (C):103473.

Black-box assisted medical decisions: AI power vs. ethical physician care. Berman Chan - 2023 - Medicine, Health Care and Philosophy 26 (3):285-292.
