The Intriguing Relation Between Counterfactual Explanations and Adversarial Examples

Timo Freiesleben

Minds and Machines 32 (1):1-33 (2021)

Abstract

The same method that creates adversarial examples (AEs) to fool image-classifiers can be used to generate counterfactual explanations (CEs) that explain algorithmic decisions. This observation has led researchers to consider CEs as AEs by another name. We argue that the relationship to the true label and the tolerance with respect to proximity are two properties that formally distinguish CEs and AEs. Based on these arguments, we introduce CEs, AEs, and related concepts mathematically in a common framework. Furthermore, we show connections between current methods for generating CEs and AEs, and estimate that the fields will merge more and more as the number of common use-cases grows.

Keywords

Munich Center for Mathematical Philosophy (MCMP) ddc:100

DOI

10.1007/s11023-021-09580-9

Citations of this work

Reliability in Machine Learning. Thomas Grote, Konstantin Genin & Emily Sullivan - forthcoming - Philosophy Compass.

Fragility, robustness and antifragility in deep learning. Chandresh Pravin, Ivan Martino, Giuseppe Nicosia & Varun Ojha - 2024 - Artificial Intelligence 327 (C):104060.
