Switch to: References

Citations of:

Counterfactual state explanations for reinforcement learning agents via generative deep learning

Matthew L. Olson, Roli Khanna, Lawrence Neal, Fuxin Li & Weng-Keen Wong

Artificial Intelligence 295 (C):103455 (2021)

Add citations

You must login to add citations.

(1 other version)The Intriguing Relation Between Counterfactual Explanations and Adversarial Examples.Timo Freiesleben - 2021 - Minds and Machines 32 (1):77-109.details The same method that creates adversarial examples to fool image-classifiers can be used to generate counterfactual explanations that explain algorithmic decisions. This observation has led researchers to consider CEs as AEs by another name. We argue that the relationship to the true label and the tolerance with respect to proximity are two properties that formally distinguish CEs and AEs. Based on these arguments, we introduce CEs, AEs, and related concepts mathematically in a common framework. Furthermore, we show connections between current (...) Download Export citation Bookmark 3 citations
(1 other version)The Intriguing Relation Between Counterfactual Explanations and Adversarial Examples.Timo Freiesleben - 2021 - Minds and Machines 32 (1):1-33.details The same method that creates adversarial examples to fool image-classifiers can be used to generate counterfactual explanations that explain algorithmic decisions. This observation has led researchers to consider CEs as AEs by another name. We argue that the relationship to the true label and the tolerance with respect to proximity are two properties that formally distinguish CEs and AEs. Based on these arguments, we introduce CEs, AEs, and related concepts mathematically in a common framework. Furthermore, we show connections between current (...) Download Export citation Bookmark 4 citations
Mitigating social biases of pre-trained language models via contrastive self-debiasing with double data augmentation.Yingji Li, Mengnan Du, Rui Song, Xin Wang, Mingchen Sun & Ying Wang - 2024 - Artificial Intelligence 332 (C):104143.details Download Export citation Bookmark
“That's (not) the output I expected!” On the role of end user expectations in creating explanations of AI systems.Maria Riveiro & Serge Thill - 2021 - Artificial Intelligence 298:103507.details Download Export citation Bookmark 3 citations

1