Why Attention is Not Explanation: Surgical Intervention and Causal Reasoning about Neural Models

Christopher Grimsley; Elijah Mayfield; Julia Bursten

Why Attention is Not Explanation: Surgical Intervention and Causal Reasoning about Neural Models

Christopher Grimsley, Elijah Mayfield & Julia Bursten

Proceedings of the 12th Conference on Language Resources and Evaluation (2020) Copy BIBT_EX

Abstract

As the demand for explainable deep learning grows in the evaluation of language technologies, the value of a principled grounding for those explanations grows as well. Here we study the state-of-the-art in explanation for neural models for natural-language processing (NLP) tasks from the viewpoint of philosophy of science. We focus on recent evaluation work that finds brittleness in explanations obtained through attention mechanisms.We harness philosophical accounts of explanation to suggest broader conclusions from these studies. From this analysis, we assert the impossibility of causal explanations from attention layers over text data. We then introduce NLP researchers to contemporary philosophy of science theories that allow robust yet non-causal reasoning in explanation, giving computer scientists a vocabulary for future research

View on PhilPapers

Author's Profile

Julia Bursten

University of Kentucky

Archival history

Archival date: 2020-06-23
View all versions

Keywords

explanation explainability machine learning philosophy of science causation attention mechanisms

Reprint years

Analytics

Added to PP
2020-06-23

Downloads
943 (#26,656)

6 months
150 (#36,477)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Why Attention is Not Explanation: Surgical Intervention and Causal Reasoning about Neural Models

Abstract

Author's Profile

Archival history

Categories

Keywords

Reprint years

Analytics