Artificial Intelligence: Arguments for Catastrophic Risk

Adam Bales; William D'Alessandro; Cameron Domenico Kirk-Giannini

Artificial Intelligence: Arguments for Catastrophic Risk

Adam Bales, William D'Alessandro & Cameron Domenico Kirk-Giannini

Philosophy Compass 19 (2):e12964 (2024) Copy BIBT_EX

Abstract

Recent progress in artificial intelligence (AI) has drawn attention to the technology’s transformative potential, including what some see as its prospects for causing large-scale harm. We review two influential arguments purporting to show how AI could pose catastrophic risks. The first argument — the Problem of Power-Seeking — claims that, under certain assumptions, advanced AI systems are likely to engage in dangerous power-seeking behavior in pursuit of their goals. We review reasons for thinking that AI systems might seek power, that they might obtain it, that this could lead to catastrophe, and that we might build and deploy such systems anyway. The second argument claims that the development of human-level AI will unlock rapid further progress, culminating in AI systems far more capable than any human — this is the Singularity Hypothesis. Power-seeking behavior on the part of such systems might be particularly dangerous. We discuss a variety of objections to both arguments and conclude by assessing the state of the debate.

Cite

Plain text

BibTeX

Formatted text

Zotero

EndNote

Reference Manager

RefWorks

Options

Mark as duplicate

Find it on Scholar

Request removal from index

Revision history

Edit

View on PhilPapers

Author Profiles

Adam Bales

University of Oxford

William D'Alessandro

University of Oxford

Cameron Domenico Kirk-Giannini

Rutgers University - Newark

Archival history

Archival date: 2024-01-24
View all versions

Keywords

AI Safety Singularity Existential Risk Catastrophic Risk Instrumental Convergence Orthogonality Thesis Power-Seeking Goal Misgeneralization Reward Misspecification

Reprint years

DOI

10.1111/phc3.12964

Analytics

Added to PP
2024-01-24

Downloads
610 (#26,378)

6 months
610 (#2,257)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...

Artificial Intelligence: Arguments for Catastrophic Risk

Abstract

Author Profiles

Archival history

Categories

Keywords

Reprint years

DOI

Analytics