Levels of Self-Improvement in AI  and their Implications for AI Safety

Alexey Turchin

Levels of Self-Improvement in AI and their Implications for AI Safety

Abstract

Abstract: This article presents a model of self-improving AI in which improvement could happen on several levels: hardware, learning, code and goals system, each of which has several sublevels. We demonstrate that despite diminishing returns at each level and some intrinsic difficulties of recursive self-improvement—like the intelligence-measuring problem, testing problem, parent-child problem and halting risks—even non-recursive self-improvement could produce a mild form of superintelligence by combining small optimizations on different levels and the power of learning. Based on this, we analyze how self-improvement could happen on different stages of the development of AI, including the stages at which AI is boxed or hiding in the internet.

View on PhilPapers

Author's Profile

Alexey Turchin

Archival history

First archival date: 2018-04-08
Latest version: 3 (2018-04-29)
View all versions

Keywords

recursive self-improvement artificial intelligence existential risks

Reprint years

Analytics

Added to PP
2018-04-08

Downloads
970 (#23,825)

6 months
179 (#22,957)

Historical graph of downloads since first upload

This graph includes both downloads from PhilArchive and clicks on external links on PhilPapers.

How can I increase my downloads?

Applied ethics	Epistemology	History of Western Philosophy	Meta-ethics	Metaphysics	Normative ethics
Philosophy of biology	Philosophy of language	Philosophy of mind	Philosophy of religion	Science Logic and Mathematics	More ...