Simulated Selfhood in LLMs: A Behavioral Analysis of Introspective Coherence

Abstract

Large Language Models (LLMs) increasingly produce outputs that resemble introspection, including self-reference, epistemic modulation, and claims about internal states. This study investigates whether such behaviors display consistent patterns across repeated prompts or merely reflect surface-level generative artifacts. We evaluated five open-weight, stateless LLMs using a structured battery of 21 introspective prompts, each repeated ten times, yielding 1,050 completions. These outputs were analyzed along three behavioral dimensions: surface-level similarity (via token overlap), semantic coherence (via sentence embeddings), and inferential consistency (via natural language inference). The study introduces the concept of pseudo-consciousness to describe structured but non-experiential self-referential output. Following Dennett’s intentional stance, the analysis avoids ontological claims and focuses instead on behavioral regularities. The findings have implications for interpretability, alignment, and user perception, and highlight the need for caution in attributing mental states to stateless generative systems on the basis of linguistic fluency alone.
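
To make the three measures concrete, the sketch below shows one plausible way to score a set of repeated completions for token overlap, embedding-based semantic coherence, and NLI-based inferential consistency. It is not the paper's implementation: the specific models (all-MiniLM-L6-v2 for embeddings, roberta-large-mnli for inference) and the aggregation choices are illustrative assumptions.

```python
# A minimal sketch (not the paper's code) of the three behavioral metrics the
# abstract names: token overlap, semantic coherence, inferential consistency.
# Model names below are assumptions, not choices reported by the study.
from itertools import combinations

import torch
from sentence_transformers import SentenceTransformer, util
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Example: repeated completions of one introspective prompt.
completions = [
    "As a language model, I do not have subjective experiences.",
    "I lack inner experience; my self-reports are generated text.",
    "I cannot introspect in the way humans do.",
]
pairs = list(combinations(range(len(completions)), 2))

# 1. Surface-level similarity: mean pairwise Jaccard overlap of token sets.
def jaccard(a: str, b: str) -> float:
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb)

token_overlap = sum(jaccard(completions[i], completions[j]) for i, j in pairs) / len(pairs)

# 2. Semantic coherence: mean pairwise cosine similarity of sentence embeddings.
embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
emb = embedder.encode(completions, convert_to_tensor=True)
cos = util.cos_sim(emb, emb)
semantic_coherence = sum(cos[i, j].item() for i, j in pairs) / len(pairs)

# 3. Inferential consistency: fraction of ordered pairs that the NLI model does
#    not label as a contradiction (roberta-large-mnli label ids:
#    0 = contradiction, 1 = neutral, 2 = entailment).
nli_name = "roberta-large-mnli"  # assumed NLI model
nli_tok = AutoTokenizer.from_pretrained(nli_name)
nli_model = AutoModelForSequenceClassification.from_pretrained(nli_name)

def non_contradiction(premise: str, hypothesis: str) -> bool:
    inputs = nli_tok(premise, hypothesis, return_tensors="pt", truncation=True)
    with torch.no_grad():
        label = nli_model(**inputs).logits.argmax(dim=-1).item()
    return label != 0

ordered = [(i, j) for i in range(len(completions)) for j in range(len(completions)) if i != j]
inferential_consistency = sum(
    non_contradiction(completions[i], completions[j]) for i, j in ordered
) / len(ordered)

print(f"token overlap:           {token_overlap:.3f}")
print(f"semantic coherence:      {semantic_coherence:.3f}")
print(f"inferential consistency: {inferential_consistency:.3f}")
```

In this reading, high scores across all three dimensions would indicate stable self-referential behavior across repetitions, while low scores would suggest surface-level generative variation; the thresholds and aggregation across the 21 prompts are left to the study itself.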
