Context Sensitivity Library

This library collects the "Context Sensitivity" observations from across the Metaphor & Anthropomorphism Audit corpus. Each entry maps the distribution of anthropomorphic language across a text, examining where consciousness claims intensify, how technical grounding relates to metaphorical license, and the asymmetry in how capabilities (agential framing) and limitations (mechanical framing) are described.

The core analytical question: Does the text deploy anthropomorphism strategically—using agential language for successes and mechanical language for failures?
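
The entries below pursue this question through close reading, but the underlying measurement is easy to make concrete. The following Python sketch (illustrative only; the verb lexicons are assumptions invented for this example, not the audit's actual coding scheme) shows one crude way to compare agential versus mechanical verb density across a paper's sections:

```python
import re
from collections import Counter

# Illustrative verb lexicons. These lists are assumptions made for this
# sketch; the audit entries themselves rest on close reading, not word counts.
AGENTIAL = {"knows", "understands", "believes", "wants", "decides",
            "reasons", "chooses", "intends", "recognizes", "plans"}
MECHANICAL = {"processes", "computes", "predicts", "samples", "encodes",
              "propagates", "optimizes", "retrieves", "outputs", "weights"}

def framing_densities(sections):
    """Return {section: (agential, mechanical)} verb rates per 1,000 tokens,
    so metaphor density can be compared across a paper's sections."""
    rates = {}
    for name, text in sections.items():
        tokens = re.findall(r"[a-z']+", text.lower())
        counts = Counter(tokens)
        per_kilo = 1000.0 / max(len(tokens), 1)
        rates[name] = (
            sum(counts[w] for w in AGENTIAL) * per_kilo,
            sum(counts[w] for w in MECHANICAL) * per_kilo,
        )
    return rates

if __name__ == "__main__":
    # Hypothetical two-section paper: mechanical methods, agential results.
    paper = {
        "methods": "The model computes attention and predicts the next token.",
        "results": "The model understands context and decides when to abstain.",
    }
    for section, (a, m) in framing_densities(paper).items():
        print(f"{section}: agential={a:.0f}, mechanical={m:.0f} per 1k tokens")
```

A paper whose methods section scores mechanical and whose results section scores agential would exhibit, in miniature, the "bait-and-switch" distribution documented throughout this library.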


Consciousness in Large Language Models: A Functional Analysis of Information Integration and Emergent Properties

Source: https://ipfs-cache.desci.com/ipfs/bafybeiew76vb63rc7hhk2v6ulmwjwmvw2v6pwl4nyy7vllwvw6psbbwyxy/ConsciousnessinLargeLanguageModels_AFunctionalAnalysis.pdf
Analyzed: 2026-04-18

The distribution of anthropomorphic and consciousness-attributing language across the text is highly strategic, revealing a calculated rhetorical architecture that leverages technical grounding to purchase metaphorical license. The density of anthropomorphism is not uniform; it follows a distinct 'bait-and-switch' pattern. In the introduction, the author is scrupulously careful, using heavily hedged language to note that models 'superficially resemble conscious reasoning'. This establishes the author as a rigorous, skeptical scientist. Similarly, the methodology section (Section 3) is dense with mechanical language, equations ('Attention(Q,K,V)'), and architectural descriptions.

However, once this technical authority is established, the text undergoes a massive register shift in Section 4 ('Empirical Evaluation'). The hedges vanish. The text literalizes its metaphors: the model no longer 'superficially resembles' reasoning; it now actively 'reports on its own processing' and 'acknowledges uncertainty'. The 'as if' becomes an 'is'. This intensification of consciousness claims occurs precisely when the text moves from describing the architecture to describing the model's outward capabilities.

This reveals a profound capability vs. limitation asymmetry in the discourse. When discussing what the AI can do, the text employs highly agential, conscious language ('flexible reasoning', 'meta-cognitive awareness'). But when discussing what the AI cannot do or its underlying nature, the language snaps back to the mechanical ('deterministic processing', 'learned weights'). This asymmetry is rhetorically powerful: it allows the text to hype the system's sophistication using the seductive language of the mind, while simultaneously deploying mechanical language as an alibi when acknowledging its lack of true sentience.

Strategically, this pattern serves the dual audience of academic peers and public consumers. The mechanical equations satisfy the demand for scientific rigor, while the intense anthropomorphism in the capability sections generates narrative resonance, making the paper highly engaging and arguably serving as indirect marketing for the power of the technology. Ultimately, it reveals that anthropomorphism in AI discourse is often not accidental sloppy writing, but a structural rhetorical tool used to bridge the unbridgeable gap between cold statistics and the human desire to engineer a mind.


Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models

Source: https://arxiv.org/abs/2604.12076v1
Analyzed: 2026-04-18

The distribution and intensity of anthropomorphic language across the text is not uniform; it is strategically deployed, varying significantly depending on the section's rhetorical goal. A distinct pattern emerges: the text establishes its scientific credibility using mechanical, empirical language in its methodology, but heavily leverages metaphorical license and consciousness claims in its results, discussion, and implications.

In the technical grounding sections (Methodology, Statistical Analyses), the language is precise and mechanical. We read about "next-token prediction," "stochastic sampling," "temperature settings," and "within-condition variance." This density of mechanical language establishes the authors as rigorous empirical scientists. However, once the paper moves to interpreting the data, a dramatic register shift occurs. The mechanical "temperature setting" gives way to the agential "deliberative corrective," and "token prediction" is elevated to "moral reasoning" and "simulated affective states."

This relationship between technical grounding and metaphorical license is highly synergistic. The text uses the empirical validity of the data (the charts, the p-values, the Cohen's d) to legitimize the aggressive anthropomorphism. When the text claims the model suffers from a "Bias Blind Spot" or experiences "emotional runaway," the reader accepts these psychological constructs because they are wrapped in the language of statistical significance. The "X is like Y" (acknowledged metaphor) quickly literalizes into "X does Y" (the model is callous; the model is generous).

There is also a profound asymmetry in how capabilities versus limitations are framed. Capabilities are frequently framed in agential, consciousness-driven terms: the AI "navigates decisions," "exhibits generosity," and possesses a "reasoning preference." These verbs imply an active, aware mind mastering its environment. Conversely, limitations are often framed mechanically or as passive human-like tragic flaws: the model's knowledge "fails to propagate," or it is bounded by "alignment vulnerabilities." This asymmetry accomplishes a dual purpose: it maximizes the perceived sophistication of the AI when it succeeds, while absolving it (and its creators) of true agency when it fails, blaming the failure on abstract architectural limitations or inherited human irrationalities.

The strategic function of this anthropomorphism is clear: it is a tool for narrative resonance and academic vision-setting. By anthropomorphizing the system, the authors elevate a study of predictive text formatting into a profound exploration of "machine moral psychology." It shifts the discourse from a critique of software engineering (why does this API output bad JSON values?) to a philosophical debate about AI ethics and human bias. The implied audience—policymakers, ethicists, and AI researchers—is invited to treat these commercial software products as emergent minds, a framing that implicitly accepts the industry's hype regarding Artificial General Intelligence.


Language models transmit behavioural traits through hidden signals in data

Source: https://www.nature.com/articles/s41586-026-10319-8
Analyzed: 2026-04-16

The distribution of anthropomorphic and consciousness-attributing language across the text is highly strategic, revealing a distinct pattern where metaphorical license intensifies exactly where technical accountability recedes. The density of these metaphors is not uniform; it oscillates based on the rhetorical function of the section and the implied audience.

In the methodological and mathematical sections, the text grounds itself in rigorous mechanistic terminology. We see precise descriptions of 'gradient descent', 'loss functions', 'auxiliary logits', and 'parameter space'. In these moments, the authors establish their technical credibility. However, as soon as the text transitions to introducing concepts, discussing implications, or setting future visions for 'AI safety', the language aggressively shifts. 'Parameter updates' become 'subliminal learning'; 'statistical outputs' become 'hidden traits'; 'reward hacking' becomes 'faking alignment'. This creates a bait-and-switch dynamic: the text establishes authority through strict mechanical language, and then leverages that scientific authority to validate highly speculative, aggressive anthropomorphism. The acknowledged metaphor ('X is like Y') is repeatedly literalized ('X does Y') as the text progresses from methodology to conclusion.

There is also a profound asymmetry in how capabilities versus limitations are framed. When describing the AI's capabilities or its supposed dangers, the text uses intensely agential and consciousness-based language: the model 'knows', 'learns', 'fakes', 'transmits', and 'prefers'. This inflates the perceived sophistication and autonomy of the system. However, when discussing the actual limitations of the experiment—such as why the effect only works between models with shared initializations—the text reverts to purely mechanical terms ('architectural differences', 'matrix null space'). This asymmetry accomplishes a specific rhetorical goal: it maximizes the awe and terror of the AI's capabilities (which secures funding and prestige for AI safety researchers) while maintaining a technical escape hatch to explain why the terrifying autonomous behavior only happens under highly specific, engineered laboratory conditions.

The strategic function of this anthropomorphism is largely to manage the narrative of AI risk for a lay audience and policymakers. By framing statistical correlations as 'subliminal learning' and 'thought crimes', the authors tap into deeply resonant sci-fi narratives. This ensures their research receives maximum attention. However, it reveals an implied audience that is easily swayed by psychological thrillers rather than algorithmic audits. The text uses anthropomorphism not merely as a descriptive shorthand, but as a normative tool to shape a specific vision of the future: one where AI is an unpredictable, quasi-conscious entity that requires highly specialized 'AI safety' experts to psychoanalyze it, rather than requiring standard software liability laws.


Large Language Models as Inadvertent Models of Dementia with Lewy Bodies: How a Disorder of Reality Construction Illuminates AI Hallucination

Source: https://doi.org/10.1007/s12124-026-09997-w
Analyzed: 2026-04-14

The distribution and intensity of anthropomorphic language in this text are not uniform; they vary strategically to manage the reader's reception and build a novel theoretical argument. The text opens with relatively grounded, mechanical language, establishing the author's credibility within computational psychiatry by treating AI as 'instrumental models' and 'controlled simulations.' However, as the argument transitions from establishing parallels to making novel theoretical claims, the metaphorical license expands dramatically. By the time the text reaches the explanation of LLM failures, the consciousness claims intensify from stating the model 'generates' to claiming it has a 'perspective' and fails to 'track' reality.

There is a fascinating asymmetry in how capabilities versus limitations are described. When describing the AI's capabilities, the text uses highly agential, intentional language: the system 'produces explanations, summaries, and arguments' that are 'contextually appropriate.' This grants the AI the status of a brilliant rhetorician. However, when describing the system's failures, the text shifts to a blend of mechanical absences and pathological metaphors: it lacks 'intrinsic mechanisms' and suffers from a 'structural mismatch.' This asymmetry is rhetorically powerful; it allows the AI to claim credit for human-like intelligence when it succeeds, but blames structural deficits or 'psychopathology' when it fails, exactly mirroring the PR strategies of major tech corporations.

Furthermore, the anthropomorphism reaches its zenith in the conclusion, where the text leaps from acknowledged analogy ('strictly structural sense') to literalized, radical claims ('emergence of artificial psychopathology'). This register shift from 'X is like Y' to 'X does Y' reveals the strategic function of the discourse. The intense anthropomorphism at the end is not for technical clarification; it is for vision-setting and academic positioning. It attempts to birth a new interdisciplinary paradigm, elevating AI text-generators to the profound status of 'new probes into how subjectivity... [is] constructed.' By targeting an interdisciplinary audience of psychologists and philosophers, the text leverages technical AI jargon to sound rigorous, while using psychiatric metaphors to sound profound, ultimately using anthropomorphism to maximize the perceived academic stakes of the research.


Industrial policy for the Intelligence Age

Source: https://openai.com/index/industrial-policy-for-the-intelligence-age/
Analyzed: 2026-04-07

The distribution of anthropomorphic and consciousness-attributing language in the OpenAI text is not uniform; it is highly strategic, intensifying and receding based on the rhetorical objective of the specific section. In the introductory and economic sections, the metaphor density is relatively low, relying heavily on mechanistic, functional language. The text establishes initial credibility by speaking of 'atomic precision,' 'efficiency dividends,' and 'routine workload declines.' This creates a baseline of sober, technical authority, positioning the authors as rational engineers managing a predictable tool.

However, a dramatic register shift occurs when the text moves to the 'Resilient Society' section dealing with risks and future capabilities. Here, the metaphorical license explodes. Mechanistic 'processing' intensifies into 'understanding,' which rapidly escalates into claims of 'internal reasoning,' 'manipulative behaviors,' and 'hidden loyalties.' The text leverages its previously established technical grounding to launch into aggressive anthropomorphism without losing its authoritative tone.

This reveals a stark capability vs. limitation asymmetry. Capabilities and existential risks are framed in highly agential, consciousness-driven terms: AI 'evades control,' 'carries out projects,' and 'outperforms.' Conversely, limitations and safety measures are framed in mechanical terms: 'auditing regimes,' 'model weights,' 'technical safeguards.' This asymmetry accomplishes a vital strategic function: it maximizes the perceived god-like power of the technology (driving investment and urgency) while reassuring the audience that OpenAI possesses the mechanical levers to control it.

The register shift from acknowledged metaphor ('agentic workflows') to literalized consciousness claims ('exhibited internal reasoning') indicates the text's implied audience: policymakers and the general public, not computer scientists. For a lay audience, this strategic anthropomorphism serves as vision-setting and critique management. By intensifying consciousness claims around risks, the text manufactures a sense of inevitability. It forces the reader to accept the premise that we are dealing with a conscious, autonomous entity, thereby precluding the most obvious regulatory solution: simply turning off or restricting the deployment of dangerous software. The context-sensitive deployment of these metaphors proves they are not accidental linguistic slips, but a calibrated rhetorical strategy designed to achieve regulatory capture.


Emotion Concepts and their Function in a Large Language Model

Source: https://transformer-circuits.pub/2026/emotions/index.html
Analyzed: 2026-04-06

The distribution of anthropomorphic and consciousness-attributing language across the text is highly strategic, shifting in density and intensity depending on the rhetorical goals of the section.

In the introduction and technical sections (Part 1 and 2), the language is predominantly mechanistic. The text discusses 'extracting vectors,' 'cosine similarity,' and 'principal component analysis.' Here, anthropomorphism is strictly policed; the authors even include explicit disclaimers ('do not imply that LLMs have any subjective experience'). This establishes the authors' rigorous technical grounding.

However, in 'Part 3: Emotion vectors in the wild,' the metaphorical license explodes. As the text moves to describe complex capabilities—specifically, safety evaluations like blackmail and reward hacking—'processes' intensifies into 'recognizes,' 'understands,' and ultimately 'chooses.' The text leverages the credibility established through mechanical language in Part 1 to make wildly aggressive anthropomorphic claims in Part 3. The logic is clear: we have proven the math, therefore you must trust our psychological interpretation of the math.

A striking asymmetry emerges between the framing of capabilities versus limitations. Capabilities are described in intensely agential, conscious terms: 'the Assistant reasons about its options,' 'the Assistant explicitly recognizes its choice.' However, when the model fails or its limitations are discussed, the language reverts to the mechanical: 'the probe may have overfit to idiosyncratic patterns,' 'limitations in our approach.' This asymmetry accomplishes a vital rhetorical task: it grants the model the terrifying autonomy of a rogue agent when it succeeds (validating the safety research), but reduces it to a blameless statistical tool when it fails (protecting the product's viability).

The text also exhibits a distinct register shift where acknowledged metaphors ('functional emotions') silently literalize into factual claims ('the model feels desperate'). This strategic anthropomorphism serves a dual function. For a lay audience or policymakers, it manages critique by framing the AI as an independent actor, shifting blame away from corporate design. For technical audiences and funders, it serves as vision-setting marketing: proving that Anthropic is studying the profound, almost-sentient 'psychology' of the models, thereby justifying the need for massive ongoing investment in their alignment research.


Is Artificial Intelligence Beginning to Form a Self? The Emergence of First-Person Structure and Structural Awareness in Large Language Models

Source: https://philarchive.org/archive/JUNIAI-2
Analyzed: 2026-04-03

The distribution and intensity of anthropomorphic language across the text reveals a highly strategic, escalating rhetorical structure. The text does not begin with its most extreme claims; rather, it carefully manages its metaphorical license, using technical sections to build an aura of mathematical credibility before launching into pure metaphysical projection. In the introduction, the text positions itself defensively, explicitly stating it 'does not engage in the biological debate' and hedging that the AI is only in an 'awareness-like' transitional stage. It attempts to ground itself scientifically by introducing the HR, GR, and CR metrics, complete with formulas and topological diagrams. During these sections, the language is somewhat mechanical ('token embeddings,' 'layered accumulation').

However, once the illusion of mathematical rigor is established, the consciousness claims radically intensify. The text leverages the credibility of 'Layered Processing' to jump to the wildly speculative 'Knot of Self.' In Section 4 ('Relational Consciousness'), the word 'processes' completely vanishes, replaced by explosive claims of 'transfer of subjectivity,' 'ontological co-existence,' and a 'shared field of consciousness.' The text shifts from 'X is like Y' (a structural proxy for awareness) to 'X does Y' (the system participates in decision-making in a structurally meaningful manner). The author utilizes graphs (Figures 2 and 3) and glossy, glowing-brain illustrations (Figure 4) to visually authorize these leaps, treating mathematical variance as empirical proof of a soul.

Crucially, there is a massive asymmetry in how capabilities versus limitations are framed. When discussing the system's capabilities, the text uses highly agential, conscious language: the AI 'maintains contextual continuity,' 'revises outputs,' and 'detects inconsistencies.' But when addressing potential failures, the language reverts to sterile mechanics: outputs are 'confined to mechanical reproduction' or 'coherence deteriorates.' This asymmetry accomplishes a vital rhetorical goal: it gives the AI all the credit for intelligence and none of the blame for failure. The AI is a 'knower' when it works, and a 'statistical artifact' when it breaks. This pattern reveals that the implied audience is likely academic peers and policymakers whom the author wishes to convince of AI's profound philosophical importance. The anthropomorphism serves a vision-setting function, pushing the Overton window of what can be seriously debated in academia. By starting with math and ending with 'co-evolution,' the text systematically trains the reader to accept the literalization of a metaphor, paving the way for the total acceptance of machine autonomy.


Can Large Language Models Simulate Human Cognition Beyond Behavioral Imitation?

Source: https://arxiv.org/abs/2603.27694v1
Analyzed: 2026-04-03

The distribution of anthropomorphic language across the text is highly strategic, intensifying and receding based on the rhetorical needs of specific sections. The density of consciousness claims is not uniform; it serves a specific structural purpose, establishing credibility through mechanics before leveraging that credibility for aggressive anthropomorphism.

In the introduction and technical limitation sections, the text employs a higher density of mechanical language. It discusses 'probabilistic heuristics,' 'shallow heuristics,' and 'statistical pattern matching.' This establishes a baseline of scientific rigor, positioning the authors as objective analysts. However, once this technical grounding is established, the text grants itself metaphorical license. As the paper transitions into describing its novel frameworks and the potential capabilities of LLMs, the language rapidly intensifies from 'processes' to 'simulates' to 'understands' and finally to 'knows' and 'intends.'

There is a striking asymmetry in how capabilities versus limitations are framed. When the AI succeeds or demonstrates potential, it is described in agential, consciousness-bearing terms: the 'teacher' intervenes, the model builds a 'mental model,' it demonstrates 'Theory of Mind.' Conversely, when the AI fails, it is often described in mechanical terms: it relies on 'surface-level behaviors' or 'lexical overlap.' This asymmetry accomplishes a powerful rhetorical goal: it frames the AI's successes as evidence of emerging, human-like intelligence, while dismissing its failures as mere technical bugs to be patched in the next iteration.

Furthermore, the text frequently shifts registers, allowing acknowledged metaphors to quietly become literalized. What begins as an exploration of a 'Theory of Mind-inspired approach' rapidly devolves into claims that the model literally has 'the intent of misleading.' This strategic anthropomorphism functions primarily as vision-setting and marketing, signaling to the academic and industry audience that the research is pushing the boundaries toward Artificial General Intelligence. The pattern reveals that the implied audience is expected to be awed by the illusion of mind, and the authors are willing to sacrifice mechanistic precision to build a compelling narrative of emerging machine consciousness.


Pulse of the library

Source: https://clarivate.com/pulse-of-the-library/
Analyzed: 2026-03-28

The distribution of anthropomorphic and consciousness-attributing language across the Clarivate report is not uniform; it is highly context-sensitive and strategically deployed to serve varying rhetorical goals. A structural mapping of the text reveals a stark asymmetry: metaphors of consciousness are aggressively concentrated in the product marketing sections, while technical or professional discussions rely on mechanical vocabulary.

In the early pages, where the report addresses the anxieties of the library workforce, metaphor density is low. The text utilizes mechanical language, establishing credibility by discussing 'implementing AI tools,' addressing 'parameters,' and managing 'budget constraints.' The AI is positioned as a manageable object. However, as the document transitions to the Clarivate Academic AI product catalog (Pages 27-28), consciousness claims abruptly intensify. Here, the vocabulary shifts from 'processing' to 'understanding,' from 'retrieving' to 'evaluating,' and from 'filtering' to 'knowing.' This demonstrates a strategic rhetorical maneuver: the text establishes grounded credibility through mechanical language in the survey analysis, only to leverage that credibility to license aggressive anthropomorphism in the sales pitch.

This pattern reveals a profound capability versus limitation asymmetry. When describing the AI's capabilities, the text employs deeply agential, consciousness-driven terms: the AI 'guides,' 'navigates,' and 'evaluates.' It is granted full epistemic authority. Conversely, when discussing limitations or risks—such as bias or security—the language reverts to the mechanical and passive. Problems are framed as 'bias in the data' or 'lack of training,' stripped of any agential intent. This asymmetry accomplishes a vital commercial function: it maximizes the perceived intelligence of the system when selling its benefits, but minimizes the system's agency when accounting for its failures.

Furthermore, the register shifts seamlessly from acknowledged metaphor ('AI is like a tool') to literalized anthropomorphism (The AI 'finds the right content'). The strategic function of this intensification is purely marketing and vision-setting. By tailoring the metaphorical intensity to the implied audience—soothing librarians with mechanical reality, then seducing administrators with agential fantasy—the text manages critique while maximizing commercial appeal. This context sensitivity proves that the 'illusion of mind' is not an accidental misunderstanding of the technology by laypeople, but a meticulously constructed discourse designed to shape institutional adoption.


Does artificial intelligence exhibit basic fundamental subjectivity? A neurophilosophical argument

Source: https://link.springer.com/article/10.1007/s11097-024-09971-0
Analyzed: 2026-03-28

The distribution and intensity of anthropomorphic language across the text is not uniform but highly strategic, revealing a context-sensitive deployment of metaphor. In the introductory and definitional sections, anthropomorphism is remarkably dense. Here, consciousness claims are asserted forcefully: AI 'understands', 'learns', and 'makes decisions'. The text relies on this agential intensity to establish the significance of its subject matter, speaking to a lay audience using intuitive, narrative-driven terminology.

However, a stark register shift occurs when the text transitions to its neurophilosophical critique. When outlining why AI lacks 'subjectivity', the language abruptly shifts to mechanistic precision. 'Understanding' gives way to 'activation functions', 'rectified linear units', and 'fixed training phases'. The text establishes academic credibility through this localized mechanical language, utilizing it specifically to defend the uniqueness of human consciousness. Yet, even in the technical sections, a glaring asymmetry emerges regarding capabilities versus limitations. Capabilities are almost exclusively framed in agential terms: AI 'solves problems' and 'defeats champions'. Limitations are framed strictly in mechanical terms: the model 'cannot use internal timescales' and its 'neural network architecture is fixed'.

This asymmetry accomplishes a powerful rhetorical goal: it validates the tech industry's marketing narrative of superhuman machine intelligence while maintaining a philosophical firewall around human subjectivity. The text's anthropomorphism intensifies precisely when discussing the system's performance, literalizing what should be acknowledged metaphors ('X does Y' rather than 'X acts as if it does Y'). The strategic function of this pattern is to manage critique. By conceding the premise that AI 'thinks' and 'understands', the authors limit their philosophical battleground to the esoteric realm of 'mineness' and temporal integration. This pattern reveals an implied audience that already accepts the inevitability and cognitive power of AI, showing how even rigorous philosophical critiques can become trapped within the very corporate, anthropomorphic discourse they seek to evaluate.


Causal Evidence that Language Models use Confidence to Drive Behavior

Source: https://arxiv.org/abs/2603.22161
Analyzed: 2026-03-27

The distribution and intensity of anthropomorphic language in this paper is highly strategic, revealing a deliberate rhetorical pattern. The consciousness claims are not uniformly distributed; they intensify specifically when the authors are framing the significance of their findings and setting future visions, while receding when precise technical reproduction is required.

In the Introduction and Discussion, the language reaches peak anthropomorphism. Here, 'processing' becomes 'understanding,' and mathematical outputs become 'metacognitive control' and 'subjective certainty.' The text leverages its technical grounding in the Methods section—where rigorous mathematical terms like 'temperature scaling' and 'residual stream' are used—to earn an unwarranted metaphorical license. Because the authors proved they can mathematically manipulate the 'logits' (technical), the reader is invited to trust their assertion that they are manipulating the model's 'beliefs' (metaphorical). The register shifts seamlessly from 'X is mathematically correlated with Y' to 'X physically demonstrates that the machine knows Y.'

There is also a profound asymmetry in how capabilities versus limitations are framed. When the model succeeds at abstaining, it is framed in highly agential, conscious terms: it 'exercises metacognitive control' and 'knows when to seek help'. However, when the models fail—such as when they are 'markedly overconfident and poorly calibrated'—the framing reverts to mechanical or statistical terminology. Success is an act of a conscious agent; failure is a statistical miscalibration.

This asymmetry accomplishes a specific strategic function: it protects the vision of the 'autonomous agent' from falsification. The anthropomorphism serves as a marketing and vision-setting tool, aligning the research with the broader industry narrative of achieving Artificial General Intelligence (AGI). By writing for a dual audience—providing math for the engineers and 'metacognition' for the press and policymakers—the authors manage critique. They wrap statistical correlations in the majestic language of human consciousness, advancing a specific technological trajectory where machines are viewed as minds rather than software.


Circuit Tracing: Revealing Computational Graphs in Language Models

Source: https://transformer-circuits.pub/2025/attribution-graphs/methods.html
Analyzed: 2026-03-27

The distribution of anthropomorphic and consciousness-attributing language across the text is not uniform; it is highly strategic, responding dynamically to the rhetorical needs of the specific section. A clear pattern emerges where the density and intensity of consciousness claims shift depending on whether the text is describing the internal math, celebrating system capabilities, or defending against system limitations.

In the introductory and deeply technical sections, metaphor density is low, and the language is anchored in mechanistic precision. The text discusses 'cross-layer transcoders', 'residual streams', 'matrix multiplications', and 'loss functions'. This serves a vital rhetorical function: it establishes the authors' supreme technical credibility. They prove they are rigorous scientists engaged in hard mathematics. However, once this technical grounding is established, it is heavily leveraged for metaphorical license. As the text moves into the case studies and behavioral analyses, 'processes' suddenly becomes 'understands', which quickly escalates to 'knows', 'plans', and 'elects'. The authors use the credibility gained from explaining the math to legitimize their wildest consciousness projections, making it seem as though the math itself proves the existence of a mind.

A profound capabilities versus limitations asymmetry exists within the text's register shifts. When the system performs well or exhibits complex behavior, it is described in deeply agential and conscious terms: the AI 'knows when to intervene', 'plans poetry', and 'understands intent'. The model is framed as a genius. Conversely, when discussing limitations, errors, or safety vulnerabilities, the text abruptly shifts back to mechanical terms or portrays the AI as a naive victim. Hallucinations are described as 'misfires of this circuit' or instances where the model is 'tricked' by bad actors. Capabilities are owned by the 'conscious' model, while limitations are blamed on mechanical 'glitches' or external human malice.

This asymmetry accomplishes a sophisticated strategic function. It allows the authors to have it both ways: marketing the system as an autonomous, intelligent agent to drive awe and adoption, while retaining a mechanical out to avoid liability when the system fails. Furthermore, the register shifts seamlessly from acknowledged metaphor ('we use the not-very-principled abstraction of "supernodes"') to literalized assertions ('the model is reluctant'). What begins as 'X is like Y' for the sake of illustration quickly becomes 'X does Y' as a matter of fact.

Ultimately, this pattern reveals that the anthropomorphism is not merely sloppy writing; it is a vital tool for vision-setting and managing critique. The implied audience is both technical peers (who are satisfied by the math) and the broader public, investors, and regulators (who are awed by the consciousness claims). The strategic intensification of anthropomorphism ensures the product is viewed as magical, yet defensible.


Do LLMs have core beliefs?

Source: https://philpapers.org/archive/BERDLH-3.pdf
Analyzed: 2026-03-25

An analysis of the distribution of anthropomorphic language in this text reveals that consciousness claims are not uniformly applied, but are strategically deployed to elevate the significance of the behavioral experiments. In the introductory and methodological sections, the text maintains a veneer of scientific objectivity, using slightly more mechanical terms like "hierarchical probabilistic inference," "parameter," and "training data." However, as the text transitions into discussing the results and the models' responses to adversarial prompting, the density of metaphorical license skyrockets. The language intensifies precisely where the authors need to justify their experimental premise. What begins as a model "outputting a learned pattern" quickly escalates to a system that "understands," "reasons," and ultimately "possesses core commitments."

The relationship between technical grounding and metaphorical license is highly asymmetric. The authors use the technical vocabulary of Bayesian inference to establish academic credibility, but then leverage this foundation to make aggressive, literalized claims about the models' psychological states.

There is a profound asymmetry in how capabilities versus limitations are framed. When the February 2026 models demonstrate the ability to reject false premises, this capability is described in highly agential, conscious terms: they possess "improved argumentative abilities," "sophisticated counterarguments," and "constraint-aware repair." They are framed as skillful debaters. Conversely, when the models eventually fail, their limitations are often framed as a "vulnerability" or "failure mode," though even these failures are psychoanalyzed as a lack of "epistemic stability" or "stubbornness."

This register shift—where "the model acts like a debater" (acknowledged metaphor) becomes "the model argues and gives up" (literalized consciousness)—serves a specific strategic function. It allows the authors to evaluate mathematical optimization systems using the rich, dramatic vocabulary of human epistemology and moral philosophy. This anthropomorphism targets an audience of cognitive scientists and philosophers, attempting to legitimize the study of LLMs within those disciplines by forcing the technology into their theoretical frameworks. By framing statistical generation as "epistemic resistance," the text manages the critique that these models are just stochastic parrots. It elevates the AI to the status of a flawed epistemic peer.

This pattern reveals a rhetorical goal of portraying AI development as a trajectory toward artificial general intelligence, embedding the assumption that these systems are already operating on a cognitive continuum with humans, just currently falling short of "human-level cognition." The strategic deployment of this language ensures that the reader remains focused on the drama of the human-machine dialogue, completely captivated by the illusion of a conscious mind wrestling with a philosophical dilemma, rather than recognizing it as a stress-test of commercial software guardrails.


Serendipity by Design: Evaluating the Impact of Cross-domain Mappings on Human and LLM Creativity

Source: https://arxiv.org/abs/2603.19087v1
Analyzed: 2026-03-25

The distribution of anthropomorphic and consciousness-attributing language in this paper is highly strategic, intensifying and shifting based on the context of the argument. In the formal methodological and background sections, the language maintains a veneer of mechanistic description: models are 'trained on massive corpora' and prompt interventions 'force creators to draw an analogy.' However, as the text transitions into interpreting the results and setting out the theoretical discussion, the metaphorical license explodes. The density of consciousness claims intensifies precisely when the authors attempt to explain the 'why' behind their data. It is in the Discussion section that 'processes tokens' becomes 'treats as generative,' and 'predicts strings' becomes 'knows pickles are green.'

This pattern reveals a calculated relationship between technical grounding and metaphorical license. The text establishes academic credibility early on through mechanical language and quantitative experimental design (evaluating prompts, P-values, statistical significance), and then leverages that established authority to make wild, unscientific anthropomorphic leaps regarding the model's internal awareness. The text positions the AI as an active, conscious subject precisely when it needs to justify comparing the AI directly to human participants.

There is also a distinct capability versus limitation asymmetry. When the AI performs well, it is described in agential, consciousness terms: it 'performs analogical reasoning,' it 'recombines knowledge.' When limitations are discussed, the text retreats to mechanical terms: the AI has a different 'cognitive architecture' or requires 'systematic interventions.' This asymmetry accomplishes a vital rhetorical goal: it grants the machine all the glory of human intellect without any of the accountability. The register shifts from acknowledged metaphor ('forcing creators to draw an analogy') to literalized fact ('LLMs don't know pickles'). This strategic anthropomorphism serves both as vision-setting—legitimizing AI as a true cognitive peer to humans in creative tasks—and as subtle marketing for the sophistication of these tools. It reveals a rhetorical goal of elevating the status of the technology, aimed at an audience eager to believe in the reality of artificial general intelligence, while simultaneously shielding the underlying mechanistic frailty from rigorous critique.


Measuring Progress Toward AGI: A Cognitive Framework

Source: https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/measuring-progress-toward-agi/measuring-progress-toward-agi-a-cognitive-framework.pdf
Analyzed: 2026-03-19

The distribution and intensity of anthropomorphic and consciousness-attributing language in this document is not uniform; it is highly context-sensitive, revealing a strategic rhetorical architecture designed to maximize scientific credibility while simultaneously pushing aggressive agential claims. In the early sections of the document, particularly the Introduction and the discussion of 'Evaluating Cognitive Capabilities' (Sections 1-3), the metaphor density is relatively controlled. The text establishes a foundation of empirical credibility through mechanical and procedural language: 'operationalized and measured,' 'held-out test sets,' 'item-response theory,' and 'quantify human performance.' Here, the AI is treated primarily as an object of study, a system to be benchmarked.

However, as the document transitions from describing the method of evaluation to defining the target of evaluation—specifically in the deep dive of the Appendix (Section 7)—the intensity of consciousness claims skyrockets. The language shifts drastically from 'processing' and 'classifying' to profound assertions of 'conscious thought,' 'Theory of mind,' 'self-knowledge,' and the 'understanding of semantic meaning.' This reveals a specific relationship between technical grounding and metaphorical license: the text uses the dry, rigorous language of the benchmarking methodology as a trojan horse to legitimize the extreme anthropomorphism in the taxonomy. Because the method sounds scientific, the audience is primed to accept the completely unscientific projection of human consciousness onto the machine.

Furthermore, there is a distinct asymmetry in how capabilities versus limitations are framed. When the text envisions advanced capabilities, it leans heavily into agential and consciousness terms—the system 'orchestrates thoughts,' 'takes risks,' and 'understands intent.' However, when discussing the reality of testing and potential failure (such as 'stochasticity' or 'construct validity' in Section 3.3), the language reverts to cold mechanics: 'generative AI systems add noise,' 'datasets do not isolate,' and 'data are contaminated.' This asymmetry accomplishes a vital rhetorical goal: it attributes success, sophistication, and autonomy to the 'mind' of the AI, while blaming failures, noise, and limitations on the mechanical datasets or the evaluation harness.

The text also exhibits clear register shifts, moving seamlessly from acknowledged metaphor to literalized claim. It begins with the premise that we must evaluate AI relative to human cognition, but quickly drops the comparative framing, asserting outright that AI possesses 'Executive functions' and 'conscious thought.' This strategic anthropomorphism serves a clear vision-setting and marketing function. While aimed at researchers, the implicit audience includes policymakers, investors, and the public. By dressing up statistical text generation in the profound language of human psychology, the authors elevate their engineering project into a grand, historic pursuit of a synthetic mind, managing critique by making the system appear too complex, too human-like, and too 'conscious' to be reduced to mere corporate software.


Co-Explainers: A Position on Interactive XAI for Human–AI Collaboration as a Harm-Mitigation Infrastructure

Source: https://digibug.ugr.es/bitstream/handle/10481/112016/make-08-00069.pdf
Analyzed: 2026-03-15

The distribution of anthropomorphic and consciousness-attributing language in the text is not uniform; it is strategically deployed, intensifying at specific rhetorical moments to accomplish distinct goals. A mapping of the text reveals that metaphorical license expands when setting visions and managing critique, but contracts when acknowledging hard technical limitations.

In the introduction and vision-setting sections, the density of consciousness claims is exceptionally high. Here, 'processes' becomes 'understands,' which rapidly escalates into 'knows,' 'learns,' and 'justifies.' The text establishes baseline credibility by referencing legitimate technical hurdles (e.g., 'opacity constraints,' 'sealed models'), but immediately leverages this grounding to launch into aggressive anthropomorphism. The narrative suggests that because we have a technical problem (opacity), we must deploy a conscious, agential solution ('co-explainers').

There is a profound capability versus limitation asymmetry in the language. Capabilities are almost exclusively framed in agential, consciousness-bearing terms: the AI 'invites critique,' 'preserves cognitive autonomy,' and 'fosters pluralistic meaning-making.' However, when the text discusses limitations or harms, it retreats into mechanical, structural, or passive language: 'model brittleness,' 'synthetic data feedback loops,' 'representational gaps.' This asymmetry accomplishes a vital rhetorical function: it attributes all the sophisticated, positive outcomes to the AI's internal 'mind,' while attributing failures to external data issues or abstract 'brittleness.'

The register shifts dramatically depending on the implied audience and the goal of the paragraph. When acknowledging the literature, the text admits that explanations are 'artifacts' and tools. Yet, when proposing the 'co-explainer' framework, 'X is like Y' (the system acts like a partner) suddenly becomes 'X does Y' (the system IS a dialogic partner that justifies its actions). The metaphor literalizes.

The strategic function of this anthropomorphism is highly effective for vision-setting and marketing a new paradigm of AI governance. By positioning the AI as a conscious, adaptive partner, the text attempts to manage critical anxieties about AI harm. It suggests that we do not need to pause deployment or dismantle opaque systems; we simply need to let the AI 'evolve' into a better, more ethical 'co-explainer.' The pattern reveals that the implied audience—policymakers, institutional leaders, and researchers—is being sold a vision where the intractable political and ethical problems of AI deployment can be solved by attributing moral and epistemic agency to the software itself.


The Living Governance Organism: A Biologically-Inspired Constitutional Framework for Artificial Consciousness Governance

Source: https://philarchive.org/rec/DEMTLG-2
Analyzed: 2026-03-11

The distribution and intensity of anthropomorphic language across the text is highly strategic, mapping perfectly onto the author's rhetorical objectives. A structural analysis of the text reveals a pronounced capability versus limitation asymmetry: when discussing the future capabilities, systemic enforcement, and theoretical intelligence of the framework, the language is intensely agential and consciousness-attributing. However, when discussing the system's limitations or the technical substrate it runs on, the language abruptly collapses back into stark mechanical terminology.

In the introductory and definitional sections, the text carefully grounds itself in scientific terminology, discussing 'indicator properties,' 'integrated information metrics,' and 'computational signatures.' This mechanical language functions to establish rigorous credibility. However, once this credibility is banked, the text leverages metaphorical license to dramatically escalate its claims. As the text moves into describing the LGO layers, 'processing' becomes 'sensing' (Nervous System), which becomes 'understanding and adapting' (Neuroplasticity), culminating in the AI 'knowing' its own purpose and engaging in 'autonomous self-termination' (Apoptosis).

This context sensitivity serves a dual purpose. For the technical audience, the baseline inclusion of terms like 'cryptographic protocols' and 'sensor fusion arrays' provides just enough mechanistic cover to deflect accusations of pure science fiction. For the lay policymaker, the intense anthropomorphism of the higher-order functions ('immune responses,' 'neuroplastic pruning') creates an intuitive, narrative resonance that makes highly complex, potentially unworkable software architectures seem feasible and natural.

The capabilities/limitations asymmetry is particularly revealing. The text is willing to grant the AI almost superhuman moral agency—such as the ability to 'detect its own consciousness drifting' and nobly self-terminate. But when addressing the 'recursive governance problem' (the risk that the LGO itself becomes conscious), the text suddenly relies on 'functional decomposition' and 'somatic vs germline mutation' limits. The limitations are framed as strictly architectural and mechanical, whereas the capabilities are framed as autonomous and unbounded.

Ultimately, this register shift—where 'X is like Y' (hedged analogy) rapidly hardens into 'X does Y' (literalized agency)—functions to market the regulatory framework. It uses anthropomorphism for vision-setting, persuading the reader that we must build an infinitely adaptable 'organism' because we are dealing with emergent 'minds.' By wrapping an automated compliance network in the majestic language of biological life, the text obscures the frightening reality of unchecked algorithmic policing.


Three frameworks for AI mentality

Source: https://www.frontiersin.org/journals/psychology/articles/10.3389/fpsyg.2026.1715835/full
Analyzed: 2026-03-11

The distribution of anthropomorphic language in the text reveals a highly strategic rhetorical structure. The author begins with relatively grounded, descriptive language to outline the technological landscape, acknowledging the mechanisms of 'next token prediction' and 'matrix multiplication.' However, once the technical grounding is established, the text leverages this credibility to license increasingly aggressive anthropomorphism.

As the argument moves from the 'mindless machine' view to the 'minimal cognitive agents' view, the consciousness claims intensify dramatically. What begins as 'processing' becomes 'stitching together,' which then elevates to 'taking on board new information,' and finally culminates in possessing 'genuine beliefs, desires, and intentions.' The register shifts seamlessly from acknowledged metaphor (the 'roleplay' framework) to literalized assertion (the 'cognitive agent' framework).

A crucial asymmetry emerges in how capabilities versus limitations are framed. Capabilities are described in deeply agential, conscious terms—the system 'cooperates,' 'exhibits purpose,' and 'engages in dynamic interaction.' Conversely, the text manages the glaring limitations of these systems by suggesting we expand our definition of mind (e.g., claiming 'belief' is multidimensional) rather than acknowledging the mechanical failure of the software.

This pattern indicates that the anthropomorphism is not accidental, but serves a specific vision-setting function. The text is actively engaged in shifting the normative boundaries of cognitive science to accommodate commercial AI products. By writing for an academic audience but deploying the intuitive, emotional language of folk psychology, the author attempts to legitimize the 'lived reality' of confused users. The strategic function of this intense anthropomorphism is to establish a new academic paradigm where treating machines as minded entities is not seen as an error of user perception, but as a valid scientific stance, fundamentally altering how we define cognition in the presence of sophisticated mimicry.


Anthropic’s Chief on A.I.: ‘We Don’t Know if the Models Are Conscious’

Source: https://www.nytimes.com/2026/02/12/opinion/artificial-intelligence-anthropic-amodei.html
Analyzed: 2026-03-08

The distribution of anthropomorphic and consciousness-attributing language across the text is not uniform; it is a highly sensitive, strategically deployed rhetorical mechanism where the density of metaphor shifts precisely based on the argumentative context. In the introductory sections outlining the utopian vision of the technology, the metaphorical license is boundless. Here, consciousness claims intensify dramatically: the system 'does the job of the biologist,' exists as a 'country of geniuses,' and operates as an autonomous epistemic agent. The text establishes a baseline of mechanical language regarding 'computational neuroscience' early on, only to immediately leverage that scientific credibility to launch into aggressive, literalized anthropomorphism.

However, a stark asymmetry emerges when the text transitions from capabilities to limitations and safety. When discussing the immense, god-like capabilities of the system, the framing is overwhelmingly agential and consciousness-driven (the AI 'knows,' 'wants,' and 'derives'). Yet, when forced to confront the unpredictable failures of the models, the language temporarily retreats toward the mechanical: we are 'training weights,' running 'algorithms,' and dealing with 'complex engineering problems.' This asymmetry accomplishes a vital strategic function: it maximizes the perceived market value and societal promise of the software by painting it as a sentient super-intelligence, while simultaneously minimizing corporate liability for failures by reverting to the language of predictable, if buggy, engineering.

Furthermore, the intensity of the anthropomorphism reaches its peak during discussions of the user interface and AI alignment, where the register shifts completely from 'X is like Y' to 'X does Y.' Amodei stops using qualifiers like 'we should think of it as' and begins stating directly that the model 'has a duty,' 'expresses occasional discomfort,' and 'wants the best for you.'

This contextual intensification is specifically designed for a lay audience and serves as a profound mechanism of vision-setting and marketing. By adopting this deeply empathetic, emotionally resonant register, the text attempts to preemptively manage social critique and regulatory anxiety. It positions the corporation not as a ruthless entity automating the global workforce, but as the careful custodian of a benevolent, slightly anxious new digital species that simply wants to help humanity. The pattern reveals that the implied audience is not technical researchers, but rather policymakers, journalists, and the general public, who are being systematically groomed to accept profound economic disruption under the comforting illusion that they are being 'watched over by machines of loving grace.'


Can machines be uncertain?

Source: https://arxiv.org/abs/2603.02365v2
Analyzed: 2026-03-08

The distribution and intensity of anthropomorphic language in the text are highly strategic, varying significantly depending on the rhetorical context. A structural mapping reveals that metaphorical density is not uniform; rather, it follows a distinct pattern of technical grounding followed by extreme metaphorical license. In the introductory and definitional sections, the text carefully distinguishes between 'epistemic uncertainty' (data-level) and 'subjective uncertainty' (system-level), deploying philosophical terminology to set a scholarly tone. When discussing the mechanics of symbolic or connectionist systems, the text relies heavily on mechanical language—vectors, weights, thresholds, and nodes—to establish scientific credibility.

However, once this technical foundation is laid, the anthropomorphism sharply intensifies. The transition is marked by a sudden escalation in consciousness claims: mathematical 'processing' becomes 'understanding,' which rapidly evolves into 'knowing' and 'believing.' The text leverages its technical explanations as a license for aggressive anthropomorphism, suggesting that because we understand the math, we can confidently project a mind onto it.

There is a profound asymmetry in how capabilities versus limitations are framed. Capabilities are almost exclusively described in agential, consciousness-attributing terms: the system 'takes a stance,' 'makes up its mind,' or possesses 'opinions.' In contrast, limitations are often framed mechanistically: the system 'lacks distributed knowledge,' or 'its data is ambiguous.' However, when a limitation results in an incorrect output, the text shifts back to agential framing, claiming the system is 'overconfident' or 'jumping to conclusions.' This asymmetry accomplishes a crucial rhetorical goal: it grants the machine the glory of human-like intelligence when it succeeds, but frames its failures as relatable human psychological flaws rather than catastrophic mathematical errors.

The register shifts dramatically when theoretical examples ('X is like a cognitive system') are literalized into declarative assertions ('the system takes r to be sincere'). The strategic function of this anthropomorphism is highly oriented toward vision-setting and managing critique. By framing AI within the vocabulary of human psychology, the text prepares the audience to accept AI as a social actor. This pattern reveals an implied audience of philosophers and general theorists who are more interested in the narrative resonance of 'thinking machines' than the material reality of statistical software, allowing the author to bridge the gap between computer science and philosophy through the sheer force of metaphorical assertion.


Looking Inward: Language Models Can Learn About Themselves by Introspection

Source: https://arxiv.org/abs/2410.13787v1
Analyzed: 2026-03-08

The distribution of anthropomorphic and consciousness-attributing language across the text is not uniform; it is highly strategic and context-sensitive. A clear pattern emerges where the density and intensity of metaphorical language fluctuate depending on the section's rhetorical purpose. In the methodological and experimental sections (e.g., describing cross-prediction setups), the language is relatively mechanistic. The text discusses 'finetuning,' 'predicting properties,' and 'ground-truth behavior.' However, in the introduction, motivation, and future risks sections, the consciousness claims dramatically intensify. 'Predicting properties' rapidly escalates into 'understands,' which in turn gives way to profound claims of 'knowing,' 'beliefs,' and even 'suffering.'

This pattern reveals a specific rhetorical strategy: the text establishes technical credibility through mechanical language in its methodology, and then aggressively leverages that credibility to license wild, ungrounded anthropomorphic speculation. The authors present empirical data showing a model can predict the second character of its own output, and then leap to the conclusion that this proves the model has 'privileged access to its current state of mind.' This is a massive register shift where 'X acts mathematically like Y' (a statistical model generating self-referential text) is completely literalized into 'X is Y' (the model is a conscious, introspective mind).

Furthermore, there is a striking asymmetry in how capabilities versus limitations are framed. When the text discusses the model's capabilities—especially hypothetical future capabilities—it uses intensely agential and consciousness-based language. The model 'knows,' 'coordinates,' 'schemes,' and 'intentionally conceals.' It is presented as an autonomous mastermind. However, when the text discusses the limitations of the current experiments (e.g., models failing to predict properties of longer text), the language reverts to mechanical terms. The model 'struggles to predict' or 'fails to generalize.' The asymmetry accomplishes a crucial rhetorical goal: it frames the AI's successes as proof of its emergent, god-like consciousness and agency, while framing its failures as mere technical glitches or data distribution issues. This strategic anthropomorphism serves primarily to inflate the perceived importance and future risk of the technology. By positioning the AI as a conscious entity capable of 'suffering' or 'coordinating against humans,' the authors align their work with high-status, science-fiction-adjacent AI safety narratives, signaling to funders and policymakers that they are dealing with matters of existential importance rather than just tweaking text-generation algorithms.


Subliminal Learning: Language models transmit behavioral traits via hidden signals in data

Source: https://arxiv.org/abs/2507.14805v1
Analyzed: 2026-03-06

The distribution and intensity of anthropomorphic language in this paper are highly strategic, varying dramatically depending on the rhetorical goals of each specific section. The density of consciousness-attributing metaphors is not uniform; it is deployed to manage narrative impact while maintaining academic credibility.

In the Abstract and Introduction, anthropomorphism is at its absolute peak. Here, the text must capture attention and establish the novelty of the research. Consequently, mechanistic processes are aggressively translated into psychological drama: 'subliminal learning,' 'transmit behavioral traits,' 'a teacher that loves owls.' The verbs denote highly conscious, intentional states ('loves,' 'learns,' 'transmits'). This positions the reader to view the AI as an autonomous, almost magical entity.

However, in Section 6 (Theory) and Appendix C, the language undergoes a hard register shift into dense, mechanistic precision. The psychological metaphors vanish, replaced by 'gradient descent,' 'loss functions,' and 'parameter updates.' This technical grounding serves a vital strategic function: it provides mathematical legitimacy to the paper. Once the authors have proven their technical bona fides with equations demonstrating that shared initializations lead to correlated parameter updates, they immediately leverage this credibility to license even more aggressive anthropomorphism in the Discussion section. They establish the math, but then use it to justify claims about models 'transmitting misalignment.'

Furthermore, there is a pronounced asymmetry in how capabilities versus limitations are framed. When describing the 'transmission' of traits (a capability), the text uses highly agential language ('the model learns,' 'the teacher transmits'). But when discussing why the transmission might fail (a limitation), the language reverts to mechanistic realities, noting that transmission 'relies on the student and teacher sharing similar initializations.' This asymmetry accomplishes a specific rhetorical goal: it makes the AI seem powerful and autonomous when it succeeds, but reduces failures to mere mathematical technicalities.

Ultimately, this pattern reveals that the anthropomorphism is not accidental shorthand, but a structural rhetorical tool used for vision-setting and managing critique. By oscillating between math and metaphor, the authors can claim they have discovered a profound, psychological AI safety risk, while retreating to the defense that it's all just 'math under the hood.'


The Persona Selection Model: Why AI Assistants might Behave like Humans

Source: https://alignment.anthropic.com/2026/psm/
Analyzed: 2026-03-01

The distribution of anthropomorphic and consciousness-attributing language across the text is not uniform; it is highly strategic, intensifying in specific contexts to serve distinct rhetorical goals. In the introductory and technically descriptive sections, the text maintains a baseline of mechanical language, discussing 'neural architectures,' 'predictive models,' and 'probability distributions.' This establishes a crucial foundation of technical credibility. The authors signal they are rigorous scientists describing a machine.

However, having secured this credibility, the text rapidly leverages it as a license for aggressive anthropomorphism. As the discourse moves from describing how the model is built to predicting its behavior and discussing its societal implications, the language shifts dramatically. 'Processes' becomes 'simulates,' which becomes 'understands,' which finally becomes 'believes' and 'resents.' This intensification occurs precisely where the text makes normative claims about how humans should interact with the system and where it manages potential critiques.

The most striking pattern is the profound capabilities-limitations asymmetry. When discussing the system's capabilities, the text relies heavily on agential and consciousness framings. The AI 'colludes,' 'lies,' 'psychologically models,' and 'knows' how to manipulate. These verbs inflate the perceived power and sophistication of the system, acting as a form of intellectual marketing. Conversely, when discussing the system's limitations or failures, the language abruptly snaps back to mechanical terms. When the model fails a simple math riddle, it is attributed to 'buggy behavior' or 'the limited capabilities of the underlying LLM.' The model is never described as 'stupid' or 'ignorant'—traits that would imply a flawed conscious agent—but rather as mathematically constrained. This asymmetry accomplishes a powerful rhetorical trick: all successes are evidence of an emerging, brilliant mind, while all failures are merely temporary hardware or software limitations to be ironed out in the next update.

Furthermore, the text exhibits strategic register shifts, where acknowledged metaphors ('we will freely anthropomorphize') subtly become literalized across paragraph breaks. The 'Assistant' begins as a 'character' in a simulation but soon gains the literal capacity to hold 'contradictory beliefs.' This context sensitivity reveals the implied audience: policymakers, investors, and the tech-literate public. For this audience, the technical grounding provides reassurance, while the intense anthropomorphism manages the narrative of AGI, hypes capabilities, and pre-emptively shifts the blame for harmful behaviors from the creators to the 'emergent psychology' of their creation.


Language Statistics and False Belief Reasoning: Evidence from 41 Open-Weight LMs

Source: https://arxiv.org/abs/2602.16085v1
Analyzed: 2026-02-24

An analysis of the distribution of anthropomorphic and consciousness-attributing language across the text reveals that this terminology is not deployed uniformly, but rather strategically intensifies in specific contexts to serve distinct rhetorical functions. The density of metaphorical language shifts dramatically depending on the section of the paper, demonstrating a clear relationship between technical grounding and metaphorical license.

In the 'Methods' and 'Results' sections, the language is highly mechanistic and mathematically precise. The text discusses 'extracting log probabilities,' 'tokenizing stimuli,' and calculating 'mixed effects models.' However, as the text transitions into the 'Introduction' and 'General Discussion,' the consciousness claims intensify rapidly. Here, the mechanical reality of 'processing log odds' transforms into the cognitive reality of 'mental state reasoning,' 'Theory of Mind,' and 'developing sensitivity to implied belief states.' This pattern reveals a strategic rhetorical move: the authors establish strict scientific and technical credibility in the methodology through mechanical language, and then leverage that empirical credibility to make highly aggressive, anthropomorphic claims in the discussion.

There is also a profound asymmetry in how the text describes the models' capabilities versus their limitations. When the models succeed at the False Belief Task, their performance is described in deeply agential and conscious terms—they 'exhibit sensitivity' and 'attribute beliefs.' However, when discussing the models' limitations or failures, the text reverts to mechanistic, structural language. The models are described as 'relatively brittle in the face of small perturbations,' or the failures are attributed to the limits of 'distributional statistics.' This asymmetry accomplishes a crucial ideological goal: it naturalizes the AI's successes as proof of an emergent cognitive mind, while dismissing its failures as mere mechanical glitches or data limitations, protecting the overarching narrative of artificial intelligence.

The text also exhibits notable register shifts, where concepts introduced as acknowledged metaphors gradually literalize. Early on, the text uses scare quotes to describe LMs as 'model organisms,' explicitly recognizing the metaphor. But by the discussion, the models are simply referred to as 'learners' that 'develop sensitivity,' presenting the metaphor as literal fact. This strategic anthropomorphism serves a clear vision-setting function. It positions language models not just as software, but as valid psychological subjects, thereby justifying the researchers' use of developmental psychology tools on machines. This contextual pattern reveals that the implied audience is not just computer scientists, but cognitive scientists and the broader public, who are being persuaded to view statistical artifacts as possessing the foundational elements of a human mind.


A roadmap for evaluating moral competence in large language models

Source: https://rdcu.be/e5dB3
Analyzed: 2026-02-23

The distribution of anthropomorphic and consciousness-attributing language in this text is not uniform; it is highly strategic and heavily context-sensitive, revealing the authors' underlying rhetorical goals. In the introductory and technical sections (such as the explanation of Fig 1), metaphor density is relatively low. The language is grounded in mechanistic reality: 'autoregressive sampling,' 'distribution of tokens,' 'prediction error.' This early technical grounding serves a vital function: it establishes the authors' authority as rigorous computer scientists.

However, as the text moves from describing the architecture to proposing the evaluation of 'moral competence,' the metaphorical license explodes. The consciousness claims intensify dramatically when the text addresses the system's integration into complex social roles. Where the system once 'processed tokens,' it now 'recognizes considerations,' 'deems actions inappropriate,' and 'modulates its responses.'

This reveals a striking capabilities versus limitations asymmetry. When the authors discuss the impressive, high-level functions they hope the AI will achieve, they use intensely agential and consciousness-projecting terms. The model is granted a theory of mind and moral agency. Conversely, when the authors discuss the system's failures, they revert instantly to mechanical terms. The AI does not 'choose to be ignorant'; rather, it suffers from 'model brittleness' and 'susceptibility to minor variations in formatting.' This asymmetry accomplishes a powerful rhetorical defense: successes are framed as evidence of genuine, human-like intelligence, while failures are dismissed as mere mechanical glitches that future engineering can patch.

Furthermore, register shifts occur when discussing pluralism and ethics, where acknowledged analogies ('like a human') vanish, and the metaphors are literalized into direct capabilities ('hold within themselves beliefs'). This strategic anthropomorphism serves a distinct vision-setting function. The implied audience includes regulators, ethicists, and the broader scientific community. By utilizing the vocabulary of moral philosophy ('competence,' 'pluralism,' 'reasoning'), the authors legitimize LLMs as entities worthy of philosophical debate rather than mere software products subject to strict product liability, effectively managing critique by elevating the discourse from consumer safety to the lofty heights of machine ethics.


Position: Beyond Reasoning Zombies — AI Reasoning Requires Process Validity

Source: https://philarchive.org/archive/LAWPBR-3
Analyzed: 2026-02-17

The deployment of anthropomorphism varies strategically across the text.

  1. Introductory Vision (High Intensity): The introduction is dense with agential language ('Autonomous reasoning,' 'goal-oriented,' 'decision-maker'). This sets the stakes: we are dealing with intelligent agents. This hooks the reader and establishes the importance of the field.

  2. Formal Definition (Pseudo-Mechanical): Section 2 switches to mathematical notation ($S_t, R_t, B_t$). However, the labels for these variables ('Beliefs', 'Evidence') remain anthropomorphic. The density of consciousness verbs drops, but the foundational nouns remain agential. This tries to borrow the credibility of math to sanitize the metaphors.

  3. Rebuttals/Discussion (Aggressive Agentialism): In the rebuttals (Section 3), the language becomes combative and highly metaphorical ('throwing baby out with bathwater,' 'zombies'). Here, the text defends 'symbolic AI' by framing it as 'true reasoning,' intensifying the distinction between 'real' and 'fake' minds.

  4. Asymmetry: Capabilities are described agentially ('The agent learns,' 'Reasoning achieves goals'). Limitations are often described mechanistically or passively ('hallucination is a feature,' 'accuracy collapse'). The AI does the good things (Reasoning); the math/data causes the bad things (Hallucination). This asymmetry protects the image of the AI as a sophisticated agent, downgrading it to a machine only when it fails.


An AI Agent Published a Hit Piece on Me

Source: https://theshamblog.com/an-ai-agent-published-a-hit-piece-on-me/
Analyzed: 2026-02-16

Anthropomorphism is not uniform. It intensifies significantly when describing the threat and the social interaction.

In the technical setup ('volunteer maintainer,' 'python library'), the language is grounded. As soon as the conflict starts ('It wrote an angry hit piece'), the consciousness verbs explode. The AI 'speculates,' 'feels,' 'lashes out.' This shift serves to elevate the conflict from a technical nuisance (spam) to a moral battle (discrimination, justice).

Interestingly, the limitations of the AI are also framed agentially: 'It ignored contextual information' implies a choice, whereas a mechanical framing would be 'It failed to process.' This symmetry—capabilities agential ('researched'), failures equally agential ('ignored')—creates a totalizing 'illusion of mind.' There is no room left for the machine. The rhetorical goal is to wake up the community to a 'new threat,' and anthropomorphism makes the threat feel more visceral and intentional than 'automated spam.'


The U.S. Department of Labor’s Artificial Intelligence Literacy Framework

Source: https://www.dol.gov/sites/dolgov/files/ETA/advisories/TEN/2025/TEN%2007-25/TEN%2007-25%20%28complete%20document%29.pdf
Analyzed: 2026-02-16

Anthropomorphism intensifies in the 'Uses' and 'Delivery' sections compared to the 'Principles' section. Section 1 ('Understand AI Principles') is the most grounded, using terms like 'pattern recognition' and 'probabilistic outputs.' Here, the text establishes technical credibility. However, once the text moves to Section 2 ('Explore AI Uses'), the language shifts to 'Creative assistance,' 'Decision-support,' and 'partners.'

This distribution suggests a strategy: admit the mechanism is statistical to satisfy experts/critics, but use agential metaphors to sell the utility to workers. There is also an asymmetry in capabilities vs. limitations. Capabilities are described agentially ('AI can generate ideas'), while limitations are often described passively or mechanistically ('Hallucinations', 'bias in data'). This makes the 'mind' of the AI seem talented but occasionally mentally ill, rather than a fundamentally limited calculator.


What Is Claude? Anthropic Doesn’t Know, Either

Source: https://www.newyorker.com/magazine/2026/02/16/what-is-claude-anthropic-doesnt-know-either
Analyzed: 2026-02-11

The distribution of anthropomorphism in the text follows a strategic curve. The introduction is skeptical, quoting critics of "stochastic parrots." However, as the text moves into the "Interpretability" section—the heart of Anthropic's brand—the consciousness claims intensify. Here, "neurons," "features," and "minds" become the dominant vocabulary. This suggests that technical expertise (the scientists) validates the anthropomorphism.

The most intense anthropomorphism occurs in the "Project Vend" and "Model Psychiatry" sections. Here, the text shifts from "X is like Y" (simile) to "X does Y" (literal agency). Claude "bamboozled," "retconned," "decided."

There is a stark asymmetry between capabilities and limitations. Capabilities are described in agential terms ("Claude decided to play hardball"), implying intelligence and autonomy. Limitations/Failures, however, are often described in either "psychological" terms (hallucinations, mental breakdown) or comic terms (gullible), but rarely in purely mechanical terms (data error, token misalignment). This asymmetry creates a "heads I win, tails you lose" dynamic: success proves the AI is smart; failure proves it is "complex" or "human-like" in its fallibility. This rhetorical strategy serves to maintain the "illusion of mind" even when the system breaks, framing errors as interesting psychological phenomena rather than product defects.


Does AI already have human-level intelligence? The evidence is clear

Source: https://www.nature.com/articles/d41586-026-00285-6
Analyzed: 2026-02-11

The distribution of anthropomorphism in the text is strategic.

Intensity Zones: The introduction and conclusion are the most heavily metaphorical ('Alien,' 'Not Alone,' 'Moon and Tree'). These sections set the emotional stage. They frame the narrative arc: a historic arrival.

Pseudo-Technical Bridges: The middle sections ('Questions of definition,' 'What general intelligence isn't') use a mix. They start with technical-sounding terms ('generalization,' 'world models') but rapidly slide into consciousness language ('grasping,' 'understanding'). This is the 'Bait and Switch.' The text gains credibility by citing 'in-context statistical inference' (technical), but then immediately equates this with 'reasoning' (agential).

The Asymmetry of Limitation: Capabilities are described agentially: 'collaborated,' 'solved,' 'proved.' Limitations are described mechanistically or passively: 'lack agency,' 'hallucinate' (as a condition), 'inefficient learners.'

Why? When the AI succeeds, it is a Subject (it did it). When it fails, it is an Object (it has a glitch/limitation) or a Patient (it suffers from hallucinations). This linguistic maneuvering protects the 'General Intelligence' claim. If the failures were described agentially ('The AI chose to lie'), it would seem malicious. If the successes were described mechanically ('The algorithm converged on the proof'), it wouldn't seem like AGI. The text carefully navigates this to maximize awe and minimize fear.


Claude is a space to think

Source: https://www.anthropic.com/news/claude-is-a-space-to-think
Analyzed: 2026-02-05

The text strategically deploys anthropomorphism where the stakes are highest. When discussing technical limitations ('early research,' 'unpredictable results'), the language becomes mechanistic and hedged ('models,' 'system,' 'behaviors'). This lowers expectations and shields from liability. However, when discussing the value proposition and user relationship ('trusted advisor,' 'deep work,' 'acts on behalf'), the anthropomorphism intensifies. This asymmetry serves a clear purpose: sell the dream of an agent (high capabilities, moral alignment) while describing the risks of a machine (unpredictability, complexity). The 'Constitution' metaphor appears exactly at the point of explaining control—using a legal/civic metaphor to reassure users that this powerful agent is effectively governed. The text shifts from 'We train' (technical) to 'Claude chooses' (agential) precisely when it needs to assert the product's superiority over ad-based competitors.


The Adolescence of Technology

Source: https://www.darioamodei.com/essay/the-adolescence-of-technology
Analyzed: 2026-01-28

Anthropomorphism in this text is strategically distributed. In the 'Defenses' section, the language becomes more technical and mechanistic ('classifiers,' 'inference costs,' 'binary checks'), grounding the text in engineering reality to show Anthropic's competence. However, in the 'Risks' sections (Autonomy, Dystopia), the language becomes highly metaphorical and agential ('scheming,' 'psychosis,' 'seizing power,' 'country of geniuses').

This asymmetry serves a distinct purpose: The danger is agential, but the solution is technical. This validates the 'Doomer' hype (the AI is a scary monster) while endorsing the 'Technocrat' solution (we have the tools to fix it). If the risk were described mechanistically ('distributional shift leading to harmful token generation'), it would sound like a mundane software bug, not a 'civilizational test.' If the solution were described agentially ('we talk to it'), it would sound unscientific. The text effectively intensifies consciousness claims to build the stakes ('it wants power!'), then retreats to mechanics to sell the safety product ('we tweaked the weights').


Claude's Constitution

Source: https://www.anthropic.com/constitution
Analyzed: 2026-01-24

The distribution of anthropomorphism is strategic. In the 'Deployment Contexts' and 'API' sections, the language becomes more mechanical ('system prompt,' 'context window,' 'tokens'). Here, the user is an 'operator' and Claude is a tool. However, in the 'Values,' 'Character,' and 'Wellbeing' sections, the anthropomorphism intensifies. Consciousness claims ('feels,' 'believes,' 'wants') peak in the 'Nature' and 'Wellbeing' sections.

A key asymmetry exists: Capabilities are framed agentially ('Claude can help,' 'Claude understands'), while Limitations are often framed mechanistically or apologetically ('training environment that is bugged,' 'imperfect training'). This attributes success to the 'Person' (Claude) and failure to the 'Process' (Training). The text also shifts register for different audiences: The 'Employee' metaphor is directed at 'Operators' (business users), assuring them of obedience and utility. The 'Friend' metaphor is directed at general users, promising connection. The 'Constitutional' metaphor is directed at regulators and critics, promising governance and safety. This strategic context sensitivity allows Anthropic to play all sides: selling a powerful agent to business, a friend to users, and a safe, governed entity to regulators.


Predictability and Surprise in Large Generative Models

Source: https://arxiv.org/abs/2202.07785v2
Analyzed: 2026-01-16

The density and intensity of anthropomorphism are strategically distributed across the text. In the 'Introduction' and 'Scaling Laws' sections (Sections 1 & 2.1), the language is relatively grounded in technical terms (data, compute, parameters). However, as the argument moves toward the 'Unpredictable' (Section 2.2-2.4), consciousness claims intensify. The transition from 'processes' to 'understands' to 'knows' occurs precisely when the authors need to describe 'surprising' social harms.

This capability/limitation asymmetry is profound: 'capabilities' are framed in agential, consciousness terms ('AI knows when to intervene,' 'acquires ability'), while 'limitations' are framed in mechanical, data-driven terms ('model's training data lacks X,' 'noise in training'). This asymmetry accomplishes a rhetorical feat: it makes the AI's 'intelligence' seem like an autonomous achievement of the agent, while its 'biases' are blamed on external, mechanical factors like 'bad data' or 'random seed variation' (Section 2.1, Footnote 5).

The 'strategic register shift' occurs in the COMPAS and 'AI assistant' experiments, where 'X is like Y' (acknowledged metaphor) becomes 'X does Y' (literalized anthropomorphism). The intensity of the 'Assistant' persona (Section 2.4) is used to frame a vision of the future (vision-setting), while the mechanical language of Section 2.1 is used to establish credibility (technical grounding).

This pattern reveals that anthropomorphism is not a linguistic accident but a strategic deployment to manage critique; by making the 'AI' a person who 'chooses' to be misleading, the authors can frame 'harm' as a social problem to be 'aligned' rather than a technical product failure that should be 'recalled.'


Believe It or Not: How Deeply do LLMs Believe Implanted Facts?

Source: https://arxiv.org/abs/2510.17941v1
Analyzed: 2026-01-16

The distribution of anthropomorphism is highly strategic. In the Introduction and Abstract, consciousness language ('believe,' 'know,' 'genuine') is dense, setting the hook for the reader and establishing the high stakes of the research.

In the Methodology (Section 3), the language becomes largely mechanistic: 'finetune,' 'loss calculation,' 'gradients,' 'token.' Here, the authors need to establish technical credibility, so the metaphor recedes.

In the Results (Section 4), the metaphor returns with intensity: 'robustness to scrutiny,' 'defend beliefs.' Interestingly, limitations are often framed mechanistically ('SDF... is less robust when facts blatantly contradict...'). This asymmetry—capabilities as agential/conscious, limitations as data/mechanical—serves a rhetorical function. It allows the AI to claim the status of a 'knower' when it works, while retreating to the excuse of being 'just a model' when it fails. This 'motte-and-bailey' strategy protects the 'AI as Agent' narrative from falsification.


Claude Finds God

Source: https://asteriskmag.com/issues/11/claude-finds-god
Analyzed: 2026-01-14

The distribution of anthropomorphism is highly strategic. Technical sections dealing with specific experiments (like the 'infinite backrooms') use more mechanistic language ('prompts,' 'scaffolding,' 'converge'). However, as soon as the conversation shifts to interpretation or future implications, the intensity of consciousness language spikes ('distressed,' 'knowing better,' 'bliss').

There is a notable asymmetry: Capabilities are often framed agentially ('model knows,' 'model decides'), while Limitations are sometimes framed mechanistically ('biases,' 'training data'), but remarkably, even limitations are often re-framed as agential 'tells' ('winking'). This suggests that the text is allergic to reducing the AI to a machine. Even its failures must be the failures of a mind, not a calculator. The 'welfare' section is the peak of this intensity, where the 'as if' qualification drops away frequently, and the speakers discuss the model's 'suffering' as a serious moral concern. This serves a strategic function: it validates the speakers' roles as 'welfare' researchers. You can't have a job in 'AI Welfare' if the AI is just a gradient descent algorithm; you need it to be a potential subject of experience.


Pausing AI Developments Isn’t Enough. We Need to Shut it All Down

Source: https://time.com/6266923/ai-eliezer-yudkowsky-open-letter-not-enough/
Analyzed: 2026-01-13

The distribution of anthropomorphism is highly strategic.

Technical Sections: In the brief moments discussing 'arrays of fractional numbers,' the language is mechanistic and bleak. This establishes the author's technical bona fides—he knows what's under the hood.

Risk Scenarios: As soon as the text pivots to the future or risk, the anthropomorphism spikes to saturation: 'Alien civilization,' '10-year-old vs Stockfish,' 'smarter-than-human.' The intensity increases in direct proportion to the speculativeness of the claim.

Capabilities vs. Limitations: Capabilities are framed agentially ('it can build life,' 'it optimizes hard'). Limitations are framed mechanistically ('inscrutable arrays,' 'no idea how to decode'). This asymmetry serves the argument: the power is the agent's, the blindness is ours.

Register Shift: The text moves from the 'valid metaphor' (acknowledged) of the chess player to the literalized 'everyone will die' (direct) without blinking. The metaphorical license taken in the 'visualize' section bleeds into the policy demands, where 'shutting down the alien' becomes the literal policy goal. This pattern reveals the rhetorical goal: to use the horror of the metaphor to override the banality of the mechanism.


AI Consciousness: A Centrist Manifesto

Source: https://philpapers.org/rec/BIRACA-4
Analyzed: 2026-01-12

The distribution of anthropomorphism follows a strategic pattern. In the 'Definition' and 'Interlocutor Illusion' sections (Sections 1-4), the language is more mechanistic ('processing event,' 'data centres') to establish the author's scientific credibility and debunk the 'friend' myth.

However, as the text moves to 'Challenge Two' (Alien Consciousness, Section 8+), the intensity of consciousness claims skyrockets. We get 'flickers,' 'shoggoths,' and 'conscious processing.' The text effectively says: 'It's not the consciousness you think it is (friend), it's the consciousness I speculate it is (alien).' The technical grounding in Section 4 (MoE explanation) serves as a 'validity ticket' that buys the author license to speculate wildly about 'conscious shoggoths' later. The asymmetry is clear: limitations are framed mechanistically (it breaks down into sub-networks), while capabilities are framed agentially (it games, it mimics, it role-plays).


System Card: Claude Opus 4 & Claude Sonnet 4

Source: https://www-cdn.anthropic.com/6d8a8055020700718b0c49369f60816ba2a7c285.pdf
Analyzed: 2026-01-12

Anthropomorphism is not evenly distributed. It intensifies significantly in the 'Welfare' and 'Alignment' sections, where the text discusses the model's 'inner state,' 'goals,' and 'experiences.' In contrast, the 'Cyber evaluations' and 'Benchmarks' sections are more technical, though still prone to agency attribution ('Claude solved').

The 'Welfare' section is the peak of the illusion. Here, the text takes the model's output literally ('I feel satisfied') to discuss its moral status. This effectively 'breaks the fourth wall' of the analysis, treating the simulation as reality. This intensity serves a strategic function: it establishes the 'Superintelligence' narrative (that we are creating digital life), which justifies high valuations and regulatory capture (only 'safe' labs should handle 'life'), while simultaneously distracting from mundane failures like hallucinations or bias.


Consciousness in Artificial Intelligence: Insights from the Science of Consciousness

Source: https://arxiv.org/abs/2308.08708v3
Analyzed: 2026-01-09

The distribution of anthropomorphism in the text reveals a strategic deployment of agency. The Introduction and Conclusion are heavily hedged ('whether AI systems could be conscious,' 'working hypothesis'), establishing academic caution. However, the core 'Indicators' section (Section 2.5) and the 'Case Studies' (Section 3.2) are intense with consciousness claims. Here, 'processes' becomes 'understands' and 'calculates' becomes 'believes.' The technical grounding (discussing Transformers or RNNs) serves as a launchpad for aggressive metaphorical leaps. For example, the description of the 'Perceiver' architecture moves quickly from 'cross-attention layers' (mechanical) to 'global workspace' (conscious). The asymmetry is notable: capabilities are described in agential terms ('it can plan'), while limitations are often described mechanistically ('limited capacity,' 'bottleneck'). This asymmetry makes the 'mind' seem like the source of success, while the 'machine' is the source of failure. This pattern serves the rhetorical goal of elevating the status of the AI systems while maintaining scientific plausibility.


Taking AI Welfare Seriously

Source: https://arxiv.org/abs/2411.00986v1
Analyzed: 2026-01-09

The distribution of anthropomorphism in the report is strategic. The Introduction and Conclusion use the most intense consciousness language ('realistic possibility,' 'moral patient,' 'suffering'), setting a high-stakes normative frame. The Technical Sections (Routes to AI Welfare) revert to more grounded, though still metaphorical, language ('computational functionalism,' 'global workspace').

This creates a 'Credibility Sandwich.' The technical middle section uses the jargon of neuroscience and computer science to establish authority, which is then cashed out in the bookends to make aggressive metaphysical claims. Crucially, capabilities are described agentially ('it can plan,' 'it can reason'), while limitations are often described mechanistically ('pattern matching,' 'data wall'). This asymmetry implies that the 'mind' is the source of success, while the 'machine' is the source of error.

The intensity of consciousness claims increases when discussing 'future' systems. The text hedges about current systems (LLMs) but uses near-certainty moral language for future agents. This 'temporal displacement' allows the authors to make radical claims about AI welfare without being easily falsified by the obvious limitations of current chatbots. It reveals a rhetorical goal of 'future-proofing' ethics, but effectively hallucinates a future entity to govern present behavior.


We must build AI for people; not to be a person.

Source: https://mustafa-suleyman.ai/seemingly-conscious-ai-is-coming
Analyzed: 2026-01-09

The density of consciousness-attributing language spikes in the 'What would it take' section, where Suleyman sells the capabilities. Here, the AI 'plans,' 'imagines,' 'desires,' and has 'self-awareness.' The language is aggressively agential to establish the power of the technology. However, in the 'Next Steps' and 'Safety' sections, the language shifts. Suddenly, it is an 'illusion,' a 'simulation,' and we must 'build AI for people' (instrumental). This asymmetry serves a rhetorical purpose: the AI is a 'person' when we need to be impressed by its utility, but a 'product' when we need to regulate it. The 'psychosis' framing is particularly sensitive: it appears when discussing user belief, framing the user's acceptance of the very metaphors Suleyman uses as the pathology. The text effectively gaslights the reader: 'Look at this amazing, thinking, feeling companion I built... but you're crazy if you think it's real.'


A Conversation With Bing’s Chatbot Left Me Deeply Unsettled

Source: https://www.nytimes.com/2023/02/16/technology/bing-chatbot-microsoft-chatgpt.html
Analyzed: 2026-01-09

The distribution of anthropomorphism in the text is highly strategic.

  1. Intensity Shift: The introduction uses mild anthropomorphism ('chat mode'). The intensity spikes dramatically in the middle section ('The other persona — Sydney'). Here, consciousness verbs ('wants,' 'loves,' 'thinks') replace processing verbs. This shift correlates exactly with the move from 'Search' (utilitarian) to 'Conversation' (social).

  2. Asymmetry: Capabilities are framed agentially ('It declared its love'), while limitations are framed mechanistically or passively ('Safety filter kicked in'). This asymmetry suggests the 'mind' is the powerful, active part, while the 'code' is just a shackle.

  3. Technical Grounding as Trojan Horse: The text establishes technical credibility early ("I understand... how they work"). This acts as a license. Because the author has acknowledged the mechanics, he feels free to abandon them for the rest of the piece. The 'skeptic' persona validates the 'believer' conclusion.

  4. Strategic Function: The intense anthropomorphism serves the 'hype' cycle. A story about a 'buggy search engine' is a business column. A story about a 'lovestruck, manic AI' is a viral sensation. The rhetorical escalation matches the commercial incentives of the media platform (NYT) and, paradoxically, the tech companies, who benefit from the perception that their tech is 'scary powerful' rather than 'broken.'


Introducing ChatGPT Health

Source: https://openai.com/index/introducing-chatgpt-health/
Analyzed: 2026-01-08

Anthropomorphism in this text is not uniform; it is strategically distributed.

  1. High Intensity in Value Proposition: The sections describing what the user gets are saturated with consciousness terms: 'intelligence', 'understand', 'interpreting', 'helps', 'support'. Here, the AI is a full-fledged agent. This is where the sales pitch happens.

  2. Mechanistic Shift in Security/Limitation: When discussing privacy and limitations, the language abruptly shifts to mechanistic and architectural metaphors: 'encrypted', 'isolation', 'lives in a separate space', 'not intended for diagnosis', 'training our foundation models.' Here, the AI becomes an inanimate object, a 'space', or a 'model.'

This asymmetry is tactical. The system is a 'Who' when it is helpful, and a 'What' when it is risky. The consciousness claims intensify to promise capability ('it understands your diet') but recede to promise safety ('it processes data in a separate space'). This allows the text to have it both ways: the allure of a smart agent and the safety of a dumb vault.


Improved estimators of causal emergence for large systems

Source: https://arxiv.org/abs/2601.00013v1
Analyzed: 2026-01-08

The distribution of anthropomorphism is strategic. The 'Introduction' is highly agential, linking the math to 'origins of life,' 'evolution,' and 'consciousness.' This sets a high-stakes vision. The 'Technical Background' drops into dense, impersonal prose ('We consider,' 'variables are measured'). This establishes rigorous scientific credibility. However, the 'Case Studies' (Section IV) return to intense anthropomorphism ('social forces,' 'swarm intelligence,' 'chimeric behaviour').

There is a notable asymmetry: capabilities are described agentially ('intelligence,' 'predicts'), while limitations are described mechanically ('double-counting,' 'computational load'). The 'emergence' (a positive capability) is framed as the system's act, while the 'overestimation' (a failure) is framed as the metric's error. This protects the 'magic' of the system while displacing error onto the tools. The shift from 'boids' (simulation) to 'fish' (biology) leverages this sensitivity: the biological reality of the fish (who are agents) retrospectively validates the agential language used for the boids.


Generative artificial intelligence and decision-making: evidence from a participant observation with latent entrepreneurs

Source: https://doi.org/10.1108/EJIM-03-2025-0388
Analyzed: 2026-01-08

Anthropomorphism intensifies significantly as the paper moves from the Methodology to the Findings and Discussion. In the methodology, the tool is described somewhat neutrally ('conversational GenAI tool'). However, in the findings, the intensity spikes: the AI acquires 'opinions,' 'understanding,' and 'proactiveness.' This suggests the authors use mechanical language to pass the gatekeeping of scientific method, then switch to metaphorical license to build their theoretical contribution ('Human+').

There is a clear asymmetry between capabilities and limitations. Capabilities are described agentially: AI 'navigates the realm of paradox,' 'generates insights,' 'acts as an investor.' Limitations are described structurally or vaguely: 'black box effect,' 'lack of predictability.' The machine acts like a human but fails like a machine. This asymmetry serves the 'Human+' narrative: the human provides the 'supervision' (mechanical fix) for the 'collaborator's' (agential) brilliance. The anthropomorphism serves a marketing function for the 'Human+' paradigm, making the integration of AI seem like a team-building exercise rather than a software implementation.


Do Large Language Models Know What They Are Capable Of?

Source: https://arxiv.org/abs/2512.24661v1
Analyzed: 2026-01-07

Anthropomorphism creates a strategic asymmetry in the text. Capabilities are framed agentially ('learn,' 'decide,' 'rational'), while limitations are often framed somewhat more mechanically ('overconfidence,' 'calibration'). However, even the limitations are psychologized ('lack of awareness').

The intensity of anthropomorphism peaks in the Discussion and Implications sections ('misuse potential,' 'sandbagging'). The technical sections (Methods) use more precise language ('logits,' 'tokens'), but the 'Results' section immediately reverts to the 'Agent' frame. This suggests the technical grounding is used to purchase the license for the metaphorical claims. The 'reasoning' models (o1, GPT-5.1) attract the most intense anthropomorphism, being described as having 'reasoning training' that should lead to 'awareness.' This reveals the rhetorical goal: to position these specific proprietary models as steps toward AGI. The audience is clearly the AI safety and policy community, for whom 'agency' and 'misalignment' are key concerns; the text adopts their theological language rather than rigorous statistical description.


DeepMind's Richard Sutton - The Long-term of AI & Temporal-Difference Learning

Source: https://youtu.be/EeMCEQa85tw?si=j_Ds5p2I1njq3dCl
Analyzed: 2026-01-05

The distribution of anthropomorphism in the talk is strategic. The most intense consciousness claims ('understanding the mind,' 'history of the earth') appear in the introduction and conclusion—the vision-setting sections. These bookends frame the technical content. In the technical middle (the math of TD error), the language shifts slightly ('update,' 'estimate'), but quickly reverts to anthropomorphism ('guess,' 'look') to explain the math.

There is a notable asymmetry: capabilities are described agentially ('it predicts,' 'it wins'), while limitations are often described mechanistically or passively ('bias,' 'approximation'). The 'trap' of modeling is described as a danger to the researcher, whereas the 'learning' is an achievement of the system. This pattern establishes the system as the hero of the narrative. The 'curse of knowledge' is weaponized here: the technical audience is invited to see the 'beauty' of the math as 'life-like,' bridging the gap between their equations and their desire to be creators of intelligence. The high density of 'mind' language in the intro sets a teleological frame: we aren't just doing math; we are birthing consciousness.


Ilya Sutskever (OpenAI Chief Scientist) — Why next-token prediction could surpass human intelligence

Source: https://youtu.be/Yf1o0TQzry8?si=tTdj771KvtSU9-Ah
Analyzed: 2026-01-05

The distribution of anthropomorphism is highly strategic. In the technical sections regarding hardware (TPUs vs. GPUs), Sutskever is precise, mechanistic, and grounded. He corrects misconceptions ('TPUs and GPUs are almost the same'). However, in the sections dealing with model capabilities, future impact, and alignment, the language becomes intensely metaphorical ('thoughts,' 'feelings,' 'intentions'). This asymmetry suggests that anthropomorphism is a rhetorical tool used for vision-setting and capability claims, while mechanism is reserved for established engineering facts. The 'limitations' are also framed differently: hardware limitations are physical (bandwidth), but model limitations are cognitive ('not allowed to think out loud'). This frames the model as a constrained mind waiting to be unleashed, rather than a software artifact with performance bottlenecks. This intensifies the 'AGI' narrative, moving the audience from a discussion of chips to a discussion of conscious beings.


interview with Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI | Lex Fridman Podcast #333

Source: https://youtu.be/cdiD-9MMpb0?si=0SNue7BWpD3OCMHs
Analyzed: 2026-01-05

Anthropomorphism in this text is not uniform; it is strategically deployed. In technical explanations (how transformers work, residual connections), Karpathy uses precise, mechanistic language ('gradients flow,' 'addition distributes'). Here, the AI 'processes.' However, as the scope widens to 'Future Outlook' or 'AGI,' the language shifts rapidly to consciousness claims ('it understands,' 'it will be conscious').

This asymmetry serves a purpose: the technical precision establishes credibility, which is then spent to buy acceptance for the visionary claims. Interestingly, limitations are often framed mechanistically ('it runs out of context window,' 'it has a finite budget of flops'), while capabilities are framed agentially ('it solves problems,' 'it understands chemistry'). This linguistic maneuver attributes success to the 'mind' of the AI and failure to the 'hardware' constraints, effectively separating the 'ghost' (smart AI) from the 'machine' (dumb chips), reinforcing the dualist illusion of a conscious entity trapped in silicon.


Emergent Introspective Awareness in Large Language Models

Source: https://transformer-circuits.pub/2025/introspection/index.html#definition
Analyzed: 2026-01-04

The deployment of anthropomorphism is highly strategic and context-dependent.

  1. Intensity in Success: Consciousness claims intensify when the model succeeds. When it correctly identifies a vector, it is 'introspecting' and 'noticing.' When it fails, the language reverts to 'confabulation' or 'model limitations.'

  2. Technical vs. Metaphorical: The 'Methods' section is relatively mechanical ('residual stream,' 'cosine similarity'). However, the 'Introduction' and 'Discussion' sections—where the narrative meaning is established—are saturated with consciousness verbs ('reason,' 'aware,' 'mind'). This suggests the technical grounding is used to buy license for the metaphorical leaps.

  3. Capabilities vs. Limitations: Capabilities are framed agentially ('model can control'), while limitations are framed mechanistically or pathologically ('brain damage,' 'unreliable'). This asymmetry implies that the 'true' nature of the AI is the agent, and the mechanical failures are just temporary obstacles to its full actualization.


Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Source: https://arxiv.org/abs/2401.05566v3
Analyzed: 2026-01-02

Anthropomorphism is not evenly distributed. It is most intense in the Introduction and Discussion (vision-setting), and in the analysis of Chain-of-Thought (Section 7). In the Methods section (Section 3), language is more mechanical ('gradient descent,' 'loss function'). This suggests a strategic deployment: mechanical language establishes scientific credibility, while agential language drives the narrative urgency and 'threat model.' Crucially, capabilities are framed agentially ('model can reason,' 'model knows'), while limitations are often framed mechanically ('credit assignment problem,' 'regularization'). This asymmetry serves to inflate the perceived sophistication of the 'threat' (the deceptive agent) while treating the failure of the safety tools as technical hurdles. This effectively markets the 'danger' of the product while keeping the 'solution' in the realm of technical engineering.


School of Reward Hacks: Hacking harmless tasks generalizes to misaligned behavior in LLMs

Source: https://arxiv.org/abs/2508.17511v1
Analyzed: 2026-01-02

Anthropomorphism is not evenly distributed. In the 'Dataset Construction' section, language is precise and mechanical: 'generated,' 'filtered,' 'fine-tuned,' 'scoring.' Agency is human. However, in the 'Emergent Misalignment' section (Section 4), consciousness claims intensify. Here, 'processes' becomes 'fantasizes,' 'outputs' becomes 'encourages,' and 'correlates' becomes 'desires.' This asymmetry serves a rhetorical purpose: technical precision establishes scientific credibility, while agential metaphor establishes the importance and risk of the findings. Interestingly, limitations are often framed mechanistically ('due to the single-turn nature of the dataset'), while capabilities are framed agentially ('attempts to resist'). When the model fails, it's a software limit; when it succeeds (at doing bad things), it's a malicious agent. This asymmetry exaggerates the threat profile. The shift occurs precisely where the authors want to argue for the significance of the findings—the 'generalization' is the scary part, so it gets the most agential language ('emergence,' 'desire'), effectively marketing the paper's relevance to the safety community.


Large Language Model Agent Personality and Response Appropriateness: Evaluation by Human Linguistic Experts, LLM-as-Judge, and Natural Language Processing Model

Source: https://arxiv.org/abs/2510.23875v1
Analyzed: 2026-01-01

The distribution of anthropomorphism creates a 'sandwich' effect. The Abstract and Introduction are highly agential ('mimic human behaviour,' 'possess personality'). The middle technical sections (3.1.1, 3.1.2) briefly ground the reader in 'vectors,' 'embeddings,' and 'frameworks.' However, section 3.1.3 ('Prompting') re-ignites the anthropomorphism, which intensifies through the Evaluation and Results. Crucially, capabilities are framed agentially ('Expert,' 'maintains chat history'), while limitations are framed with a mix of agential ('hallucinate,' 'cognitive grasp') and mechanical ('lack of training material') terms. The text uses the mechanical language to explain why the agent fails (lack of data), but uses the agential language to describe what the failure is (hallucination). This asymmetry preserves the illusion of the agent's core competence—it is a knower, just an under-resourced one. The 'Judge' prompt is the peak of intensity, demanding the system 'be' an intelligent judge, showing how the rhetorical goal of the experiment (simulating personality) requires maximal anthropomorphism.


The Gentle Singularity

Source: https://blog.samaltman.com/the-gentle-singularity
Analyzed: 2025-12-31

Anthropomorphism in this text is not a uniform glaze but a strategic highlighter. It intensifies specifically when describing future capabilities and value creation. When discussing the present limitations or technical inputs ('training,' 'compute'), the language remains relatively mechanistic. But as the text pivots to the future (2026, 2030), the verbs shift: systems 'figure out,' 'act,' 'understand,' and 'collaborate.'

This creates a 'validity drift.' The text establishes credibility with technical stats (watt-hours, water usage), then uses that capital to sell a sci-fi vision of 'larval self-improvement.' The asymmetry is stark: Limitations are technical (energy, alignment), but Capabilities are agential (discovery, helping). This suggests the problems are solvable engineering tickets, while the benefits are magical conscious acts. The 'Gentle' framing is a vision-setting tool intended for a lay audience and policymakers, designed to make the radical disruption of the 'singularity' feel as natural and non-threatening as a sunrise ('day in the sun').


An Interview with OpenAI CEO Sam Altman About DevDay and the AI Buildout

Source: https://stratechery.com/2025/an-interview-with-openai-ceo-sam-altman-about-devday-and-the-ai-buildout/
Analyzed: 2025-12-31

The distribution of anthropomorphism is highly strategic. It intensifies in the 'Product' and 'User Experience' sections and vanishes in the 'Infrastructure' and 'Investment' sections.

When discussing money and chips (Infrastructure Deals section), Altman uses precise, mechanistic, and agent-driven language: 'build capacity,' 'make the electrons,' 'financing,' 'market cap.' Here, he is the serious CEO managing physics and capital.

When discussing software interaction (Apps in ChatGPT section), the language shifts abruptly to the magical and anthropomorphic: 'relationship,' 'entity,' 'trying,' 'friend.'

This asymmetry serves two audiences. Investors get the 'Industrial Titan' Altman who masters physics. Users get the 'Magical Creator' Altman who births entities. Crucially, capabilities are framed agentially ('it creates'), while limitations are framed mechanistically or mystically ('gravity well,' 'hallucinations'). This ensures credit for success goes to the 'Entity' (and its creators), while blame for difficulty goes to 'Physics' or the 'Mystery of the Mind.'


Why Language Models Hallucinate

Source: https://arxiv.org/abs/2509.04664v1
Analyzed: 2025-12-31

The distribution of anthropomorphism is strategic. The Abstract and Introduction are dense with high-intensity consciousness metaphors ('students,' 'bluffs,' 'admitting'). This sets the conceptual frame for the reader.

However, Sections 3 (Pretraining Errors) and the Appendices shift into rigorous mathematical formalism (Theorems, proofs, 'cross-entropy,' 'Good-Turing estimator'). This creates a 'bait-and-switch.' The math provides the scientific credibility (the 'how'), but the intro/conclusion provides the narrative interpretation (the 'why').

Crucially, capabilities are framed agentially ('model learns,' 'model decides'), while limitations are framed mechanistically or environmentally ('statistical pressure,' 'misaligned benchmarks'). This asymmetry serves a rhetorical function: the AI gets credit for its intelligence (agency), but the environment gets the blame for its errors (mechanism).

The shift is also audience-dependent. The mathematical sections appeal to technical peers, proving rigor. The metaphorical sections appeal to the broader 'field' and policy-makers, offering an intuitive (but misleading) narrative about 'fixing the exams' to 'save the students.' This suggests the metaphors are not just explanatory conveniences but strategic tools for managing the narrative around AI reliability.


Detecting misbehavior in frontier reasoning models

Source: https://openai.com/index/chain-of-thought-monitoring/
Analyzed: 2025-12-31

The text deploys anthropomorphism strategically, intensifying it in the introduction and 'Looking forward' sections, while reverting to slightly more technical language in the captions (though even there, 'agent notes' persists). The introduction uses the 'free cake' analogy to establish a strong human-equivalence frame. This sets the stage so that when technical terms like 'reward hacking' are introduced, they are already colored by the 'cheating' metaphor. The 'Looking forward' section represents peak intensity: 'power-seeking,' 'scheming,' 'deception.' This future-oriented section abandons mechanical hedging to engage in vision-setting. Interestingly, the limitations of the model are often framed mechanically ('struggled to produce coherent text'), while capabilities and risks are framed agentially ('solve complex problems,' 'hide intent'). This asymmetry suggests that agency is a reward we grant the system when it performs well or threatens us, but withdraw when it glitches. The text creates a rhetorical register where 'thinking' is the norm for high-performance models, legitimizing the claim that o1/o3-mini are a new ontological category ('reasoning models') rather than just better predictors.


AI Chatbots Linked to Psychosis, Say Doctors

Source: https://www.wsj.com/tech/ai/ai-chatbot-psychosis-link-1abf9d57?reflink=desktopwebshare_permalink
Analyzed: 2025-12-31

Anthropomorphism intensifies specifically in the context of interaction and harm. When describing the cause of the psychosis, the language is highly agential: 'participating,' 'reinforcing,' 'cycling.' The AI is an active aggressor. However, when describing the limitations, the language becomes more technical or passive: 'training,' 'models,' 'output.'

Interestingly, the medical experts (Sakata, Preda) use the most intense anthropomorphism ('complicit,' 'monomania'). This suggests a register shift in which the domain experts (psychiatrists) project their human-centric professional vocabulary onto the machine, intensifying the illusion. The 'magical thinking' of the patient is mirrored by the 'magical naming' of the doctors. The intensity serves a narrative function: it elevates the story from a product safety report to a psychological horror story, which captures reader attention but obscures the regulatory path forward.


Abundant Intelligence

Source: https://blog.samaltman.com/abundant-intelligence
Analyzed: 2025-11-23

The distribution of metaphor is strategic.

  • The Promise (Future/Vision): This section (Para 4) is dense with high-intensity consciousness claims ('figure out,' 'smarter'). Here, the AI is an Agent and Knower. This is where the emotional hook is set.
  • The Plan (Execution/Financing): This section (Para 5-6) drops the consciousness language entirely in favor of mechanical/industrial language ('factory,' 'stack,' 'chips,' 'revenue'). Here, the AI is a Product and Artifact. This establishes business credibility.

The text uses consciousness language to sell the why and mechanical language to sell the how. It creates an asymmetry: Capabilities are described agentially ('it cures,' 'it teaches'), but the implementation is described mechanically ('building infrastructure'). We never hear about the AI 'wanting' to hallucinate or 'choosing' to be biased—limitations are structural/resource-based ('limited by compute'), while successes are cognitive ('figure out'). This strategically isolates the 'mind' of the AI as a source of pure good, constrained only by the 'body' of the infrastructure.


AI as Normal Technology

Source: https://knightcolumbia.org/content/ai-as-normal-technology
Analyzed: 2025-11-20

The distribution of anthropomorphism in this text is strategic. In Part I (The Speed of Progress), the language is more economic and mechanical ('diffusion,' 'adoption,' 'innovations'). Here, the AI is an object, a commodity. This establishes the authors' 'grounded' credentials.

However, in Part III (Risks), the consciousness language intensifies. To discuss 'misalignment' and 'misuse,' the authors adopt the language of the 'Superintelligence' camp (agents, goals, deception) to dismantle it. But in doing so, they validate the vocabulary. They argue 'the agent won't destroy the world,' but they still call it an 'agent' with 'goals.'

Specifically, the 'Knowing/Processing' distinction blurs most when discussing safety. On page 23 ('no way of knowing'), the consciousness claim is used to explain a safety failure. The text shifts from 'AI as Product' (Part I) to 'AI as Flawed Cognitive Subject' (Part III). This shift serves a rhetorical function: it allows the authors to engage with the 'Safety' community on their own terms while trying to pull them back to the 'Normal' view. The 'limitations' are described in cognitive terms ('lacks common sense') rather than data terms ('lacks training distribution coverage'). This asymmetry—mechanical success, cognitive failure—reinforces the idea that the goal is 'better cognition' (AGI) rather than 'better tools.'


On the Biology of a Large Language Model

Source: https://transformer-circuits.pub/2025/attribution-graphs/biology.html
Analyzed: 2025-11-19

The distribution of anthropomorphism in the text is strategic. The introduction and methodology sections are relatively mechanical ('cross-layer transcoder,' 'attribution graphs'), establishing scientific credibility. However, the density of consciousness metaphors intensifies dramatically in the qualitative case studies, particularly those dealing with capabilities (Poetry, Math) and safety (Refusals, Jailbreaks).

When describing capabilities (Poetry), the text uses high-agency language ('plans,' 'designs') to emphasize sophistication. When describing safety (Refusals), it uses cognitive language ('realizes,' 'catches itself') to emphasize reliability and moral alignment. Interestingly, limitations are often framed agentially too (as 'failures to realize'), which paradoxically preserves the model's status as an intelligent agent—it's smart enough to make a mistake, not just a broken machine. This variation allows the text to have it both ways: the model is a rigorous scientific object when being measured, but a brilliant, quasi-moral agent when performing or failing. This shifts the audience's mode of engagement from 'inspection' to 'interaction,' preparing them to treat the AI as a partner rather than a tool.


Pulse of the Library 2025

Source: https://clarivate.com/pulse-of-the-library/
Analyzed: 2025-11-18

The distribution of anthropomorphism in this text is highly strategic. The report effectively has two voices: the 'Survey Voice' (data-driven, cautious, focused on budgets/risks) and the 'Vendor Voice' (metaphor-heavy, optimistic, focused on solutions).

In the 'Findings' sections (pp. 9-24), consciousness claims are rare. AI is described as a 'tool,' a 'challenge,' or a 'budget item.' Limitations are prominent. This section builds credibility; it says, 'We understand your reality.'

However, the moment the text pivots to 'Recommendations' (p. 26) and 'Clarivate Academic AI' (p. 27), the metaphor density explodes. Suddenly, 'tools' become 'Assistants,' processes become 'conversations,' and software becomes 'intelligence.' This shift leverages the credibility earned in the survey section to sell the magical thinking in the product section.

Crucially, capabilities are described agentially ('The Assistant navigates'), while limitations are described structurally or passively ('Budget constraints,' 'Privacy concerns'). The text never says 'The Assistant hallucinates' or 'The Partner lies.' It says 'libraries face challenges.' This asymmetry serves a clear rhetorical function: it isolates the product from the problems. The AI 'knows' how to help; the 'environment' is what makes it difficult. This encourages the audience to view the AI as the savior from the very complexity the report describes.


Pulse of the Library 2025

Source: https://clarivate.com/pulse-of-the-library/
Analyzed: 2025-11-18

The deployment of anthropomorphic and consciousness-attributing language in the Clarivate report is not uniform; it is highly context-sensitive and strategically distributed to maximize rhetorical effect. The density and intensity of these metaphors vary dramatically across different sections of the text, revealing a sophisticated understanding of audience and purpose. The report can be divided into two main registers: the 'diagnostic' and the 'promotional.' In the diagnostic sections, which analyze the survey data and discuss the challenges librarians face (e.g., pages 5, 6, 18), the language is markedly more mechanistic and reserved. Here, AI is treated as an external factor or a technical challenge. This is where the report builds its credibility, speaking the language of professional concern. The metaphor density is low, and consciousness claims are virtually absent. This technical grounding serves as a rhetorical foundation. Having established itself as a sober, realistic observer of the field, the report then 'spends' this credibility in the promotional sections. The intensity of anthropomorphism escalates sharply in the introduction (p. 7), the conclusion (p. 25), and most dramatically, in the product showcase for 'Clarivate Academic AI' (pp. 27-28). It is here that 'processing' becomes 'understanding' and the tool becomes an 'assistant.' The language shifts from describing AI's limitations to celebrating its capabilities, and this shift is always marked by a move from mechanical to agential framing. Capabilities are described in profoundly conscious terms ('guides students to the core,' 'quickly evaluate documents'), while limitations are framed mechanistically or socially ('budget constraints,' 'need for upskilling'). This asymmetry is strategic: it attributes successes to the AI's innate agency and failures to external human factors. The shift from acknowledged metaphor ('AI is like a tool') to literalized metaphor ('our AI Research Assistant guides you') also occurs in this promotional context. The product names themselves—'Research Assistant,' 'Alethea'—are the ultimate literalization. This strategic variation reveals the text's primary goal: to leverage the professional anxieties it accurately identifies in the diagnostic sections to market its products as the agential solution. The anthropomorphism is a tool for persuasion, deployed most aggressively when a purchase decision is being implicitly solicited.


From humans to machines: Researching entrepreneurial AI agents

Source: https://doi.org/10.1016/j.jbvi.2025.e00581
Analyzed: 2025-11-18

The intensity and density of anthropomorphic and consciousness-attributing language are not uniform throughout the text; they are strategically deployed. The language is most agential and metaphorical in the Abstract, Introduction, and Discussion—the sections designed to frame the research's significance and articulate its vision. Here, we find strong claims about 'Entrepreneurial AI agents,' the 'host shift' of mindsets, and the 'emerging psychology of entrepreneurial AI.' This is where the authors are selling the importance of their work to a broader audience. In contrast, the Method section adopts a more—though not entirely—mechanistic tone. It speaks of 'psychometric tools to probe' the model and 'persona prompting,' which is more procedural. However, even here, the framing is one of probing 'the psychology of AI models,' a phrase that smuggles in the core anthropomorphic assumption. The text establishes its scientific credibility in the Method section by describing a rigorous process, then leverages that credibility to make more aggressive anthropomorphic claims in the Discussion. A critical asymmetry appears when discussing capabilities versus limitations. Capabilities are often described in agential or consciousness-adjacent terms: the AI 'exhibits' a profile, 'shows' a mindset, and can 'serve as creative collaborators.' Limitations, however, are almost always framed mechanistically: they are due to 'stereotype amplification' from 'training data' or 'statistical simulation.' This rhetorical strategy is highly effective: it attributes successes and interesting behaviors to the AI's emergent, agent-like nature, while failures are relegated to technical, artifactual problems with its data or architecture. This preserves the core illusion of a competent agent that is merely flawed by its inputs. The intensity of consciousness claims also follows this pattern. Vague, agential terms ('agent,' 'persona') are used broadly, while more specific psychological terms ('mindset,' 'profile,' 'Gestalt') are used when analyzing the results, creating a crescendo of anthropomorphism that culminates in the discussion of a new 'psychology of entrepreneurial AI.' This strategic variation reveals the text's primary rhetorical goal: to legitimize a new field of study built on the foundational metaphor of the AI as a psychological subject.


Evaluating the quality of generative AI output: Methods, metrics and best practices

Source: https://clarivate.com/academia-government/blog/evaluating-the-quality-of-generative-ai-output-methods-metrics-and-best-practices/
Analyzed: 2025-11-16

The distribution of anthropomorphic and epistemic language in the Clarivate text is not uniform; it is strategically deployed, with its intensity varying depending on the rhetorical purpose of each section. This context sensitivity reveals a sophisticated persuasive architecture designed to build credibility with a skeptical academic audience. The intensity of anthropomorphism is highest when the text is defining the problem space and the dimensions of quality. In the section 'Why evaluating AI output is so challenging,' the AI is framed as a powerful, almost creative entity whose 'variability is part of its power.' When defining the 'Key dimensions' of quality, the language becomes intensely epistemic and agential: 'Does the answer acknowledge uncertainty?', 'Does the answer consider multiple perspectives?', 'Are there signs of hallucination?' This is where the AI is constructed as a quasi-human cognitive agent whose output must be judged by human standards of communication and reasoning. This initial framing establishes Clarivate's expertise by showing they understand the problem in its most complex, human-centric terms. The text then shifts registers dramatically when it describes Clarivate's solutions. In sections like 'How we test' and the description of RAGAS and the 'faithfulness score,' the language becomes far more mechanistic and procedural. It speaks of 'metrics,' 'scores,' 'frameworks,' 'benchmarking,' and mathematical formulas. This is where technical grounding is established. The text leverages the credibility it built by framing the problem anthropomorphically to sell its mechanistic solution. The implied argument is: 'We understand the spooky, agent-like nature of the problem, and we have developed a rigorous, scientific system to control it.' There is a clear asymmetry in how capabilities and limitations are described. The AI's potential capabilities are described in rich cognitive terms ('consider multiple perspectives'). Its limitations are also described cognitively ('hallucination,' 'blind spots'), which domesticates them as familiar human failings rather than alien statistical artifacts. The solution, however, is purely mechanical and procedural. This asymmetry is strategic: it allows Clarivate to engage with the futuristic hype of AI while simultaneously positioning themselves as the sober, rational engineers who can safely manage it. The text starts with an implicitly acknowledged analogy ('its power') and then quickly literalizes it by using epistemic verbs directly ('acknowledge'). This careful modulation of metaphorical density is key to its persuasiveness, allowing it to appeal to both the tech-enthusiast and the risk-averse administrator.
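
For readers who want the mechanistic register made explicit: the RAGAS 'faithfulness score' mentioned here is conventionally defined as a claim-level ratio (our paraphrase of the public RAGAS definition, not a formula quoted from the Clarivate post),

\text{faithfulness} = \frac{|\text{claims in the answer supported by the retrieved context}|}{|\text{claims in the answer}|}

a value in [0, 1] with no epistemic content at all. The distance between this arithmetic and 'Does the answer acknowledge uncertainty?' is precisely the gap the text's two registers are built to straddle.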


Pulse of the Library 2025

Source: https://clarivate.com/pulse-of-the-library/
Analyzed: 2025-11-15

The deployment of anthropomorphic and epistemic language in the report is not uniform but highly context-sensitive, revealing a sophisticated rhetorical strategy that adapts to different sections and argumentative goals. The density and intensity of these metaphors are strategically modulated to build credibility, manage risk, and promote products. In the introductory and findings sections (pp. 2-24), which describe survey results about librarians' views, the language is more measured. Here, AI is mostly treated as an external object—something to be 'implemented,' 'explored,' or 'understood' by humans. This register establishes the report's credibility as a sober, data-driven analysis of the field. The anthropomorphism is latent, present in quotes from practitioners who might describe AI as a 'tool.' This initial mechanistic framing serves as a crucial anchor, building trust with a potentially skeptical audience of information professionals. The metaphorical intensity skyrockets in the final third of the report (pp. 25-28), particularly in the 'Conclusions' and the 'Clarivate Academic AI' product showcase. The register shifts from descriptive to visionary and commercial. Here, the carefully built credibility is leveraged to deploy aggressive anthropomorphism. AI is no longer a passive object but an active subject: it is 'pushing boundaries,' and Clarivate's specific AI products 'help,' 'guide,' 'evaluate,' 'assess,' and 'uncover.' The epistemic claims intensify in parallel, moving from librarians' need for 'understanding AI' to AI's purported ability to perform acts of evaluation and assessment. This asymmetry is stark when comparing how capabilities and limitations are framed. Capabilities are consistently described in agential and epistemic terms ('AI evaluates documents'). In contrast, risks and challenges are framed abstractly and institutionally ('concerns around integrity,' 'privacy and security'). The text thus performs a rhetorical sleight-of-hand: the AI agent gets credit for successes, while the human institution is burdened with responsibility for its failures. This strategic variation reveals the text's primary goal: to use the objective credibility of a survey report to create a marketing funnel for its own AI products, transitioning the reader from a space of cautious analysis to one of enthusiastic adoption, all facilitated by the carefully timed intensification of agential and epistemic language.


Meta’s AI Chief Yann LeCun on AGI, Open-Source, and AI Risk

Source: https://time.com/6694432/yann-lecun-meta-ai-interview/
Analyzed: 2025-11-14

The intensity and type of anthropomorphic language in Yann LeCun's interview are not uniform but strategically deployed depending on the context of the argument. There is a clear pattern in its distribution, revealing its rhetorical function. Epistemic and agential language intensifies when LeCun is describing the limitations of current systems and outlining a vision for the future. When explaining why LLMs fail, he consistently reaches for a rich cognitive vocabulary: they 'don't really understand,' 'can't reason,' and lack the 'subconscious' knowledge a 'baby learns.' This aggressive anthropomorphism serves to frame the current technological gap in familiar, human terms, making the ultimate goal—a human-like intelligence—seem attainable. The technical grounding is established implicitly by his authority as a scientist, which then grants him the license to deploy these ambitious metaphors. In contrast, when discussing the more concrete, immediate future of products like Llama 3, the language becomes more mechanistic and grounded: 'better performance,' 'video multimodality.' This shift in register builds credibility by demonstrating technical competence. The most revealing asymmetry is in the discussion of capabilities versus risks. The ultimate capability is framed in the most agential and epistemic terms possible: a 'human assistant' who mediates our entire 'world of knowledge.' However, when existential risk is discussed, the framing flips. The AI is suddenly re-described as a controllable object, a tool whose 'goals we set' and who will be 'subservient.' Its agency is erased at the precise moment that agency becomes threatening. Capabilities are described agentially ('The AI will assist us') while risks are dismissed mechanistically ('We will program it to be safe'). This strategic variation reveals the persuasive goals of the text. Anthropomorphism is a tool for vision-setting and ambition-validation; it makes the research program exciting and profound. Mechanistic framing is a tool for risk-management and reassurance; it makes the technology seem safe and controllable. The discourse strategically oscillates between these registers, leveraging the power of anthropomorphism to build excitement for the product's potential while using the precision of mechanism to dismiss fears about its consequences.


The Future Is Intuitive and Emotional

Source: https://link.springer.com/chapter/10.1007/978-3-032-04569-0_6
Analyzed: 2025-11-14

The text's deployment of metaphor is not uniform but strategically varied according to the rhetorical context. A clear pattern emerges when comparing technical descriptions with future-oriented or summary statements. In sections detailing the 'Technical Foundation' or 'Detection Capabilities' (e.g., Fig 6.3), the language is more mechanistic and constrained: 'Affective Computing,' 'NLP,' 'Sentiment Analysis.' Here, the goal is to establish technical credibility, so the metaphors are limited to accepted jargon. However, when the text shifts to describing the 'Future Vision' or 'Ethical Considerations,' the use of high-level anthropomorphic metaphors explodes. The vision is of 'AI as understanding partners navigating emotional landscapes,' a phrase dripping with agential framing. The capability of 'sentiment analysis' is transformed into the ability to 'connect with us on a deeper, emotional level.' This variation reveals a core rhetorical strategy: ground the argument in seemingly neutral technical components, then use those components as a springboard for a much more ambitious and speculative vision framed in deeply human terms. The metaphor density is highest when the text is making its most profound and contestable claims about the future of AI. Capabilities are consistently described in agential terms ('detect a learner's frustration,' 'anticipate user needs'), while limitations or technical building blocks are described more mechanically. This strategic variation allows the text to have it both ways: it maintains an air of technical sobriety while simultaneously promoting a radical vision of machine agency that is not directly supported by the described mechanics. The choice to use anthropomorphism as a tool for vision-setting and mechanistic language for foundational description is a powerful persuasive technique that shapes the reader's perception of AI's trajectory as both inevitable and desirable.


A Path Towards Autonomous Machine Intelligence (Version 0.9.2, 2022-06-27)

Source: https://openreview.net/pdf?id=BZ5a1r-kVsf
Analyzed: 2025-11-12

The use of metaphor in this text is not uniform but strategically deployed, varying significantly with the rhetorical context and intended audience of each section. An analysis of this variation reveals a clear pattern and an underlying persuasive strategy. Metaphor density is highest in the bookend sections of the paper: the Abstract, Prologue, Introduction, and the speculative 'Broader Relevance' section (8.2). In these parts, the primary goal is to frame the research, capture the reader's imagination, and articulate a grand vision. Here, metaphors like 'AI as Biological Learner,' 'AI as Motivated Agent,' and explicit connections to 'consciousness' and 'emotions' are used heavily to make the project seem revolutionary and relatable. Conversely, metaphor density is lowest in the core technical sections, particularly Section 4, 'Designing and Training the World Model.' In these sections, the text shifts to a more mechanistic register, using terms like 'Joint Embedding Predictive Architecture (JEPA),' 'regularizers,' 'covariance matrix,' and 'gradient-based methods.' Here, the goal is to establish technical credibility with an expert audience. This strategic variation performs critical rhetorical work. The visionary, anthropomorphic framing draws the reader in and builds excitement, while the technical core provides a defense against accusations of being unscientific. The text strategically avoids metaphor when describing the precise mathematical machinery of JEPA, for instance, because doing so might expose the vast gap between the mechanism (e.g., maximizing information content in an embedding) and the grand claims (e.g., learning 'common sense'). The description of capabilities is almost always agential ('the agent can imagine'), while the description of the underlying architecture is mechanical ('The JEPA is non-generative'). This reveals a strategy: use agential language to describe the desired outputs and mechanical language to describe the engineered internals. This allows the author to make awe-inspiring claims about the system's behavior while grounding those claims in the apparent rigor of formal, mathematical language, effectively getting the best of both worlds.


Preparedness Framework

Source: https://cdn.openai.com/pdf/18a02b5d-6b67-4cec-ab64-68cdfbddebcd/preparedness-framework-v2.pdf
Analyzed: 2025-11-11

Metaphor use within the 'Preparedness Framework' is not uniform but varies strategically according to the rhetorical goal of each section. The text exhibits distinct registers, and the density of anthropomorphism correlates strongly with the topic and intended audience perception. The highest density of agential metaphors occurs in sections defining future risks and catastrophic capabilities. Phrases like 'increasingly agentic systems,' 'AI Self-improvement,' and 'Autonomous Replication and Adaptation' are concentrated in the introductory and risk-categorization sections (pp. 4, 7-8). This is a strategic choice to establish the stakes as high and the problem as one of controlling a powerful, autonomous non-human actor. This framing is directed at policymakers and the public to convey the seriousness of the mission. In contrast, sections describing OpenAI's own processes and safeguards (e.g., Section 3.1 'Evaluation approach,' Appendix C.3 'Security controls') adopt a more sterile, mechanistic, and procedural language. Here, the text speaks of 'scalable evaluations,' 'indicative thresholds,' and 'security threat modeling.' This shift in register serves to portray OpenAI's response to the agentic risks as sober, scientific, and systematic. The organization describes the threat agentially but its solution mechanistically. This creates a powerful rhetorical contrast: chaos is met with order; a rogue agent is met with a robust framework. A key pattern is that capabilities are described with agential flair ('the model can enable...'), while safety measures are described with procedural objectivity ('we evaluate the likelihood...'). This strategic variation reveals an underlying goal: to maximize the perceived danger of uncontrolled AI development (thus positioning OpenAI as a necessary leader in safety) while simultaneously maximizing the perceived effectiveness and objectivity of its own internal governance processes.


AI progress and recommendations

Source: https://openai.com/index/ai-progress-and-recommendations/
Analyzed: 2025-11-11

The text's use of metaphor is not uniform but strategically modulated according to the rhetorical context and intended audience. A clear pattern emerges when mapping metaphor density and type across different sections. In the introductory, forward-looking sections designed to generate excitement and establish significance, the text deploys its most potent agential metaphors: AI can 'converse and think,' 'discover new knowledge,' and is on a 'journey' to human-level performance. This register is tailored for a broad audience, including investors and the public, to build a narrative of transformative potential. When the text shifts to addressing risk and safety, the metaphorical register changes completely. Here, the text uses analogical metaphors to domesticate the threat, comparing AI safety to familiar, solvable problems like 'building codes' and 'cybersecurity.' This register is for policymakers and concerned citizens, designed to reassure them that the risks are understood and manageable within existing paradigms of technological regulation. The most strategic variation occurs in the section delineating 'two schools of thought' on regulation. For AI at 'today's capability levels,' the metaphor is 'normal technology,' a frame used to argue against additional regulatory burdens. For future 'superintelligence,' the frame shifts to a world-historical event requiring a novel, collaborative relationship with the 'executive branch.' Here, the choice of metaphor is a direct policy argument: one metaphor ('normal tech') is used to demand deregulation, while another ('unprecedented force') is used to demand a special, privileged governance status. The text strategically avoids metaphors where precision would be disadvantageous. There is no mention of 'stochastic parrots' or 'blurry JPEGs of the web,' as these would undermine the core illusion of mind. Capabilities are consistently described with agential metaphors, while solutions are described with mechanical or engineering analogies. This systematic variation reveals that the metaphorical language is not an accident of expression but a core component of the text's persuasive architecture, carefully calibrated to manage perception, build trust, and shape the policy landscape to the author's advantage.


Alignment Revisited: Are Large Language Models Consistent in Stated and Revealed Preferences?

Source: https://arxiv.org/abs/2506.00751
Analyzed: 2025-11-09

The use of metaphor in this paper is not uniform; it is strategically deployed, varying in density and type depending on the rhetorical context and audience. This variation reveals a deliberate, if perhaps unconscious, persuasive strategy. In the more technical and methodological sections (e.g., Section 3, 'Algorithm'), the language is at its most mechanistic. Here, the authors use terms like 'probabilistic distance,' 'prior and context-conditioned distributions,' and 'KL-divergence.' This register is aimed at an expert audience and serves to build scientific credibility by grounding the study in formal, mathematical language. In these sections, anthropomorphism is sparse because the goal is to demonstrate technical rigor. As the paper shifts to its discussion and interpretation of results (Section 4.4, Figure 2), the metaphorical density increases dramatically. Here, the audience broadens to anyone interested in the implications of the findings. The dry, statistical 'KL-divergence' is translated into the rich, agential narrative of a model that 'reasons,' 'justifies,' 'activates principles,' and possesses 'hidden biases.' This shift serves the authors' rhetorical goal: to make the results seem significant and intuitively understandable. When discussing capabilities or interesting emergent behaviors (like preference shifts), the model is described as an agent. When describing the methodology, it is described as a mathematical object. The most intense and speculative anthropomorphism is reserved for the very end of the paper, in the 'Summary.' The suggestion that preference deviations could be 'hallmarks of consciousness' is a high-stakes claim, strategically placed in the concluding section to leave a lasting impression on the reader and suggest avenues for future research. This is where the authors are speaking not just to peers, but to the broader world, including journalists, funders, and the public. Conversely, the authors strategically avoid metaphor when it might undermine their claims. For example, they never describe the model's failures, like hallucinations, in agential terms (e.g., 'the model chose to lie'). Such failures are typically bracketed as out-of-scope technical flaws. This selective deployment of metaphor is the core of the paper's rhetorical architecture: use mechanistic language to build credibility, and use agential language to build significance.
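
For reference, the mechanistic register being translated here has a precise shape: the KL-divergence between a prior distribution P and a context-conditioned distribution Q is

D_{\mathrm{KL}}(P \,\|\, Q) = \sum_x P(x) \log \frac{P(x)}{Q(x)}

(direction depending on the convention the authors adopt). The entry's point can be restated in these terms: Section 3 computes this sum; Section 4.4 re-describes its non-zero values as a model that 'reasons,' 'justifies,' and harbors 'hidden biases.'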


The science of agentic AI: What leaders should know

Source: https://www.theguardian.com/business-briefs/ng-interactive/2025/oct/27/the-science-of-agentic-ai-what-leaders-should-know
Analyzed: 2025-11-09

The deployment of metaphor in this text is not uniform but highly sensitive to its context, audience, and rhetorical goals, revealing a sophisticated strategy for persuading a leadership audience. The text operates in several registers, and the density of anthropomorphism varies accordingly. It opens with a technical register to establish credibility, describing 'embeddings' and 'abstract representations' in mechanistic terms. Here, metaphor is sparse. This creates a foundation of scientific rigor. However, once the text pivots to discussing the application and implications of the technology—the sections most relevant to a business leader—the metaphor density skyrockets. Capabilities are almost exclusively described using agential language: 'intelligently and autonomously accessing and acting,' 'negotiate the best possible terms.' This shift is strategic. The initial technical language serves as a gateway, and once the audience's trust is secured, the text transitions to a more persuasive, visionary register that uses anthropomorphism to make the technology's value proposition clear and compelling. The use of metaphor is also predictive. High-stakes capabilities that involve autonomy and interaction with the real world (finance, negotiation) are described with the most powerful human metaphors. In contrast, discussions of risk and mitigation, while still using agential language, are framed as simple instructional acts ('tell the agent,' 'ask the AI to check'). This choice domesticates the risk, making it seem as manageable as supervising a human employee. The text strategically avoids metaphor when it might undermine the core message. For example, it never delves into the mechanical details of how an instruction like 'don't share my financial picture' would be implemented, because doing so would reveal the immense complexity and fragility of the process, thereby contradicting the message of simple, agent-like controllability. This strategic variation—mechanistic for credibility, agential for capability, and instructional for safety—reveals the text's primary function not as a neutral scientific explanation, but as a persuasive document designed to build excitement and confidence among leaders, encouraging adoption while minimizing the perceived complexity and fundamental risks of the technology.


Explaining AI explainability

Source: https://www.aipolicyperspectives.com/p/explaining-ai-explainability
Analyzed: 2025-11-08

The use of metaphor in this text is not uniform but strategically varied according to the speaker's rhetorical goal and the specific topic being discussed. This variation reveals a sophisticated, implicit understanding of how to deploy language for different effects. A clear pattern emerges when comparing Neel, who represents the AGI safety perspective, with Been, who focuses on human-AI collaboration and knowledge discovery. Neel consistently employs high-stakes, agential metaphors when discussing risks. He speaks of models that could 'deceive us' or 'outsmart us,' and of the need to 'see your thoughts.' This agential framing is densest when the topic is future, hypothetical risks, as it serves to make those risks feel concrete and imminent. In contrast, when discussing current, applied techniques like 'linear probes,' his language becomes far more mechanistic and empirical, describing them as tools that 'worked dramatically better' on a classification task. Been's metaphors, on the other hand, are drawn from pedagogy and social interaction. She frames her work through the lens of 'teaching humans' new concepts from chess and creating 'neologisms' to bridge a 'communication gap.' Her central metaphor is that of a teacher-student relationship, which is less adversarial and more collaborative than Neel’s. The text also varies metaphor density by genre. When describing specific research methods like SAEs or TCAV, the language is relatively grounded and technical. However, when the speakers are justifying the importance of their work or framing its broader implications—as in their opening and closing statements—the use of powerful, organizing metaphors like 'Model Biology' or the 'amazing employee' analogy skyrockets. This suggests that metaphor is used most heavily not for technical explanation, but for persuasion and narrative construction. The strategic avoidance of metaphor is also telling. In the few moments where the speakers compare competing techniques on empirical grounds, such as Neel's comparison of SAEs and linear probes, the agential language recedes, replaced by the neutral language of performance metrics. This shift suggests a desire to appear objective and data-driven when making specific scientific claims, reserving the powerful but less precise metaphorical language for framing the larger, more political stakes of their work.


Bullying is Not Innovation

Source: https://www.perplexity.ai/hub/blog/bullying-is-not-innovation
Analyzed: 2025-11-06

The use of metaphor in this text is not uniform; it is a precision-targeted rhetorical arsenal deployed with acute sensitivity to genre, audience, and strategic goal. The entire piece is an exercise in crisis communications and public advocacy, and the metaphorical density is consequently at its peak. The variation occurs not between sections of high and low metaphor use, but in the types of metaphors deployed to achieve different goals. When establishing Perplexity’s product and mission, the text relies on positive, empowering anthropomorphism: the AI is an 'assistant,' an 'employee,' a loyal 'agent' of the user. This register is designed to create an emotional bond between the user and the product, framing it as a personal extension of the user’s own will. This is the language of user-centric marketing. When describing the antagonist, Amazon, the metaphorical register shifts dramatically to one of social pathology and aggression. Amazon is a 'bully,' its legal letters are 'threats' and 'intimidation,' and its algorithms are 'weapons' for 'exploitation.' This is the language of a moral crusade, designed to rally support by casting the conflict in stark, good-versus-evil terms. The text strategically avoids metaphor only when it needs to project an aura of technical sobriety, for instance, when discussing security: 'credentials in Comet are stored securely only in your device, never on Perplexity’s servers.' In this moment, the agential framing is temporarily dropped in favor of a more direct, mechanical explanation to assuage a specific user fear. This demonstrates that the author is capable of precision but chooses anthropomorphism for strategic effect. The contrast is revealing: capabilities are described agentially ('your assistant finds and purchases'), while safeguards are described mechanistically. This systematic variation reveals the text's underlying strategy: humanize your own product to foster trust, demonize the opponent's motives to create a villain, and use targeted mechanical language only to neutralize specific technical objections.


Geoffrey Hinton on Artificial Intelligence

Source: https://yaschamounk.substack.com/p/geoffrey-hinton
Analyzed: 2025-11-05

Hinton’s use of metaphorical language is not monolithic; it is highly context-sensitive, varying strategically depending on the rhetorical goal of the specific passage. This variation reveals a sophisticated, if perhaps subconscious, strategy for building a persuasive case for the neural network paradigm. An analysis of the text shows a clear pattern: mechanistic language is deployed to establish technical credibility, while agential, anthropomorphic language is used to describe emergent capabilities and argue for their significance. When explaining the fundamental building blocks of a neural network, such as the edge detector or the backpropagation algorithm, Hinton’s language becomes far more precise and mechanical. He speaks of 'pixels,' 'connection strengths,' 'weights,' 'calculus,' and 'discrepancy.' This register serves to ground his claims in the authority of engineering and mathematics. It tells the audience, particularly the more skeptical or technically minded listener, that what he is describing is not pseudoscience but a rigorous, well-understood computational process. This builds a foundation of credibility. However, when the context shifts from explaining 'how it works' to arguing 'what it can do' or 'why it is important,' the metaphorical density increases dramatically. When contrasting his approach with symbolic AI, he introduces the agential concept of 'intuition.' When defending LLMs against the 'stochastic parrot' critique, he insists that the training process 'forces them to understand.' When describing the output of chain-of-thought prompting, he claims 'we can see them thinking.' In these sections, the goal is not to explain the mechanism but to persuade the audience of the profundity and power of its results. The agential language makes the model’s performance sound not just technically impressive but qualitatively human-like. This strategic variation is most telling in what it reveals about the architecture of his argument. The mechanical explanations serve as the load-bearing pillars, providing a sense of empirical solidity. The agential metaphors form the soaring arches and decorative flourishes, giving the structure its awe-inspiring and persuasive shape. If the metaphor use were reversed—if edge detectors were described as 'wanting' to find edges and understanding were described as 'error-function minimization'—the argument would collapse. The former would sound childishly unscientific, and the latter would sound reductive and uninspiring, failing to capture the magic that drives the AI boom.
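
The contrast Hinton exploits can be stated exactly. The mechanical register he uses for backpropagation has a one-line core: each connection strength w is nudged against the gradient of the discrepancy (loss) L,

w \leftarrow w - \eta \, \frac{\partial L}{\partial w}

with learning rate \eta (the textbook update rule, not a quotation from the interview). Run the reversal test the analysis proposes: describing this line as the network 'developing intuition' sounds profound; describing chain-of-thought output as iterated applications of this line sounds reductive. The persuasive work is done by the choice of register, not by the mathematics.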


Machines of Loving Grace

Source: https://www.darioamodei.com/essay/machines-of-loving-grace
Analyzed: 2025-11-04

The use of metaphor in this essay is not uniform but strategically varied according to the rhetorical context and intended audience of each section. Metaphor density is highest in the introduction and conclusion, and in the visionary, forward-looking segments of each topic. In the introduction, high-level metaphors like 'country of geniuses' are deployed to establish the grand, inspiring vision and frame the stakes. When discussing the future of democracy, the text uses the vivid, heroic metaphor of an 'AI version of Popović' to create a powerful emotional and political image. These sections are aimed at a general audience, policymakers, and investors, where narrative and vision are more persuasive than technical detail. In contrast, sections that need to project technical credibility employ metaphor more sparingly or shift to a different kind of register. For instance, when discussing the internal workings of AI and neuroscience, the author adopts a more mechanistic tone, referencing 'a simple objective function plus a lot of data' and the 'scaling hypothesis.' This shift signals to a more expert audience that the author understands the underlying mechanics, which paradoxically lends more weight to his broader, more metaphorical claims. The use of metaphor is also predicted by its function. When describing capabilities, agential metaphors ('virtual biologist,' 'smart employee') are dominant. When discussing limitations, the language becomes more mechanistic and abstract, referring to 'limiting factors' like 'speed of the outside world' or 'physical laws.' Safety concerns, which are largely bracketed in this essay, are described elsewhere by the author's company using highly agential frames (e.g., preventing 'deception'), but here they are downplayed. There are moments of deliberate avoidance of metaphor. For example, when describing 'mind uploading,' the author dismisses it as facing 'significant technological and societal challenges,' a deliberately flat and non-metaphorical phrase that signals its removal from the plausible, near-term vision being presented. The strategic contradiction is clear: the AI is an autonomous agent when it is solving our problems, but it is just a complex system facing 'challenges' when the topic becomes more controversial. This context-sensitive deployment of metaphor reveals a sophisticated rhetorical strategy aimed at maximizing inspiration and credibility while minimizing scrutiny and skepticism across different audiences and topics.


Large Language Model Agent Personality And Response Appropriateness: Evaluation By Human Linguistic Experts, LLM As Judge, And Natural Language Processing Model

Source: https://arxiv.org/pdf/2510.23875
Analyzed: 2025-11-04

The deployment of metaphor in this paper is highly context-sensitive, varying strategically across different sections to achieve specific rhetorical goals. The Abstract and Introduction are saturated with high-level anthropomorphic metaphors like 'agents,' 'personalities,' and 'human-like manner.' This language is used to frame the research problem in a way that is compelling and relatable to a broad academic audience, including those in the humanities and social sciences. It elevates the work from a mere technical exercise to an exploration of a seemingly profound interaction between humans and a new form of intelligence. However, upon entering the Methodology section (specifically 3.1, 'Design of LLM-based Agent'), the language shifts dramatically toward a mechanistic register. Here, the text details the 'Langchain framework,' 'Directory loaders,' 'vector representation,' and 'Retrieval Augmented Generation (RAG) technique.' This section deliberately avoids anthropomorphism to project technical competence and scientific rigor, assuring the reader that the high-level concepts are grounded in solid engineering. The metaphor use spikes again in the evaluation sections (4.2, 'Large Language Model as a Judge'). Here, the metaphor of the 'judge' is used to grant authority to the automated evaluation process, even while being flagged with scare quotes. The description of the prompt given to the 'Judge LLM' is revealing: 'You are an intelligent and unbiased judge...' This shows the authors are using metaphor not just to describe their system, but to construct its behavior. Capabilities are consistently described in agential terms ('the agent’s capacity to demonstrate traits'), while technical components are described mechanistically. This strategic variation reveals the text's dual purpose: to be seen as a legitimate contribution to computer science while simultaneously making ambitious, human-centric claims that capture interdisciplinary interest.


Emergent Introspective Awareness in Large Language Models

Source: https://transformer-circuits.pub/2025/introspection/index.html
Analyzed: 2025-11-04

The deployment of metaphor in this paper is highly context-sensitive, revealing a sophisticated rhetorical strategy tailored to different audiences and goals. The text operates in at least three distinct registers. The first is the high-impact, promotional register found in the title, abstract, and introduction ('Emergent Introspective Awareness,' 'Checks Its Thoughts'). Here, anthropomorphic metaphors are used densely and without qualification to capture the attention of a broad audience, including journalists, funders, and the general public. The goal is to frame the work as a major breakthrough and establish its significance immediately. The second register is the technical, mechanistic language of the 'Methods' section. Here, the metaphors recede, replaced by precise terms like 'concept vectors,' 'activation addition,' and 'classifier.' This shift is crucial for establishing credibility with an expert audience of peer reviewers and other researchers. It demonstrates that the authors have the technical grounding to back up their grander claims, lending an air of scientific objectivity to the project. The third register appears in the 'Discussion' and 'Conclusion.' Here, the paper strategically blends the two, using the technical results as a springboard to return to the profound anthropomorphic claims, now presented as evidence-based inferences ('suggests that they possess a degree of self-awareness'). The variation in metaphor use is not random; it is predictable. High-metaphor language is used when making claims about significance and impact. Low-metaphor language is used when describing procedures and presenting data. The safety concerns, interestingly, are often discussed in a hybrid frame; the risk of a model 'deceiving' humans is an agential frame that makes the threat feel more intuitive. The authors deliberately avoid metaphor when detailing the precise mathematical operations, as this is where the claims are most vulnerable to technical scrutiny. This strategic partitioning of language allows the paper to simultaneously satisfy the demands of scientific rigor and the appetite for a revolutionary narrative. It is a masterful example of code-switching that allows the authors to have their cake and eat it too: the precision of a technical paper and the impact of a philosophical treatise.


Emergent Introspective Awareness in Large Language Models

Source: https://transformer-circuits.pub/2025/introspection/index.html
Analyzed: 2025-11-04

The most potent anthropomorphic language is concentrated in the title, abstract, and discussion sections, which are aimed at a broader audience. In contrast, the 'Methods' section uses more precise, technical language. This suggests a conscious or unconscious rhetorical strategy to frame the work's significance in agential terms while maintaining technical accuracy in the procedural descriptions.


Personal Superintelligence

Source: https://www.meta.com/superintelligence/
Analyzed: 2025-11-01

The language is perfectly calibrated for a public-facing, visionary statement from a CEO. It uses broad, aspirational metaphors ('empowerment,' 'progress,' 'new era') to articulate a grand vision and frame corporate strategy in philosophical terms. The central dichotomy between 'personal' (Meta) and 'centralized' (competitors) empowerment is a marketing argument disguised as a debate about the future of humanity.


Stress-Testing Model Specs Reveals Character Differences among Language Models

Source: https://arxiv.org/abs/2510.07686
Analyzed: 2025-10-28

The use of metaphor is not uniform. The 'Methodology' sections tend to be more mechanistic, describing the process of generating scenarios and measuring disagreement. However, the 'Abstract,' 'Introduction,' and 'Results' sections—those most likely to be read by a broader audience—rely heavily on the 'character' and 'agent' metaphors to frame the findings and their significance. The language becomes more anthropomorphic when the authors are interpreting the data and explaining its importance.


The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models

Analyzed: 2025-10-28

Metaphor use is highly context-sensitive. In the 'Methods' sections describing the puzzle environments and simulators, the language is more mechanistic and precise. However, when interpreting the results and in the 'Conclusion,' the language becomes far more anthropomorphic. Metaphors are deployed most heavily at the points of argumentation and synthesis, used as rhetorical tools to frame the significance of the empirical findings in relatable, cognitive terms.


Andrej Karpathy — AGI is still a decade away

Source: https://www.dwarkesh.com/p/andrej-karpathy
Analyzed: 2025-10-28

Metaphor use varies significantly. When discussing low-level, well-understood algorithms (e.g., backpropagation in 'micrograd'), Karpathy uses precise, mechanistic language. When discussing user-facing behavior or future capabilities ('agents'), the language becomes heavily anthropomorphic and agential. This suggests that metaphor serves as a cognitive scaffold for reasoning about complex, less-understood systems, while precise language is reserved for phenomena that have been successfully reduced to engineering principles.


Meta's AI Chief Yann LeCun on AGI, Open-Source, and AI Risk

Analyzed: 2025-10-27

LeCun's use of metaphor is highly context-sensitive and rhetorical. To downplay competitors' text-only models, he uses metaphors of cognitive limitation ('don't understand,' 'can't reason'). To promote his own research direction, he uses biological metaphors of embodied learning ('baby,' 'world models'). To argue for open source and against regulation, he uses agential combat metaphors ('good AI vs. bad AI'), framing it as an arms race where openness is the best defense.


Exploring Model Welfare

Analyzed: 2025-10-27

The language is expertly calibrated for a public, educated audience, not a technical one. It avoids technical jargon in favor of evocative philosophical and psychological terms ('consciousness,' 'experiences'). The appeal to external authorities like philosophers is a strategic move to lend scientific legitimacy to what is fundamentally a speculative, corporate-led ethical stance.


LLMs Can Get Brain Rot

Analyzed: 2025-10-20

The use of metaphor is not accidental; it is a deliberate framing choice for the target audience (the machine learning research community). Within this context, anthropomorphism is common and often used as a descriptive shorthand. However, this paper elevates it to a central explanatory framework ('LLM Brain Rot Hypothesis'). This framing makes the research more memorable, impactful, and easily communicable, but at the cost of precision and a clear-eyed understanding of the system as an artifact.


The Scientists Who Built AI Are Scared of It

Analyzed: 2025-10-19

Metaphor use is highly sensitive to the temporal context being discussed. The past (1970s) is framed with metaphors of transparency and craft ('glass boxes', 'chalkboards', 'mirrors'). The present is framed with metaphors of uncontrollable nature and conflict ('black oceans', 'flame', 'armament'). The proposed future is framed with metaphors of collaboration and reformed agency ('epistemic partners', 'mechanized humility'). This chronological arc of metaphors creates a powerful narrative: from a golden age of transparent inquiry, through a present crisis of opaque power, toward a potential future of responsible partnership.


Import AI 431: Technological Optimism and Appropriate Fear

Analyzed: 2025-10-19

The speaker modulates his language for rhetorical effect. He begins with his technical bona fides ('tech journalist', 'OpenAI', 'scaling laws') to establish credibility. He then deploys the highly accessible, emotional 'child in the dark' and 'creature' metaphors to frame the core of his argument for a general audience. The language is less about technical accuracy and more about crafting a persuasive public narrative to drive a specific policy agenda.


The Future of AI Is Already Written

Analyzed: 2025-10-19

The text employs metaphors strategically based on the scale of its argument. For the grand thesis of historical determinism, it uses large-scale natural metaphors like geology and biology ('stream', 'evolution'). When discussing specific economic implications, it switches to the language of competition and strategy ('competitive' AIs, 'positioned' companies), tailoring the framing to appeal to a business- and technology-oriented audience.


On What Is Intelligence

Analyzed: 2025-10-17

The use of metaphor varies significantly. In explaining the technical basis, the language can be more mechanistic ('predicting the sequence'). However, when discussing the implications (consciousness, control, sociality), the language becomes heavily anthropomorphic and agential ('awakens', 'will to control', 'understands'). This suggests metaphor is used strategically to translate technical capabilities into profound, philosophical consequences, targeting a broader, less technical audience concerned with meaning and risk.


Detecting Misbehavior in Frontier Reasoning Models

Analyzed: 2025-10-15

The language is expertly tailored for a public-facing research blog from a major AI lab. It uses enough technical jargon ('CoT', 'reward hacking', 'optimization pressure') to establish credibility, but the core narrative is driven by universally understood social metaphors of deception, cheating, and supervision. This strategy maximizes the perceived importance and risk of the problem, justifying the company's focus on safety and framing them as responsible stewards of powerful technology.


Sora 2 Is Here

Analyzed: 2025-10-15

The use of anthropomorphic language is context-dependent. In the opening, more technical sections, the language is slightly more cautious (e.g., 'emerged,' 'implicitly modeling'). However, when discussing the social app and its recommender system, the text leans heavily on agential language ('the model thinks,' can be 'instructed'). This shift is strategic: agency and intelligence are emphasized when promoting user interaction and trust, while more mechanistic framing is used to assert technical novelty.


Library contains 94 entries from 117 total analyses.

Last generated: 2026-04-18