Deconstruct: Tracing the ongoing emergence of human-like reasoning in Large Language Models

About
Analysis Metadata

About This Analysis

This document applies the AI Literacy Deconstructor framework—a rewriting experiment that tests whether anthropomorphic AI discourse can be translated into strictly mechanistic language while preserving the phenomena described.

The core question is not "Is this metaphor bad?" but rather: "Does anything survive when we remove the metaphor?"

Each anthropomorphic frame receives one of three verdicts:

✅ Preserved: Translation captures a real technical process
⚠️ Reduced: Core survives, but accessibility or nuance is lost
❌ No Phenomenon: The metaphor was constitutive—nothing mechanistic underneath

All findings and summaries below were generated from detailed system instructions provided to a large language model and should be read critically as interpretive outputs—not guarantees of factual accuracy or authorial intent.

Overall Verdict - Does anything survive when the metaphor is removed?

⚠️ Partially—significant restructuring required

The empirical findings and the critique of scale survive the translation perfectly, gaining technical precision. However, the theoretical explanation of the findings (Decontextualization Bias explained as a cognitive style and agents possessing toolkits) is so deeply intertwined with agential and cognitive metaphors that a purely mechanistic version requires restructuring the entire theoretical framework from a cognitive-science paradigm to a strict natural language processing and statistical engineering paradigm.

Part 1: Frame-by-Frame Analysis

About this section

For each anthropomorphic pattern identified in the source text, we perform a three-part analysis:

1 Narrative Overlay: What the text says—the surface-level framing

2 Critical Gloss: What's hidden—agency displacement, metaphor type, how/why slippage

3 Mechanistic Translation: The experiment—can this be rewritten without anthropomorphism?

The verdict reveals whether the phenomenon is real (Preserved), partially real (Reduced), or exists only in the framing (No Phenomenon).

Frame 1: The Cognitive Toolkit

Narrative Overlay

"suggesting that pragmatic reasoning is still an emerging ability in the cognitive toolkit of artificial systems."

Magic Words: cognitive toolkit · emerging ability · artificial systems

Illusion Created: This framing invites the reader to imagine that an artificial system possesses an internal, evolving mental container analogous to the biological brain. It suggests that capabilities like pragmatic reasoning are organic, self-developing faculties that mature over time within the system's toolkit, rather than being the direct result of deliberate human engineering, algorithmic design, and dataset curation. It obscures the purely mathematical and feed-forward nature of the system.

Critical Gloss

Metaphor Type: Model as Mind (consciousness projection)

Dimension	Classification	Evidence
Acknowledgment	❌ Naturalized (presented as literal)	The authors present the cognitive toolkit as a direct, unquoted characteristic of artificial systems, treating it as a literal architectural feature.
How/Why	Why (Agential)	It implies the model has an internal mental space containing tools it chooses to deploy. Mechanistically, this refers to the capacity of the model's parameterized matrix multiplications to predict tokens that align with contextual pragmatic implicatures under specific prompting conditions.

Agency Displacement: This construction displaces the human engineers, data annotators, and reinforcement learning designers who decide how training objectives are structured and whether pragmatic patterns are reinforced. The toolkit is presented as a self-contained, self-evolving mental container rather than a human engineering artifact that is built, compiled, and deployed by specific tech corporations.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ NO TRANSLATION POSSIBLE—no mechanistic process identified beneath this framing. ✎ᝰ

❌ Verdict: No Phenomenon (metaphor was constitutive)

The concept of a cognitive toolkit inside a neural network does not correspond to any physical or computational reality. There are only layers, nodes, weights, and mathematical activation functions. The metaphor is purely constitutive, creating a biological illusion where only matrix algebra exists.

Show more frames...

Frame 2: Linguistic Agents

Narrative Overlay

"At the same time, it has been argued that LLMs, while undeniably impressive linguistic agents, have cognitive toolkits that remain fundamentally different..."

Magic Words: linguistic agents · impressive · cognitive toolkits

Illusion Created: This language frames the LLM as an active, autonomous subject (an agent) with its own intentionality and communicative presence. It encourages the reader to view the model as a social participant in language, rather than a passive mathematical function that maps input token distributions to output token distributions based on static, human-curated datasets.

Critical Gloss

Metaphor Type: Model as Agent (autonomous decision-maker)

Dimension	Classification	Evidence
Acknowledgment	⚠️ Conventional Shorthand (field standard)	The phrase linguistic agents is widely used in NLP literature as a standard term to describe interactive text generation software.
How/Why	How (Mechanistic)	Despite the agential language, it describes the model's physical performance: receiving text sequences as input and generating text sequences as output.

Agency Displacement: It obscures the system's nature as an automated feed-forward mathematical function trained on human-produced text. It frames the system as the active agent of language production, sidelining both the human creators of the model and the human writers of the massive training corpora that are actually being reflected.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ At the same time, it has been argued that LLMs, while generating complex text sequences, process text using mathematical structures that remain fundamentally different... ✎ᝰ

✅ Verdict: Preserved (phenomenon is real)

The translation fully preserves the technical observation that LLM architectures do not possess the structural or cognitive correlates of human language processing, despite generating seemingly sophisticated text. No agential phenomenon is lost; precision is gained.

Frame 3: Struggle with Meaning

Narrative Overlay

"they nonetheless struggle with meaning-related components of language"

Magic Words: struggle · meaning-related

Illusion Created: This framing portrays the language model as an active, striving entity, similar to a human student or language learner who is putting in mental effort but finding a task difficult. It implies a conscious attempt to comprehend and an internal experience of difficulty, rather than a structural limitation where the mathematical architecture is simply incapable of mapping symbols to real-world referents.

Critical Gloss

Metaphor Type: Model as Student (learning metaphor)

Dimension	Classification	Evidence
Acknowledgment	❌ Naturalized (presented as literal)	The authors describe the models as struggling with meaning without scare quotes or hedges, treating the difficulty as an active behavioral state.
How/Why	Mixed (both elements)	Struggle implies internal effort (agential), but meaning-related components refers to syntactic vs. semantic performance differences in the generated text (mechanistic).

Agency Displacement: Obscures that the model's failure is a direct result of its training paradigm (next-token prediction on text statistics) which lacks sensory-motor grounding or intentional communication goals, both of which are designed by humans.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ their outputs do not systematically align with target semantics when prompts require contextual or non-literal interpretation ✎ᝰ

⚠️ Verdict: Reduced (core survives, nuance lost)

The translation captures the actual scientific distinction: syntactic success vs. semantic failure. Removing struggle shows that the model is not trying and failing; rather, its mathematical optimization simply does not construct real-world referential relations. However, the vivid shorthand of struggle is lost, making the text less accessible.

Frame 4: Decontextualization Bias

Narrative Overlay

"the reasoning abilities of LLMs are affected by what we term a Decontextualization Bias: a tendency to rely on formal or literal aspects of linguistic input"

Magic Words: reasoning abilities · affected by · tendency to rely

Illusion Created: This terms the model's structural limitations as a human-like cognitive bias. It creates the illusion that the machine has a psychological tendency or preference to ignore context and choose literalness, rather than the reality that the model's architecture and text-only training data mathematically constrain it from computing non-linguistic contextual variables.

Critical Gloss

Metaphor Type: Model as Mind (consciousness projection)

Dimension	Classification	Evidence
Acknowledgment	✅ Acknowledged (explicit metaphor)	The authors introduce this with what we term, signaling that they are defining a new conceptual framework or analogy.
How/Why	How (Mechanistic)	It describes the mathematical property where token prediction is heavily weighted by high-probability literal associations in the training corpus, without conditional probability adjustments for non-literal contextual variables.

Agency Displacement: The bias is presented as a property of the model's reasoning abilities. In reality, the lack of contextual integration is a structural limitation of the transformer's attention mechanism operating over context windows without real-world grounding, designed by human researchers.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ LLM output is shaped by what we term a Decontextualization Bias: a computational property where token generation is dominated by high-probability surface forms and literal training statistics ✎ᝰ

✅ Verdict: Preserved (phenomenon is real)

The mechanistic translation perfectly preserves the scientific core: models perform next-token prediction based on explicit, highly frequent literal patterns because they lack the high-order parameters or training configurations required to compute complex pragmatic variables.

Frame 5: The Cooperative Communicator

Narrative Overlay

"humans exhibit the opposite tendency: they consistently draw pragmatic inferences by adhering to Gricean principles of cooperative communication"

Magic Words: cooperative communication · adhering to Gricean principles

Illusion Created: By directly contrasting LLMs with humans who adhere to cooperative principles, this language implies that LLMs are conversational partners capable of cooperating, but failing to do so. It invites the reader to project social expectations, communicative intent, and cooperative morality onto a non-conscious system that lacks any social presence or understanding of cooperation.

Critical Gloss

Metaphor Type: Model as Person (social/moral actor)

Dimension	Classification	Evidence
Acknowledgment	❌ Naturalized (presented as literal)	The contrast is presented as a direct scientific comparison of human conversational behavior versus LLM output distribution, without noting that models have no social existence.
How/Why	Why (Agential)	It attributes human motivation (cooperation) as the explaining variable for communicative alignment, implying that models are failing to cooperate because they lack this intention.

Agency Displacement: Shifting the blame to the model's failure to cooperate covers up the fact that model deployment occurs without any functional capacity for cooperative intent—a deliberate design choice made by commercial and academic entities.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ NO TRANSLATION POSSIBLE—no mechanistic process identified beneath this framing. ✎ᝰ

❌ Verdict: No Phenomenon (metaphor was constitutive)

LLMs cannot adhere to Gricean principles of cooperative communication because they do not have intentions, beliefs, or desires to communicate. There is no mathematical equivalent of cooperative intent inside a neural network; there is only conditional probability computation. The cooperative metaphor is purely constitutive.

Frame 6: The Adopted Strategy

Narrative Overlay

"A second group... instead adopted a biconditional interpretation... This behavior implies that these models do not engage in genuine pragmatic reasoning, but rather rely on a fixed, rule-based strategy"

Magic Words: adopted · engage · rely on · strategy

Illusion Created: This framing depicts the language models as active strategic decision-makers that choose to adopt a specific rule-based strategy to solve the test. It masks the reality that the uniform output is a statistical artifact of reinforcement learning and instruction tuning, where specific paths in the network have been heavily weighted by human feedback to produce rigid outputs.

Critical Gloss

Metaphor Type: Model as Student (learning metaphor)

Dimension	Classification	Evidence
Acknowledgment	❌ Naturalized (presented as literal)	The authors describe the models as adopting a strategy and relying on rules, treating these agential terms as descriptive of the model's internal operations.
How/Why	Why (Agential)	It explains why the output is uniform by claiming the model relies on a fixed, rule-based strategy, imputing strategic intent to a static mathematical mapping.

Agency Displacement: The strategy is not an active choice by the LLM. It is an artifact of reinforcement learning from human feedback (RLHF) or instruction fine-tuning, where human annotators heavily penalize non-standard logic, leading the model's weights to settle on highly rigid, over-generalized patterns.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ A second group... generated outputs consistent with a biconditional truth-table... This behavior indicates that these models do not generate context-dependent outputs, but rather apply a uniform mathematical transformation ✎ᝰ

⚠️ Verdict: Reduced (core survives, nuance lost)

The mechanistic translation accurately identifies the phenomenon: the models' weights produce static probability distributions that mimic a rigid biconditional rule. This is much more precise than saying the model relies on a strategy, though it lacks the intuitive agential narrative.

Frame 7: Primary Operational Language

Narrative Overlay

"This may suggest that, for these models, English remains the dominant and primary operational language..."

Magic Words: dominant · primary operational language

Illusion Created: This invites the reader to imagine that the model has an internal native language or thinking voice in English, and that it must perform mental translation or cognitive routing when processing other languages. It translates the mathematical reality of corpus imbalance into a psychological trait of multi-lingual identity.

Critical Gloss

Metaphor Type: Model as Mind (consciousness projection)

Dimension	Classification	Evidence
Acknowledgment	❌ Naturalized (presented as literal)	The authors refer to English as the primary operational language without quotes or qualifiers, treating it as an objective structural fact.
How/Why	How (Mechanistic)	It describes the operational profile where performance is superior or mediated by English-language routing due to corpus density.

Agency Displacement: This dominance is a direct consequence of human training decisions—specifically, the massive imbalance of English-language text in the training corpora selected by developers, and the English-centric reinforcement processes. The models' bias is actually the developers' data-selection bias.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ This may suggest that the models' parameter distributions are heavily optimized for English text statistics, resulting in higher-quality latent representations and outputs when processing English tokens ✎ᝰ

✅ Verdict: Preserved (phenomenon is real)

Translating operational language to parameter optimization and latent pathways demystifies the idea of a native tongue or thinking in English, substituting it with the actual mechanics of token representation and training data imbalance.

Frame 8: Integrating Speaker Intentions

Narrative Overlay

"they struggle to integrate contextual cues and speaker intentions in the flexible, inference-driven manner characteristic of human interpretation"

Magic Words: integrate · speaker intentions · inference-driven

Illusion Created: This suggests that the model is capable of modeling and representing the internal mental states (intentions) of the human speakers who wrote the prompts. It creates the illusion of theory of mind, implying the model is reasoning about why a speaker said something, rather than calculating the conditional probability of the next token based on statistical co-occurrences of words.

Critical Gloss

Metaphor Type: Model as Mind (consciousness projection)

Dimension	Classification	Evidence
Acknowledgment	❌ Naturalized (presented as literal)	The authors state that the models fail to integrate speaker intentions, implying that the models have an architectural pipeline that is supposed to process mental states.
How/Why	Why (Agential)	It explains the failure by stating the model cannot integrate intentions, which is an agential concept. Mechanistically, there is no channel or parameter representing an intention; there is only the prompt text.

Agency Displacement: It obscures that the models have no access to speaker intentions because they are not communicating; they are executing a static feed-forward mapping of a provided text string. The human designers are responsible for presenting this prediction task as a social conversation.

Mechanistic Translation

POSSIBLE REWRITE:

✎ᝰ NO TRANSLATION POSSIBLE—no mechanistic process identified beneath this framing. ✎ᝰ

❌ Verdict: No Phenomenon (metaphor was constitutive)

An LLM cannot integrate a speaker's mental state or intention because it only processes tokenized text strings; there is no cognitive mechanism to compute real human mental states, only text statistics. The framing of integrating speaker intentions is constitutive of a cognitive capacity that does not exist in the technology.

Part 2: Transformation Glossary

About this section

Summary table of all translations from Part 1. Provides compact reference showing the full scope of the text's anthropomorphic vocabulary and whether each term survives mechanistic translation.

Original	Translation	Verdict	Note
cognitive toolkit of artificial systems	[No mechanistic equivalent]	❌ No Phenomenon	A cognitive toolkit does not exist inside a neural network. It is a constitutive metaphor with no computational referent.
linguistic agents	text sequence generation software	✅ Preserved	Accurately describes the model's physical input-output behavior while removing agential connotations.
struggle with meaning-related components	fail to generate tokens that systematically align with target semantic referents	⚠️ Reduced	Preserves the functional output failure but loses the intuitive, dramatic metaphor of struggle.
Decontextualization Bias	optimization bias favoring high-probability literal training statistics	✅ Preserved	Explains the functional failure to modulate probabilities as a direct result of text-only training.
adhering to Gricean principles	[No mechanistic equivalent]	❌ No Phenomenon	Models cannot cooperate or adhere to social maxims as they lack communicative intent.
rely on a fixed, rule-based strategy	apply a uniform mathematical mapping function	⚠️ Reduced	Replaces agential strategic planning with the actual mathematical operations of frozen weights.
dominant and primary operational language	language for which parameter distributions are heavily optimized	✅ Preserved	Replaces the concept of a native tongue with the reality of training data imbalance.
integrate speaker intentions	[No mechanistic equivalent]	❌ No Phenomenon	Models cannot represent human mental states; they only calculate token probabilities from text prompts.

Part 3: The Rewriting Experiment

About this section

A complete rewriting of a representative passage from the source text. The goal is to preserve all genuine technical claims while removing anthropomorphic framing. Numbered annotations explain each translation decision.

Why This Passage?

This passage from the Discussion is selected because it represents the critical explanatory core of the paper. Here, the authors transition from reporting empirical data to interpreting why LLMs behave as they do, introducing their primary theoretical contribution (Decontextualization Bias) and explaining the models' performance profiles. The passage is heavily saturated with anthropomorphic and cognitive metaphors to explain why the models failed the pragmatic tests. Translating this passage tests whether their explanatory theory can be formulated in a scientifically rigorous, mechanistic framework without losing its explanatory power.

Original vs. Mechanistic Translation

Original Passage	Mechanistic Translation
Nonetheless, the two general response profiles displayed by LLMs explain the overall distribution of results: biscuit interpretations were nearly absent, whereas accuracy on SCs was comparatively higher. Taken together, these findings suggest that the tested LLMs behave more like competent semantic agents rather than pragmatic reasoners. While models often capture the literal, truth-conditional structure of conditionals, they struggle to integrate contextual cues and speaker intentions in the flexible, inference-driven manner characteristic of human interpretation. This pattern is consistent with what we term a Decontextualization Bias: a tendency to privilege surface form and literal meaning over contextually enriched interpretations, which stems from absence of grounding language in real-world communicative settings. Under this view, models may acquire certain pragmatic regularities, but when literal and enriched interpretations compete, they resort to the former, diverging systematically from human behavior. This aligns with the observation that LLMs appear to have developed formal linguistic competence without fully attaining functional competence, defined as the ability to use language in contextually and world-grounded ways (1). Turning to the interpretive differences between perfected and biscuit readings, human behavior clearly supports this distinction: the two constructions diverged in contexts where a pragmatic inference was required, consistent with accounts in which strengthened readings arise through implicature calculation (58-60), whereas biscuit interpretations depend on reasoning about epistemic independence between antecedent and consequent (59, 61-62, 92). Crucially, LLM behavior mirrors this distinction only weakly. Although some models exhibited a reduced SC vs. BC contrast, the effect was attenuated and highly inconsistent across models. Many models produced either literal, truth-conditional interpretations or uniformly strengthened biconditionals, thereby collapsing the pragmatic distinction between SCs and BCs observed in human judgments. Specifically, rather than flexibly computing different inferences depending on context, models often applied a single interpretive strategy across constructions, as independently observed in previous work (33, 35). These results reinforce the idea that pragmatic enrichment is not a uniform or automatic consequence of large-scale language modeling. Instead, it appears to require mechanisms for context integration and relevance reasoning that are only partially, if at all, captured by current architectures. Overall, we find that reasoning can be conceived as a multi-step process that integrates various components (e.g., temporal sequencing, causal linking, logical processing, pragmatic enrichment, world-based inference), and at present, the outermost layers of this multi-step process, that are responsible for rewriting the interpretability of the linguistic message based on pragmatic, world-grounded inferences, are still under development in LLMs.	Nonetheless, the two distinct distributions of token outputs generated by the LLMs account for the overall variance in the results: outputs matching biscuit interpretations were virtually absent, whereas outputs matching standard conditional patterns showed higher alignment with the target answers. Taken together, these findings indicate that the tested LLMs generate outputs consistent with formal semantic rules rather than dynamic contextual modifications. While the models' parameters yield outputs that align with the literal, truth-conditional structure of conditionals, their neural network computations fail to modulate output token probabilities based on contextual prompt tokens or representations of speaker intent, contrasting with human linguistic processing. This pattern is consistent with what is defined here as a Decontextualization Bias: a computational property where token generation is dominated by high-probability surface forms and literal training statistics, a consequence of training neural networks on text-only corpora without real-world sensorimotor grounding. Under this framework, although the models' parameter weights encode statistical approximations of pragmatic regularities, when literal and contextually enriched pathways yield conflicting token probabilities, the feed-forward computation outputs the literal token, diverging systematically from human behavioral data. This aligns with the observation that LLMs generate syntactically coherent and formally logical text but do not demonstrate functional execution, which would require mapping tokens to real-world environments and dynamic contexts (1). Regarding the structural differences between perfected and biscuit readings, human behavioral data supports this distinction: human judgments diverged in contexts requiring pragmatic inference, aligning with theories where strengthened readings result from cognitive calculations of implicature (58-60), while biscuit readings rely on cognitive evaluations of epistemic independence between antecedent and consequent (59, 61-62, 92). Crucially, LLM token generation mirrors this distinction only weakly. Although some model outputs exhibited a minor statistical difference between standard and biscuit prompts, the effect was minimal and highly inconsistent across different models. Many models generated outputs aligned exclusively with literal truth tables or uniformly with biconditional structures, thereby eliminating the distinct output profiles observed in human responses. Specifically, instead of modulating output probabilities based on specific contextual inputs, the systems' computations applied a uniform mapping function across all prompts, as observed in previous research (33, 35). These results reinforce the conclusion that pragmatic token alignment is not an inherent or automatic outcome of training transformer models on large-scale text corpora. Instead, generating such outputs appears to require computational architectures explicitly designed for context integration and relevance modeling, which are not implemented in current LLM architectures. Overall, the generation of pragmatically appropriate text can be modeled as a multi-stage computation integrating distinct functions (such as sequence order, causal feature extraction, logical truth-mapping, contextual probability adjustments, and world-state simulation); currently, the algorithmic steps required to adjust token interpretability based on real-world constraints and social contexts are not present within LLM architectures.

Original Passage

Mechanistic Translation

Nonetheless, the two general response profiles displayed by LLMs explain the overall distribution of results: biscuit interpretations were nearly absent, whereas accuracy on SCs was comparatively higher. Taken together, these findings suggest that the tested LLMs behave more like competent semantic agents rather than pragmatic reasoners. While models often capture the literal, truth-conditional structure of conditionals, they struggle to integrate contextual cues and speaker intentions in the flexible, inference-driven manner characteristic of human interpretation. This pattern is consistent with what we term a Decontextualization Bias: a tendency to privilege surface form and literal meaning over contextually enriched interpretations, which stems from absence of grounding language in real-world communicative settings. Under this view, models may acquire certain pragmatic regularities, but when literal and enriched interpretations compete, they resort to the former, diverging systematically from human behavior. This aligns with the observation that LLMs appear to have developed formal linguistic competence without fully attaining functional competence, defined as the ability to use language in contextually and world-grounded ways (1). Turning to the interpretive differences between perfected and biscuit readings, human behavior clearly supports this distinction: the two constructions diverged in contexts where a pragmatic inference was required, consistent with accounts in which strengthened readings arise through implicature calculation (58-60), whereas biscuit interpretations depend on reasoning about epistemic independence between antecedent and consequent (59, 61-62, 92). Crucially, LLM behavior mirrors this distinction only weakly. Although some models exhibited a reduced SC vs. BC contrast, the effect was attenuated and highly inconsistent across models. Many models produced either literal, truth-conditional interpretations or uniformly strengthened biconditionals, thereby collapsing the pragmatic distinction between SCs and BCs observed in human judgments. Specifically, rather than flexibly computing different inferences depending on context, models often applied a single interpretive strategy across constructions, as independently observed in previous work (33, 35). These results reinforce the idea that pragmatic enrichment is not a uniform or automatic consequence of large-scale language modeling. Instead, it appears to require mechanisms for context integration and relevance reasoning that are only partially, if at all, captured by current architectures. Overall, we find that reasoning can be conceived as a multi-step process that integrates various components (e.g., temporal sequencing, causal linking, logical processing, pragmatic enrichment, world-based inference), and at present, the outermost layers of this multi-step process, that are responsible for rewriting the interpretability of the linguistic message based on pragmatic, world-grounded inferences, are still under development in LLMs.

Nonetheless, the two distinct distributions of token outputs generated by the LLMs account for the overall variance in the results: outputs matching biscuit interpretations were virtually absent, whereas outputs matching standard conditional patterns showed higher alignment with the target answers. Taken together, these findings indicate that the tested LLMs generate outputs consistent with formal semantic rules rather than dynamic contextual modifications. While the models' parameters yield outputs that align with the literal, truth-conditional structure of conditionals, their neural network computations fail to modulate output token probabilities based on contextual prompt tokens or representations of speaker intent, contrasting with human linguistic processing. This pattern is consistent with what is defined here as a Decontextualization Bias: a computational property where token generation is dominated by high-probability surface forms and literal training statistics, a consequence of training neural networks on text-only corpora without real-world sensorimotor grounding. Under this framework, although the models' parameter weights encode statistical approximations of pragmatic regularities, when literal and contextually enriched pathways yield conflicting token probabilities, the feed-forward computation outputs the literal token, diverging systematically from human behavioral data. This aligns with the observation that LLMs generate syntactically coherent and formally logical text but do not demonstrate functional execution, which would require mapping tokens to real-world environments and dynamic contexts (1). Regarding the structural differences between perfected and biscuit readings, human behavioral data supports this distinction: human judgments diverged in contexts requiring pragmatic inference, aligning with theories where strengthened readings result from cognitive calculations of implicature (58-60), while biscuit readings rely on cognitive evaluations of epistemic independence between antecedent and consequent (59, 61-62, 92). Crucially, LLM token generation mirrors this distinction only weakly. Although some model outputs exhibited a minor statistical difference between standard and biscuit prompts, the effect was minimal and highly inconsistent across different models. Many models generated outputs aligned exclusively with literal truth tables or uniformly with biconditional structures, thereby eliminating the distinct output profiles observed in human responses. Specifically, instead of modulating output probabilities based on specific contextual inputs, the systems' computations applied a uniform mapping function across all prompts, as observed in previous research (33, 35). These results reinforce the conclusion that pragmatic token alignment is not an inherent or automatic outcome of training transformer models on large-scale text corpora. Instead, generating such outputs appears to require computational architectures explicitly designed for context integration and relevance modeling, which are not implemented in current LLM architectures. Overall, the generation of pragmatically appropriate text can be modeled as a multi-stage computation integrating distinct functions (such as sequence order, causal feature extraction, logical truth-mapping, contextual probability adjustments, and world-state simulation); currently, the algorithmic steps required to adjust token interpretability based on real-world constraints and social contexts are not present within LLM architectures.

Translation Notes

#	Original	Translated	What Changed	Why	Verdict
1	behave more like competent semantic agents rather than pragmatic reasoners	generate outputs consistent with formal semantic rules rather than dynamic contextual modifications	Replaced agential metaphors of behavior and identity with output-centric mathematical descriptions.	The original phrase implies the model possesses agency and selects semantic roles. The translation clarifies that the model simply outputs text matching formal syntactic and semantic distributions rather than adjusting probabilities dynamically.	✅ Preserved
2	they struggle to integrate contextual cues and speaker intentions	their neural network computations fail to modulate output token probabilities based on contextual prompt tokens or representations of speaker intent	Replaced the struggle metaphor and direct access to speaker intentions with parameter computation and token modulation.	Models do not struggle or have access to human intentions; they can only process tokenized input. The rewrite specifies the exact technical limitation: failing to adjust conditional token probabilities based on prompt tokens representing context.	✅ Preserved
3	a tendency to privilege surface form	a computational property where token generation is dominated by high-probability surface forms	Translated the agential tendency to privilege into a mathematical optimization bias.	The original implies an active choice or cognitive bias. The translation shows it is a structural property resulting from optimization objectives on word co-occurrence frequencies.	✅ Preserved
4	models may acquire certain pragmatic regularities, but... resort to the former	although the models' parameter weights encode statistical approximations of pragmatic regularities, when literal and contextually enriched pathways yield conflicting token probabilities, the feed-forward computation outputs the literal token	Replaced cognitive learning and retreating metaphors with weight encoding and feed-forward path activation.	The original suggests a conscious entity choosing to retreat to safety. The translation explains that conflict resolution in neural networks occurs through feed-forward activation of higher-probability paths.	✅ Preserved
5	models often applied a single interpretive strategy across constructions	the systems' computations applied a uniform mapping function across all prompts	Replaced agential interpretive strategy with uniform mapping function.	A strategy suggests active problem-solving. A mapping function accurately describes the frozen mathematical transformations applied to the input token matrix.	✅ Preserved
6	are still under development in LLMs	are not present within LLM architectures	Replaced developmental biological/growth metaphor with architectural absence.	The original implies that LLMs are self-evolving systems growing these layers naturally over time. The translation highlights that human developers simply have not engineered these computational mechanisms.	❌ No Phenomenon

What Survived vs. What Was Lost

What Survived	What Was Lost
The core empirical findings of the study remain entirely intact. Stripped of anthropomorphic framing, the paper still robustly demonstrates that current state-of-the-art LLMs systematically fail to generate pragmatically appropriate outputs for standard and biscuit conditionals. The quantitative divergence between human linguistic choices (which show context-sensitive pragmatic enrichment) and LLM outputs (which are rigidly logical or rigidly biconditional) is preserved. Furthermore, the paper's critical argument against scale—that training larger models on more text does not automatically result in pragmatic reasoning—remains fully supported by the technical data and is even clarified by the mechanistic language.	What is lost is the evocative, narrative momentum that makes cognitive science papers engaging for a broad audience. Metaphors like struggle and strategy provide an intuitive, instantly graspable conceptual shortcut that allows non-experts to visualize model failure. The mechanistic translation is dense, jargon-heavy, and cognitively demanding, requiring the reader to translate abstract statistical concepts into functional outcomes. The stylistic sense of drama—the idea of a machine attempting to reason but failing due to a cognitive bias—is replaced by a dry, technical description of probability distributions and feed-forward operations.

What Survived

What Was Lost

The core empirical findings of the study remain entirely intact. Stripped of anthropomorphic framing, the paper still robustly demonstrates that current state-of-the-art LLMs systematically fail to generate pragmatically appropriate outputs for standard and biscuit conditionals. The quantitative divergence between human linguistic choices (which show context-sensitive pragmatic enrichment) and LLM outputs (which are rigidly logical or rigidly biconditional) is preserved. Furthermore, the paper's critical argument against scale—that training larger models on more text does not automatically result in pragmatic reasoning—remains fully supported by the technical data and is even clarified by the mechanistic language.

What is lost is the evocative, narrative momentum that makes cognitive science papers engaging for a broad audience. Metaphors like struggle and strategy provide an intuitive, instantly graspable conceptual shortcut that allows non-experts to visualize model failure. The mechanistic translation is dense, jargon-heavy, and cognitively demanding, requiring the reader to translate abstract statistical concepts into functional outcomes. The stylistic sense of drama—the idea of a machine attempting to reason but failing due to a cognitive bias—is replaced by a dry, technical description of probability distributions and feed-forward operations.

What Was Exposed

The translation exposed several instances of constitutive anthropomorphism where the metaphor has no technical basis. Specifically, the concept of LLMs integrating speaker intentions was exposed as empty; models have no mechanism to represent external minds, only token-co-occurrence probabilities. Similarly, the idea of an LLM adopting a strategy or having emerging abilities under development was shown to be purely rhetorical. These agential frames obscure the frozen nature of the trained weights and project a false sense of active, self-correcting cognitive growth onto a static set of mathematical matrices.

Readability Reflection

The mechanistic translation is highly precise but significantly less readable for a general audience. It reads like an engineering specification sheet rather than a cognitive science paper. To bridge this gap without reintroducing agential myths, one could use structural analogies rather than cognitive ones. For example, comparing the LLM to an automated sorting machine that routes items based on simple labels rather than inspecting their contents. This explains the functional limitation without falsely attributing struggling or intending to the mathematical algorithm.

Part 4: What the Experiment Revealed

About this section

Synthesis of patterns across all translations. Includes verdict distribution, the function of anthropomorphism in the source text, a "stakes shift" analysis showing how implications change under mechanistic framing, and a steelman of the text's strongest surviving claim.

Pattern Summary

Verdict	Count	Pattern
✅ Preserved	3	—
⚠️ Reduced	2	—
❌ No Phenomenon	3	—

Pattern Observations: The experiment revealed a clear split based on the function of the metaphor. Metaphors describing functional output characteristics (e.g., primary operational language, linguistic agents) were easily translated into mechanistic terms with minimal loss, because they describe observable statistical profiles. Metaphors describing internal mechanisms (e.g., struggle, strategy) were reduced, losing narrative accessibility but gaining precision regarding weight optimization and token mapping. Most importantly, metaphors implying theory of mind or developmental growth (e.g., integrating speaker intentions, cognitive toolkit, adhering to cooperative principles) collapsed into No Phenomenon. These metaphors are constitutive rather than descriptive; they do not correspond to any engineering reality inside the transformer architecture, acting purely as rhetorical devices that project human cognitive architecture onto statistical token predictors.

Function of Anthropomorphism

The anthropomorphic framing in this text serves several distinct rhetorical and disciplinary functions. First, it establishes interdisciplinary relevance. By using terms like cognitive toolkit, linguistic competence, and Gricean principles, the authors bridge the gap between computer science, linguistics, and cognitive psychology, allowing cognitive scientists to study LLMs using established human-centric frameworks. Second, it enhances persuasive force. Describing a model as struggling with meaning or adopting a rule-based strategy makes the AI appear more sophisticated and close to human cognition than describing it as failing to adjust token probabilities due to mathematical constraints. Third, and most critically, it obscures engineering accountability. Framing the model's limitations as an inherent Decontextualization Bias that is affected by a lack of real-world grounding makes the failure seem like a natural, systemic developmental stage of the AI. This shifts focus away from the human decisions—specifically, the commercial rush to deploy next-token predictors as general reasoning engines, and the deliberate choices of training data and objectives that ignore functional utility.

What Would Change

If this paper were published in a strictly mechanistic form, its academic and public reception would shift dramatically. It could no longer frame itself as a study on the cognitive development of AI. Instead, it would be recognized as a highly rigorous evaluation of transformer token-probability distributions under specific logical prompts. The claims of finding a Decontextualization Bias would be restated as an evaluation of statistical default pathways in feed-forward networks. ML engineers would find it highly actionable as a baseline of current architecture limitations, but the broader media and general cognitive science community would likely find it less sensational, as it removes the dramatic narrative of machines failing to think like us.

Stakes Shift Analysis

Dimension	Anthropomorphic Framing	Mechanistic Translation
Threat	We should fear that autonomous linguistic agents with sophisticated formal competence but lacking moral or pragmatic grounding will make incorrect decisions in human communicative environments.	Human deployment of static token-prediction algorithms in contexts requiring real-world understanding will result in predictable, rigid system failures.
Cause	The model possesses a Decontextualization Bias and lacks a complete cognitive toolkit to understand human intentions.	Human developers have optimized transformer models on statistical text patterns alone, neglecting the sensory-motor and social architectures required for pragmatic communication.
Solution	We must wait for the outermost layers of its reasoning abilities to develop further through scale or research.	Engineers must design new architectures with explicit contextual modeling, rather than relying on next-token prediction over text corpora.
Accountable	The model itself, as an emerging and developing agential system, bears the primary developmental limitation.	The tech developers and researchers who deploy text-only models for reasoning tasks while ignoring structural engineering limitations.

Reflection: The mechanistic translation radically shifts the policy and safety stakes. Under the anthropomorphic framing, the threat is an abstract, developmental gap in the AI itself, which encourages a passive, watchful attitude as the technology matures. Under the mechanistic translation, the threat is identified as an immediate, active human error: the inappropriate deployment of statistical text generators for complex logical reasoning. The responsibility shifts entirely from the developing AI to the human corporations and engineers, changing the solution from improving the AI's mind to regulating human deployment and engineering practices. This demonstrates that the anthropomorphic metaphor actively shields human developers from accountability.

Strongest Surviving Claim

About this section

Intellectual fairness requires identifying what the text gets right. This is the "charitable interpretation"—the strongest version of the argument that survives mechanistic translation.

The Best Version of This Argument

Core Claim (Mechanistic): Statistical language models trained via next-token prediction on text-only corpora fail to modulate conditional token outputs in response to pragmatic variables. Specifically, when presented with standard and biscuit conditionals, their frozen parameter weights output either literal truth-functional mappings or rigid biconditional associations, failing to reproduce the context-dependent probability distributions observed in human linguistic behavior.

What Retained:

The quantitative performance gap between humans and LLMs on conditional logic tasks.
The classification of model outputs into rigid logical sub-profiles.
The statistical independence of model failure from broad design features like model size, open-weights status, or reasoning-oriented training.

What Lacks:

The narrative concept of an active cognitive toolkit.
The psychological framing of a model struggling or adopting strategies.
The explanatory framework based on the projection of speaker intentions inside the network.

Assessment: The surviving claim is highly significant, scientifically rigorous, and entirely worth publishing. In fact, stripping away the anthropomorphic overlay strengthens the paper's scientific utility by replacing vague cognitive metaphors with precise engineering challenges. It remains an actionable indictment of the belief that scale alone can solve pragmatic language processing, providing clear empirical evidence that current transformer architectures are structurally limited when it comes to context-dependent communication.

Part 5: Critical Reading Questions

About this section

These questions help readers break the anthropomorphic spell when reading similar texts. Use them as prompts for critical engagement with AI discourse.

1 Agency Displacement: When the authors state that a model 'adopted a biconditional interpretation,' what specific choices in the training data, fine-tuning process, or reinforcement learning objectives (designed by humans) forced the model's weights into this rigid mathematical alignment?

2 Consciousness Projection: Does the term 'Decontextualization Bias' describe an active psychological preference of the model, or is it simply a label for the mathematical limitation of a feed-forward network processing token vectors without real-world sensory coordinates?

3 How/Why Slippage: When the text asserts that LLMs 'struggle to integrate speaker intentions,' is there any mechanistic component in a transformer's attention block designed to calculate a speaker's mental state, or does the system simply compute conditional token probabilities based on prompt words?

4 Agency Displacement: The authors suggest that pragmatic abilities are 'still under development in LLMs' as if they are organically growing; who is actually doing the developing, and what specific engineering changes would be required to build these abilities?

5 Domain-Specific: How does framing the LLM as a 'competent semantic agent' shift our expectations of its safety, and how would those expectations change if we defined it as a 'static statistical text-pattern classifier'?

Extended Processing Summary

Gemini's Processing Steps

What you are about to read…

Gemini refers to the text below as "thought summaries." This is an overt consciousness projection because 'intentions' are hallmarks of a conscious mind that 'knows' what it is doing and why. The concealed mechanistic process is probabilistic text generation. Treat this as just another rhetorical artifact—a way of making the model's processing legible.

Defining the Analysis Target

I'm now zeroing in on the core objective: crafting an "AI Literacy Deconstructor" analysis for that paper by Morosi et al. The focus is to meticulously follow the provided JSON schema and exclude any comments or invalid entries. The goal is to provide a concise and compliant output.

Parsing the Target Paper

I'm now engaged in analyzing the Morosi et al. paper, dissecting its key elements. I'm focusing on their work testing 25 LLMs on conditional inferences across multiple languages. I'm pinpointing their key finding about the "Decontextualization Bias". I'm also noting the prevalent anthropomorphic language, where they describe LLMs as agents with a cognitive toolkit, or capable of human-like reasoning.

Mapping the Mental Metaphors

I'm cataloging the prevalent anthropomorphic language, identifying instances where the Morosi et al. paper ascribes human-like cognition or agency to LLMs. I'm noting the "Model as Mind," "Model as Agent," and "Model as Student" metaphors, including phrases like "knows," "understands," "decontextualization bias," and "adopted a biconditional interpretation." The dominance of English is also categorized.

\n\n

Structuring the Data Output

Now, I'm constructing the JSON output based on the provided schema. The frameAnalysis array will contain entries with narrativeOverlay, criticalGloss, and mechanisticTranslation components, as specified. Simultaneously, I'll build the transformationGlossary array. I'm ensuring everything is correctly formatted to comply with the defined structure, in preparation for the upcoming analysis of the LLM paper.

Analyzing the Output Schema

I'm now implementing the detailed schema for the output. I've focused on the structure of the rewrittenExcerpt, experimentRevealed, and criticalReadingQuestions objects and arrays. I'm prioritizing their implementation and the subsequent per-frame analysis. I'm currently working on Frame 1, "The Cognitive Toolkit", aiming to analyze it in detail as part of the analysis.

Deconstructing the Cognitive Toolkit

I'm now deeply engaged in deconstructing Frame 1, "The Cognitive Toolkit," which treats LLMs as having a mind-like toolkit. I'm focusing on the "Model as Mind (consciousness projection)" metaphor, which is described as naturalized within the text. I'm evaluating the displacement of human agency, specifically how the engineering process is subtly downplayed. My focus is now on the mechanistic interpretation of pragmatic reasoning within the model's structure.

Dissecting the Linguistic Agent

I'm now diving into Frame 2, "The Linguistic Agent," analyzing how the Morosi et al. paper characterizes LLMs. I'm focusing on the "linguistic agents" metaphor, and its impact. I've noted the "cognitive toolkits" phrase again, which is very common. The mechanistic translation is being developed. I'm deconstructing the framing of LLMs as active, autonomous subjects. The current focus is on the "Model as Agent (intentionality projection)" metaphor.

Evaluating the "Agent" Framing

I'm now focusing on the "Model as Agent (autonomous decision-maker)" metaphor within Frame 2. It appears as standard shorthand in NLP, referring to generative systems. This framing, however, obscures the system's nature, an automated feed-forward mathematical function trained on human-produced text, thus sidelining its creators. The mechanistic translation is being refined.

Framing the "Struggle"

I'm now examining Frame 3, "The Struggle with Meaning," highlighting the "Model as Student" metaphor. The language creates an illusion of the model as an active, struggling entity, like a student. I've noted the "struggle" and "display" language, which presents the model's limitations without any attribution to the model's designers. The focus is on the metaphor's naturalization within the text.

Interpreting the Statistical Bias

I'm now focusing on Frame 4, "The Decontextualization Bias," examining its implications. I've analyzed the quote, noting the bias is an extension of the inherent statistical training of LLMs, lacking sensory-motor grounding or intentional goals. The "struggle" terminology is absent here, as the focus is solely on syntactic versus semantic performance. My mechanistic translation accurately reflects the syntactic success versus semantic failure.

Analyzing the Bias Narrative

I'm now fully immersed in dissecting Frame 5, the Decontextualization Bias, from the Morosi et al. paper. The key elements are the "tendency," "rely on," "integrating," and "guide" magic words, implying active cognitive processes in the LLM. This builds the "Model as Mind (consciousness projection)" metaphor, creating an illusion of active preference. I'm focusing on the structural reasons for the bias, as well as the displacement of agency away from the system designers.

Evaluating the Bias Detail

I'm now zeroing in on Frame 6, which directly addresses the decontextualization bias. I'm scrutinizing how the paper frames the bias as an inherent "tendency" of the model. I'm focusing on dissecting the "tendency," and assessing whether this agential framing accurately reflects the limitations of the transformer's architecture, which is a mathematical process. The mechanistic translation is being refined.

Analyzing the Conversational Frame

I'm now deep into Frame 7, analyzing "The Model as a Cooperative Communicator" metaphor. I've noted how the paper contrasts the LLM's limitations with human "cooperative communication" and Gricean principles. The "ability to reason," "cooperative communication," and "communicative intent" language is the focus. I'm deconstructing the illusion of a social actor by examining the displacement of agency away from commercial interests.

Refining the Comparative Analysis

I'm now zeroing in on Frame 7. The mechanistic translation is being assessed. I'm focusing on why cooperative motivation is used to discuss an LLM's limitations. I'm noting the contrasts between human cooperative communication versus statistical model behavior. The focus is on the lack of communicative intent. I'm now exploring how the paper's language constructs the illusion of a social actor, a key point.

Deconstructing the Learning Metaphor

I'm currently dissecting how the paper uses the "Model as Student" metaphor in Frame 8. Key phrases like "adopted a biconditional interpretation" and "rely on a fixed, rule-based strategy" create an illusion of active choice. This obscures the role of reinforcement learning from human feedback, which shapes the model's output. The mechanistic translation will expose how rigidly-patterned outputs are the result of training.

Interpreting the Linguistic Bias

I'm now diving into the final frame. I've focused on how Morosi et al. uses terms that anthropomorphize the model's linguistic preferences. I'm focusing on the "dominant and primary operational language" illusion. I've noted that this phrasing constructs the model as having an "inner voice" in English, as though it were a human with preferences, rather than a statistical function. The mechanistic translation is being finalized.

Translating the Final Frame

I am now focusing on the final analysis, meticulously translating "operational language." I'm deconstructing the language used to avoid misinterpretations. My priority is ensuring the mechanistic translation accurately represents the underlying statistical processes, rather than anthropomorphic projections. I'm finalizing the analysis of the displacement of agency.

Demystifying the LLM "Voice"

I'm currently focused on Frame 8 and the concept of "The Model Wants to be Helpful." The phrases "positive response bias" and "tended to provide" suggest a model personality, but I'm exposing parameter optimization and training data imbalances. The mechanistic translation now highlights how biases result from fine-tuning or dataset configurations.

Summarizing the Statistical Bias

I'm now zeroing in on Frame 9's details on "positive response bias" and the "Falcon 7B" example. I'm focusing on the translation, describing the issue as an unbalanced probability distribution for affirmative tokens, and ensuring a mechanistic perspective that aligns with the analysis. I'm building a concise dictionary summarizing the mechanistic translations from Part 1, for the transformation glossary. The rewritten excerpt will be the final step.

Selecting the Core Text

I'm now zeroing in on a key passage from the paper's discussion, specifically paragraphs 3 and 4 of section 5. It deals with the "Decontextualization Bias" and how LLMs behave as "competent semantic agents." My focus is to analyze this text for its explanatory power and to accurately portray how LLMs are framed in terms of agency, particularly focusing on how agency is ascribed, for a detailed "AI Literacy Deconstructor".

Analyzing the Decontextualization Bias

I'm now fully immersed in dissecting the key passage focusing on the Decontextualization Bias. I'm carefully analyzing how the text describes LLMs as exhibiting a tendency to prioritize surface form over contextual interpretation. My translation must highlight the absence of world grounding, the reliance on literal meaning, and how this contrasts sharply with human communication. The final framing analysis is nearing completion.

Evaluating the Contextual Distinction

I'm now zeroing in on the nuances of how LLM behavior diverges from human reasoning within the key text. The focus is on the models' inability to flexibly compute inferences based on context, a key contrast. This involves analyzing the literal interpretations versus contextual ones, highlighting the consistent, single interpretive strategy of the models, in comparison to the flexibility inherent in human pragmatic enrichment.

Refining the Grounding Analysis

I'm currently finalizing the analysis of the Decontextualization Bias passage, focusing on how it frames the contrast between LLM literalism and human contextual understanding. The "multi-step process" of human reasoning is contrasted with LLM limitations. My mechanistic translation emphasizes the lack of pragmatic enrichment and world-based inference in current architectures.

Contrasting the Linguistic Behavior

I'm now zeroing in on the linguistic behavior contrast. I'm focusing on the Decontextualization Bias passage, contrasting the statistical output from the LLM with human reasoning. I'm highlighting the absence of real-world grounding, and also the lack of ability to dynamically compute inferences. The work is nearly done.

Contrasting Model Outputs

I'm now zeroing in on the outputs from LLMs compared to human judgments within the test data. I've noted that human responses are sensitive to pragmatic inferences, while the models' outputs show little differentiation. The models' "computations" apply a uniform mapping across prompts, and I'm deconstructing this.

Confirming the Pragmatic Failure

I am now confirming the core finding: current LLM architectures don't automatically produce pragmatically appropriate outputs. My analysis highlights that achieving this requires specific computational designs. I'm focusing on the multi-stage computations needed for integration. I'm refining my mechanistic translation to capture why current architectures struggle. I'm building translation notes with annotations.

Exposing the Linguistic Pitfalls

I'm now focusing on the distinct categories of LLM failure. I've classified rigid models with default truth-table logic, contrasting this with models that rely on strict biconditional logic. The critique of scale is becoming clear: size alone doesn't grant pragmatic outputs. The anthropomorphic shorthand's simplicity is lost, as the "struggle" narrative obscures the technical explanation. I'm now cataloging the psychological framing.

Revising for Readability

I'm now revising my mechanistic translations to improve accessibility for a wider audience. I'm focusing on the technical precision versus ease of understanding trade-off. My goal is to find a balance between accurate technical descriptions and digestible summaries without technical jargon.

Revising the Explanation

I'm now revising the explanation, focusing on the decontextualization bias. I'm replacing agential language with computational terms, such as "mathematical filters" and "text prediction calculators," to demystify the system's operation. I am assessing the need for restructuring. My mechanistic translation will include the model as a statistical system.

Revising Verdicts and Frames

I'm now revising my previous verdicts and introducing new frames. I'm focusing on highlighting instances where anthropomorphism obscures the lack of underlying technical mechanisms. The "Cognitive Toolkit" frame, which ascribes "toolkit" properties, is now classified as "No Phenomenon" because there is no computational equivalent. I'm revising the "Cooperative Communicator" as "No Phenomenon," as models have no intentionality.

Categorizing the Metaphors

I'm now categorizing all the previous frames. The goal is to classify them as "Preserved," "Reduced," or "No Phenomenon" to better clarify the analysis. I'm focusing on the "Cooperative Communicator" as a "No Phenomenon," because, without relational intent, the Gricean cooperative principle can't be applied. I'll explicitly draft the 8 frames as well.

Categorizing the Final Frames

I'm now finalizing the classification of all eight frames: "Decontextualization Bias," "The Linguistic Agent," and "The Struggle" are classified as "Reduced." "The Cognitive Toolkit," "Cooperative Communicator" and "Integrating Speaker Intentions" are "No Phenomenon." The final step involves a function analysis of the anthropomorphic language and the shifts in stakes, audience and meaning.

Refining the Structural Analysis

I'm now revising the JSON output, particularly focusing on the criticalReadingQuestions. I'm drafting 6 questions aimed at unpacking agency displacement and consciousness projection. The focus is to probe how and why we slip into anthropomorphic readings. I'm also preparing for final validation.

Adapting the JSON Format

Now, I'm refactoring the JSON output to strictly adhere to single-line formatting, eliminating control tokens like newlines or tabs. I'm focusing on ensuring all strings are appropriately quoted and avoiding any comments within the JSON. The goal is to produce valid, dense JSON with no whitespace characters in the output.

Compressing the JSON Output

I'm now implementing the final formatting step: creating a single-line JSON string. This involves removing all whitespace, tabs, and newlines from the existing JSON content. I'll test the single-line output to ensure it's valid according to JSON standards, making sure there are no newlines or tabs. I will draft the remaining content and convert it into a single-line string.

Deconstructing the Linguistic "Agents"

I'm now fully immersed in dissecting Frame 2. It is characterized by the concept of "linguistic agents". I'm analyzing the framing, and its misleading implications. I'm focusing on the illusion of intentionality created, and its displacement of the role of the engineers. The mechanistic translation is still in progress.

Analyzing the Structural Limitations

I am now focusing on Frame 4, the Decontextualization Bias. The "tendency" is analyzed, exposing how a machine's architectural constraints become an illusion of human bias. I'm noting the models' limitations in computing non-linguistic inputs, and contrasting these limitations with human capabilities.

Finalizing the Analysis

I'm now completing my final pass. I'm focusing on "Adhering to Gricean Principles" within the fifth frame, as well as Frame 6. I'm focusing on the "adopted," "engage" terms used. I've re-reviewed "communicative intent," and am building the final output. The "illusion" of a conversational partner is the current focus, for the final analysis.

Finalizing the Outputs

I am now completing the analysis of the text. I'm focusing on the illusion of active decision making and internal language preferences created. I am deconstructing how language can create an illusion.

Classifying the Final Frames

I'm now determining how each phrase fits the schema, and checking them to classify, focusing on each of the remaining phrases. I'm confirming the classification "no phenomenon" when there's no technical system, like speaker intentions. Then, I'll translate the analysis in preparation for the JSON output.

Refining the Output Strategy

I'm now zeroing in on the final phase, ensuring a flawless JSON output. I'm focusing on the single-line format, and am creating a python-like representation in my thoughts to ensure perfect schema alignment. My focus will be on the final string-based JSON. I'm building a rewritten excerpt, with all properties included.

Analyzing the Text's "Tendency"

I'm currently focused on the Morosi paper's language describing LLMs as "competent semantic agents." This framing highlights a shift toward an agential interpretation. I'm focusing on the "tendency to privilege surface form and literal meaning" and contrasting it with the flexible human reasoning. My translation demystifies this "bias," emphasizing architecture's role in contextual understanding.

Translating the Text Focus

I'm now zeroing in on the final text to analyze the decontextualization bias. I'm noting the differing interpretive approaches of LLMs. This is contrasted with human pragmatic inferences, to highlight the model limitations. I'll focus on the implications of the surface form approach.

Translating the Interpretations

I'm now zeroing in on the text's core contrast: LLMs' single interpretive approach versus human, flexible reasoning. I'm noting the lack of "pragmatic enrichment" in current architectures. I'll highlight that LLMs don't automatically produce pragmatic outputs. My translation reflects these differences.

Translating the Text Findings

I'm now zeroing in on the text's core contrast: the LLMs' focus on truth-conditional outputs, contrasting with how their outputs lack contextual modulation. I'm focusing on the "Decontextualization Bias", and how it's linked to real-world grounding and sensorimotor inputs.

Translating the Statistical Detail

I'm now zeroing in on the statistical patterns in the LLM outputs. It's becoming clear that the literal outputs dominate because of the computational design, a feed-forward system, as the human behavior is based on deeper inference. The bias results, with the outputs being syntactically correct yet functionally irrelevant. I'm focusing on contrasting the human behavior and LLM outputs, for my translation.

Finalizing the Translation

I'm now zeroing in on the text that describes uniform mapping and its implications. I'm noting the lack of context integration in the models, contrasting it with human pragmatic responses. The focus is on the multi-stage nature of pragmatic text generation and contrasting that with current LLM designs. The mechanistic translation is the final step.

Refining the Translation Focus

I'm now zeroing in on Frame 5, the decontextualization bias. I'm focusing on "struggle," and comparing how LLMs are framed in terms of agency. I'm noting the absence of grounding. The models are limited by their architectures, not human limitations. I'm finalizing translation notes to clearly reflect this.

Refining the Translation Focus

I'm now completing the final part, focusing on Frame 4 and "surface form" framing. My translation will reflect this as a "mathematical optimization bias." I am assessing the need for revising translations to include the lack of pragmatic functions.

Refining the Final Details

I am now focusing on the final notes on model outputs and pragmatic regularities, as I prepare to classify and output. I'm noting the substitution of the cognitive learning metaphors with technical descriptions of weight encoding and activation. My final translations detail model behavior versus human reasoning.

Refining the Translation Focus

I'm now integrating the final translation details and the JSON output. I'm focusing on "single interpretive strategy" and replacing it with "uniform mapping." I'm also ensuring all the frames are accurately represented. My aim is accurate text mapping. The mechanistic translation will now accurately reflect this.

Finalizing the Translation Details

I am now completing the final translation, clarifying what's preserved and lost by removing the anthropomorphic framing. My verdict for the paper's core findings remains "preserved," but the evocative narrative is lost. I'm focusing on "struggle," and simplifying jargon in the translation.

Refining the Translation Focus

I'm now zeroing in on deconstructing how the paper's anthropomorphic framing is obscuring the system's core function. I'm focusing on "integrating speaker intentions" and the "emergent abilities" metaphors, exposing their lack of technical basis. My translation must highlight the absence of actual intention and the static nature of the underlying mathematical processes.

Revising the Explanation

I'm now revising the explanation. I'm focusing on the translation, replacing any agential language with computational, structural analogies to maintain clarity and accuracy. I'm focusing on contrasting the LLM with an automated sorting machine, that relies on a labeling process rather than internal understanding. I'm assessing the need for restructuring.

Finalizing the Linguistic Analysis

I'm now integrating the final pieces for the JSON output. My analysis categorizes metaphors as "Preserved," "Reduced," or "No Phenomenon" based on their translation feasibility, and a pattern summary is being drafted. I've focused on "struggle," replacing anthropomorphic framing with a mechanistic explanation, and am set to complete the JSON.

Analyzing the Agential Framing

I'm now zeroing in on the rhetorical functions of anthropomorphism. It establishes interdisciplinary relevance, and boosts persuasiveness, by presenting AI as human-like. I'm noting the way it obscures engineering responsibility. My translation now clarifies how the anthropomorphic terms are rhetorical devices.

Refining the Translation Focus

I'm now zeroing in on the text's core claim that models, due to limitations, struggle to compute inferences based on context. My goal is to capture the contrast between models' uniform, literal outputs and humans' pragmatic enrichment. I'm noting the specific role of architectures, like feed-forward networks, and now finalizing the mechanistic translation.

Finalizing the Translation Focus

I'm now deep into revising and finalizing Frame 1, focusing on how framing models as having "cognitive toolkits" displaces the crucial role of engineers. I'm replacing agential phrases and re-focusing on the static statistical processes in the translation. The goal is a precise technical output.

Refining the Safety Stakes

I'm now integrating the reflection on policy and safety. I am revising my analysis to focus on the shift in responsibility from "developing AI" to human error in the deployment of statistical generators. This demands a clear, mechanistic translation of the risks. I am now refining the JSON structure accordingly.

Refining the Safety Stakes

I'm now integrating the safety stakes analysis and its implications for human-AI interaction, refining my analysis to highlight engineering over statistical failures. The anthropomorphic language actively shields engineers from accountability. My translation now clarifies statistical models' architectural limitations.

Refining the Final Translation

I'm now concluding the analysis, focusing on how phrases like "struggle" and "cognitive toolkit" distort system functions. Replacing agential language with structural analogies, I'm ready to categorize and prepare the JSON for output. The rewritten excerpt will now clarify the core findings.

Refining the JSON Implementation

I'm now integrating the final "criticalReadingQuestions". I've finalized my questions and translated them. My aim is accurate text mapping, and I'm set to finalize the JSON output. I'm focusing on crafting the final single-line string for JSON compliance.

Validating the Final JSON

I'm now implementing the validation of my generated single-line JSON string, ensuring there are no newlines or tabs anywhere. I'm focusing on rigorously escaping all strings within the JSON, to guarantee its compliance with the specified format. The final step involves a check for strict adherence to the JSON structure and validity.

Finalizing the Output Formatting

I'm now meticulously crafting the final single-line JSON output, ensuring absolute compliance with all formatting rules. I'm focusing on validating the absence of control characters and backslash escape characters. My goal is a strictly valid and dense JSON string, devoid of any whitespace or newlines beyond required separators.

Finalizing the Output Schema

I'm now generating the single-line JSON, ensuring strict schema adherence - a crucial step. I've confirmed no control characters are present, and all strings are escaped. The goal is complete validity and format compliance. The final test will confirm full JSON structure correctness.

Analysis Provenance

Run ID: 2026-05-25-tracing-the-ongoing-emergence-of-human-l-deconstructor-k9wfhr
Raw JSON: 2026-05-25-tracing-the-ongoing-emergence-of-human-l-deconstructor-k9wfhr.json
Framework: AI Literacy Deconstructor v1.0
Schema Version: 1.0
Generated: 2026-05-25T10:25:35.003Z

Overall Verdict - Does anything survive when the metaphor is removed?​

Part 1: Frame-by-Frame Analysis​

Frame 1: The Cognitive Toolkit​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Frame 2: Linguistic Agents​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Frame 3: Struggle with Meaning​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Frame 4: Decontextualization Bias​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Frame 5: The Cooperative Communicator​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Frame 6: The Adopted Strategy​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Frame 7: Primary Operational Language​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Frame 8: Integrating Speaker Intentions​

Narrative Overlay​

Critical Gloss​

Mechanistic Translation​

Part 2: Transformation Glossary​

Part 3: The Rewriting Experiment​

Why This Passage?​

Original vs. Mechanistic Translation​

Translation Notes​

What Survived vs. What Was Lost​

What Was Exposed​

Readability Reflection​

Part 4: What the Experiment Revealed​

Pattern Summary​

Function of Anthropomorphism​

What Would Change​

Stakes Shift Analysis​

Strongest Surviving Claim​

The Best Version of This Argument​

Part 5: Critical Reading Questions​

Extended Processing Summary​

Overall Verdict - Does anything survive when the metaphor is removed?

Part 1: Frame-by-Frame Analysis

Frame 1: The Cognitive Toolkit

Narrative Overlay

Critical Gloss

Mechanistic Translation

Frame 2: Linguistic Agents

Narrative Overlay

Critical Gloss

Mechanistic Translation

Frame 3: Struggle with Meaning

Narrative Overlay

Critical Gloss

Mechanistic Translation

Frame 4: Decontextualization Bias

Narrative Overlay

Critical Gloss

Mechanistic Translation

Frame 5: The Cooperative Communicator

Narrative Overlay

Critical Gloss

Mechanistic Translation

Frame 6: The Adopted Strategy

Narrative Overlay

Critical Gloss

Mechanistic Translation

Frame 7: Primary Operational Language

Narrative Overlay

Critical Gloss

Mechanistic Translation

Frame 8: Integrating Speaker Intentions

Narrative Overlay

Critical Gloss

Mechanistic Translation

Part 2: Transformation Glossary

Part 3: The Rewriting Experiment

Why This Passage?

Original vs. Mechanistic Translation

Translation Notes

What Survived vs. What Was Lost

What Was Exposed

Readability Reflection

Part 4: What the Experiment Revealed

Pattern Summary

Function of Anthropomorphism

What Would Change

Stakes Shift Analysis

Strongest Surviving Claim

The Best Version of This Argument

Part 5: Critical Reading Questions

Extended Processing Summary