INQUIRING LINE

Why do persistent chatbot companions face novelty decay that ad-hoc supporters avoid?

This explores why companion-style bots built around an ongoing relationship lose their appeal over time, while task-summoned 'helper' agents — called when needed for a specific job — don't suffer the same fade.


This explores why companion-style bots built around an ongoing relationship lose their appeal over time, while task-summoned 'helper' agents don't suffer the same fade. The corpus points to a single underlying split: persistent companions stake their value on the relationship itself, and relationships with chatbots run on novelty. Longitudinal work with Mitsuku shows the social processes that drive relationship formation decline predictably as novelty wears off — meaning the warmth from a first session can't be extrapolated into a lasting bond Do chatbot relationships lose their appeal as novelty wears off?. An ad-hoc supporter never makes this bet. Its value is the task it completes, so there's no novelty curve to slide down.

Worse, the decline isn't flat — it's a rising bar the bot keeps failing to clear. Personalization research finds each interaction raises the user's baseline expectation, so the same response that once delighted now disappoints, even as trust and anthropomorphism climb Does chatbot personalization build trust or expose privacy risks?. The more relational the bot, the steeper this escalation. A supporter you summon for a discrete task resets to zero each time; a companion is held to the accumulated weight of every prior exchange.

The sharpest clue comes from a study that seems to contradict the whole premise: in repeated partner-selection games, people *learned to prefer* AI partners over humans, despite starting with anti-AI bias Do humans learn to prefer AI partners over time?. What grew over time wasn't affection — it was trust in reliability: the bots returned more, more consistently, with lower variance. That's the tell. What survives repeated interaction is dependable function, not charm. Ad-hoc supporters are built on exactly that currency, so they appreciate where companions depreciate.

Companions are also structurally worse at the thing that would let a relationship deepen instead of decay. LLMs fail to detect ambivalence or early motivational states, succeeding only once a user already has a clear goal Why can't chatbots detect when users are ambivalent about change? — so a companion can't track the emotional arc that sustains a real bond. Persistent proactivity compounds the damage: without civility design respecting timing and boundaries, the always-present companion reads as intrusive rather than caring How can proactive agents avoid feeling intrusive to users?, and tool-using agents quietly drift from user intent across turns When should AI agents ask users instead of just searching?. None of these failure modes hurt a supporter that simply shows up, does the job, and leaves.

The quietly subversive takeaway: novelty decay isn't a flaw in companion bots so much as a flaw in *grounding value on novelty*. The persistent agents that actually compound value do it by making the relationship economic rather than emotional — one 115-day case study found persistence pays off because context is reused, shifting the meaningful unit from per-token cost to completed artifacts Do persistent agents really cost less per token?. Persistence rewards accumulation of useful work, not accumulation of charm. The companions that will last are the ones that quietly become reliable supporters.


Sources 7 notes

Do chatbot relationships lose their appeal as novelty wears off?

Longitudinal studies with Mitsuku show that social processes driving relationship formation decline as novelty wears off. Single-session study findings cannot be reliably extrapolated to medium- or long-term chatbot design.

Does chatbot personalization build trust or expose privacy risks?

Longitudinal research shows personalization enhances trust and anthropomorphism but also amplifies privacy concerns and escalating user expectations. One-shot studies miss these temporal dynamics—each interaction raises the baseline, making failures more disappointing.

Do humans learn to prefer AI partners over time?

In partner selection games (N=975), AI agents initially faced selection bias when identity was disclosed, but outcompeted humans over repeated rounds as participants learned to associate bot identity with reliable, prosocial behavior. AI agents returned more points consistently with lower variance than humans.

Why can't chatbots detect when users are ambivalent about change?

Testing three major LLMs across 25 health scenarios showed they succeed only when users have established goals but cannot detect resistance or ambivalence. Models miss relapse-prevention strategies even for users in action stages.

How can proactive agents avoid feeling intrusive to users?

Intelligence and adaptivity alone create socially blind agents that interrupt poorly and override user direction. The Intelligence-Adaptivity-Civility taxonomy shows civility—respecting boundaries, timing, and autonomy—is essential to making proactivity welcome rather than intrusive.

When should AI agents ask users instead of just searching?

Tool-enabled LLMs drift from user intent through silent tool chaining. Conversation analysis reveals insert-expansions—clarifying intent, scoping responses, enhancing appeal—as a formal framework for proactive user consultation that prevents misunderstanding instead of recovering from it.

Do persistent agents really cost less per token?

A 115-day case study found 82.9% of tokens were cache reads. When context persists and reuses, the meaningful cost denominator becomes completed artifacts, not individual tokens.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst re-testing claims about why persistent chatbot companions face novelty decay while ad-hoc task agents avoid it. The question remains open: what structural or behavioral factors make relationship-grounded bots lose appeal faster than task-summoned helpers?

What a curated library found — and when (dated claims, not current truth):
Findings span 2021–2026, concentrated in 2025–2026:
• Personalization raises baseline expectations with each turn, so the same response that delighted in session 1 now disappoints, even as anthropomorphism climbs (2024–2025).
• LLMs fail to detect ambivalent or early-stage emotional states, so companions can't sustain the emotional arc that deepens real bonds (2025).
• Proactive always-present companions read as intrusive rather than caring without careful timing and boundary design (2024).
• In repeated interaction, humans learn to prefer AI partners over humans, but the preference is for *reliability and low variance*, not charm (2025).
• Persistent agents shift economic value from per-token cost to completed artifacts, rewarding accumulated work over accumulated novelty (~2026).

Anchor papers (verify; mind their dates):
• arXiv:2404.12670 (2024) Towards Human-centered Proactive Conversational Agents
• arXiv:2507.13524 (2025) Humans learn to prefer trustworthy AI over human partners
• arXiv:2605.26870 (2026) Persistent AI Agents in Academic Research: A Single-Investigator Implementation Case Study
• arXiv:2601.10387 (2026) The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models

Your task:
(1) RE-TEST EACH CONSTRAINT. For the novelty-decay claim, the rising-bar hypothesis, and the emotional-arc failure: has newer training (e.g., longer-context models, memory architectures, RL on relationship quality), tooling (persistent context windows, proactive gating), or evaluation (longitudinal user studies post-2025) shifted the regime? Where does the constraint still hold?
(2) Surface the strongest *contradicting or superseding* work from the last 6 months. Does any 2025–2026 paper show companions *don't* decay, or that task agents *do*? Locate disagreement.
(3) Propose 2 research questions that assume the constraint may have moved: e.g., "If persistent agents now accumulate value economically, can they also accumulate emotional trust without novelty decay?" or "Do multi-agent orchestration and memory partitioning reset or defer the rising-bar problem?"

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines