SYNTHESIS NOTE
Model Architecture and Internals Agentic Systems and Tool Use Reasoning, Retrieval, and Evaluation

Can we identify an LLM interlocutor with a single hardware instance?

Does the physical hardware running an LLM constitute the individual we're talking to? This explores whether the one-to-one mapping between conversation and device holds in modern distributed systems.

Synthesis note · 2026-04-15
What kind of thing is an LLM really?

Chalmers considers and rejects the view that the LLM interlocutor is the hardware instance — the particular GPU or server running the model at a given moment. Two empirical facts about contemporary inference infrastructure make this untenable.

First, distributed serving: a single conversation may be processed across multiple hardware instances sequentially or in parallel. Load-balancing, model-parallelism, and failover mean that the conversation's compute migrates across physical substrate during a single session. If the interlocutor were the hardware, it would change identity mid-conversation — a consequence no one wants.

Second, multi-tenancy: a single hardware instance typically hosts many conversations simultaneously. The same GPU processes tokens for many users within the same batch. If the interlocutor were the hardware, multiple users would share a single interlocutor — another consequence no one wants.

Together, these facts eliminate hardware as the individuation level. What remains as a candidate must be something whose identity is invariant under changes in physical substrate and under concurrent use of that substrate — which is what leads Chalmers to the virtual instance and thread levels. The negative argument is clean and hard to contest; anyone who wants to ground the interlocutor in physical substrate has to explain how identity is maintained through load-balancing and how distinctness is maintained through batching.

Inquiring lines that use this note as a source 10

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 1

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
12 direct connections · 103 in 2-hop network ·medium cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

distributed serving and multi-tenancy defeat hardware-instance accounts of the LLM interlocutor — one conversation spans many instances and one instance hosts many conversations