Why do language models ignore temporal order in ranking?

Inquiring lines that use this note as a source 17

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Why do bag-of-mentions models discard conversation order in the first place?
How does sequential modeling within a session differ from modeling historical purchase sequences?
What other conversation structures besides mention order carry predictive information for recommendation?
How do position bias and popularity bias interact with sequence order blindness?
Do recency-focused prompts and in-context examples work equally well for order recovery?
How does Netflix decide which rows appear and in what order on the homepage?
What tokens do RL-trained summarizers learn to keep for ranking?
What anchoring effects shape how users rate items in sequence?
Can temporal ranking improve retrieval without modifying the underlying video model?
Should time always be a first-class ranking signal in temporally-extended sources?
How does sequence organization differ between spoken conversation and text chat?
What implicit knowledge about catalogs do LLMs learn from ranking signals alone?
Why does the order of training examples matter for what models learn?
Why does curriculum order matter when information theory says data order is irrelevant?
Why does token ordering in LLMs create sequences rather than true temporal flow?
What architectural changes would help LLMs distinguish causal relationships from temporal sequences?
Do LLMs show stronger reasoning about causality than about temporal ordering?

Related concepts in this collection 4

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map

14 direct connections · 68 in 2-hop network ·medium cluster Open in graph ↗

Why do language models ignore temporal order in … Does conversation order matter for recommending it… Where do recommendation biases come from in langua… Why do global concept drift methods fail for recom… Why do recommendation systems miss recurring user …

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Does conversation order matter for recommending items in dialogue? Conversational recommendation systems typically ignore the sequence in which items are mentioned, treating dialogue as a bag of entities. But does the order itself carry predictive signal about what to recommend next?
complements: TSCR makes order architecturally first-class; LLM zero-shot must be coaxed into using order via prompts — same signal, different recovery mechanism
Where do recommendation biases come from in language models? Do LLM-based recommenders inherit systematic biases from pretraining that differ fundamentally from traditional collaborative filtering systems? Understanding these sources matters for building fairer, more accurate recommendations.
extends: order-blindness is a fourth pretraining-inherited recommendation bias adjacent to the named three
Why do global concept drift methods fail for recommender systems? Recommender systems treat user preferences as individuals with distinct, asynchronous preference shifts. Can standard concept-drift approaches designed for population-level changes capture this per-user heterogeneity?
complements: temporal modeling at training time and recency-prompting at inference time are parallel responses to the same user-drift signal
Why do recommendation systems miss recurring user preference patterns? Most streaming recommendation systems treat preference changes as one-time drift events and discard old patterns. But user behavior often cycles—coffee shops on weekday mornings, gyms on weekends. How should systems account for these recurring periodicities instead of detecting and resetting against them?
complements: explicit periodicity modeling vs prompt-induced recency are alternatives at different architectural layers

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Large Language Models are Zero-Shot Rankers for Recommender Systems0.87 match · arxiv ↗
Premise Order Matters in Reasoning with Large Language Models0.81 match · arxiv ↗
Large Language Models Sensitivity to The Order of Options in Multiple-Choice Questions0.80 match · arxiv ↗
A Survey on Large Language Models for Recommendation0.80 match · arxiv ↗
Foundations of Large Language Models0.79 match · arxiv ↗
Preference Discerning with LLM-Enhanced Generative Retrieval0.79 match · arxiv ↗
Toward Conversational Agents with Context and Time Sensitive Long-term Memory0.79 match · arxiv ↗
MARS: A Multi-Agent Framework Incorporating Socratic Guidance for Automated Prompt Optimization0.78 match · arxiv ↗

Search by related questions 4

Suggested questions this note speaks to — click to search the collection, or type your own.