Do LLMs represent low-resource cultures through dominant cultural proxies?

Inquiring lines that use this note as a source (41)

This note is a source for these synthesized inquiries.

Do language models understand tacit workplace norms and unspoken social rules?
How do low-dimensional representation structures entangle multiple cultures together?
Can output-layer corrections fix fundamental cultural representation deficits in LLMs?
Why do moderately represented cultures show more flattening than data-poor cultures?
What distinguishes genuine cultural understanding from exploited surface-level elimination strategies?
How do LLM biases reflect social classification schemas rather than random errors?
Why do language models successfully simulate political perspectives and social personas?
How does mechanistic interpretability reveal ideological structures in language model weights?
Why does language compression via statistical dependencies capture cultural and situated language use?
What makes internal embeddings useful as multimodal input for language model training?
Can statistical learning from language alone capture all aspects of cultural competence?
Can a world model have rich representations without adequate data coverage?
Can adaptive compute allocation at sub-token granularity improve cross-lingual robustness?
How deeply are ideological structures represented in large language models?
Do language models build world models or just task-specific heuristics?
How do you measure the depth of political representation inside a language model?
Can large language models predict social norms better than individual script variation?
What happens when you remove core political features from a deep model?
Does encoding information in LM representations guarantee it influences output?
When does encoded knowledge fail to influence language model generation?
Can LLMs predict social norms without deep integration into linguistic practices?
Why might encoded world knowledge fail to actually influence language model outputs?
How do language models predict collective social norms better than individual humans?
What does zero-shot psychological profiling reveal about language model representations?
Why do language models approximate collective human judgment better than individuals?
Do language models consistently produce anachronistic output about historical periods?
How do description-based identifiers bias language model output distribution?
Can LLMs recover true joint distributions from marginal census data?
Why do language models reproduce human EPA structure despite different architecture?
What substrate do supervised models lack that makes them weaker on low-resource languages?
Can AI models predict whether alignment reads as warmth versus mockery in different cultures?
How much cultural knowledge exists only in unwritten social rules?
Can statistical learning from text replace embodied cultural experience?
What social information is missing from language data?
Why do language models presume common ground instead of building it?
Can language models learn internal world models without explicit environment specifications?
How do corpus statistics shape the abstraction hierarchy in language model representations?
How can we probe LLM representations in channels that training did not target?
Why does diversity in LLM outputs mask sampling from community priors?
Do rare cultural concepts fail predictably as model scale increases?
How does Western-dominance bias propagate through multimodal training data?

Related concepts in this collection (3)

Can identical outputs hide broken internal representations? Can neural networks produce correct outputs while having fundamentally fractured internal structure that prevents generalization and creativity? This challenges our assumptions about what performance benchmarks actually measure.
Cultural flattening is a specific form of fractured entangled representation (FER): two cultures that should be represented independently are entangled through shared high-resource proxies, and the fracture is the loss of culture-specific regularities.
Do LLM semantic features organize along human evaluation dimensions? Does the structure of meaning in language models match the three-dimensional semantic space (Evaluation-Potency-Activity) that humans use? If so, what are the implications for steering and alignment?
Cultural representations may be entangled in similar low-dimensional structures, where steering toward one culture predictably activates others in the same representation cluster.
Can we measure how deeply models represent political ideology? This research explores whether LLMs vary not just in political stance but in the internal richness of their political representation. Understanding this distinction could reveal how deeply models have internalized ideological concepts versus merely parroting positions.
Cultural depth (the richness of culture-specific features) determines whether the model can be steered toward authentic cultural representation or falls back on flattened proxies.
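
The entanglement claim in these annotations can be made concrete with a toy calculation. The sketch below is an illustration only, not a measurement on any real model: the 4-dimensional space, the `proxy` and culture-specific vectors, and the `depth` parameter are all hypothetical. It assumes each culture's steering direction is a mix of a shared high-resource proxy direction and a culture-specific direction, and reports how much steering toward culture A also moves the representation along culture B.

```python
import numpy as np

def unit(v):
    """Return v scaled to unit length."""
    return v / np.linalg.norm(v)

# Hypothetical toy basis: one shared proxy axis, one axis per culture.
proxy = unit(np.array([1.0, 0.0, 0.0, 0.0]))   # shared high-resource proxy
spec_a = unit(np.array([0.0, 1.0, 0.0, 0.0]))  # features specific to culture A
spec_b = unit(np.array([0.0, 0.0, 1.0, 0.0]))  # features specific to culture B

def culture_direction(specific, depth):
    """Steering direction for a culture.

    depth in [0, 1] is an assumed stand-in for "cultural depth":
    the share of the direction carried by culture-specific features
    rather than by the shared proxy.
    """
    return unit(depth * specific + (1.0 - depth) * proxy)

def cross_activation(depth):
    """Cosine between the A and B directions at the same depth.

    Adding one unit of the A direction to a hidden state changes its
    projection onto the B direction by exactly this amount.
    """
    a = culture_direction(spec_a, depth)
    b = culture_direction(spec_b, depth)
    return float(a @ b)

if __name__ == "__main__":
    for depth in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(f"depth={depth:.2f}  cross-activation={cross_activation(depth):.3f}")
```

In this toy setup, cross-activation falls from 1 (both cultures collapse onto the proxy) to 0 (fully independent directions) as depth increases, which mirrors the claim that shallow, proxy-mediated representations cannot be steered toward one culture without activating its neighbours.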
