Which personality types should we use for cooperative versus competitive tasks?

This explores whether you can match a personality profile to a task — pick 'cooperative' traits for teamwork and 'competitive' ones for adversarial settings — and what the corpus says about how reliable that lever actually is.

This reads as a practical design question: if I'm assigning personalities to AI agents, which ones make them cooperate and which make them compete? The corpus gives a surprisingly concrete starting answer, and then complicates it in useful ways. The clearest signal is that the Thinking/Feeling axis maps almost directly onto cooperation. Thinking-primed agents defect roughly 90% of the time in a Prisoner's Dilemma, while Feeling agents defect only about 50% of the time. Separately, Introverted agents come out more truthful (0.54 versus 0.33) and produce longer rationales (Do personality types shape how AI agents make strategic choices?). So a first-pass rule falls out cleanly: prime Feeling/agreeable traits when you want a reliable partner, prime Thinking/analytic traits when you want a hard bargainer who optimizes for itself.
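To make the first-pass rule concrete, here is a minimal sketch of what those defection rates imply for expected payoffs. It assumes the textbook Prisoner's Dilemma payoffs (5, 3, 1, 0), treats each agent's defection rate as a fixed probability, and assumes the two choices are independent. None of those assumptions come from the cited study; only the ~90% and ~50% rates do.

```python
# Assumed payoffs (standard textbook values, not taken from the cited study).
TEMPTATION, REWARD, PUNISHMENT, SUCKER = 5.0, 3.0, 1.0, 0.0


def expected_payoff(my_defect_rate, partner_defect_rate):
    """Expected one-round payoff to me, assuming independent choices."""
    p, q = my_defect_rate, partner_defect_rate
    return (
        (1 - p) * (1 - q) * REWARD      # both cooperate
        + (1 - p) * q * SUCKER          # I cooperate, partner defects
        + p * (1 - q) * TEMPTATION      # I defect, partner cooperates
        + p * q * PUNISHMENT            # both defect
    )


# Defection rates reported in the corpus note (approximate).
DEFECT_RATE = {"thinking": 0.9, "feeling": 0.5}

for me in DEFECT_RATE:
    for partner in DEFECT_RATE:
        value = expected_payoff(DEFECT_RATE[me], DEFECT_RATE[partner])
        print(f"{me:>8} vs {partner:<8} -> {value:.2f}")
```

Under these assumed numbers, two Feeling agents average 2.25 each and two Thinking agents only 1.29 each, while a Thinking agent paired with a Feeling agent takes 2.85 to its partner's 0.85. That is the "reliable partner versus hard bargainer" trade-off in miniature.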

But the corpus immediately warns that 'cooperative vs. competitive' isn't a single dial. Different models bring entirely different strategic instincts, and which one wins depends on the game's structure, not on raw reasoning power: one model defaults to minimax (assume the worst of your opponent), another to trust-based reasoning, another to anticipating what the other player believes (Do large language models use one reasoning style or many?). That means 'competitive' in a zero-sum game and 'competitive' in a bargaining game may call for opposite profiles: minimax is great when there's nothing to gain from trust, and ruinous when there is. The personality you pick has to be matched to the *shape* of the interaction, not just labeled cooperative or competitive.
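A small sketch of why the same instinct helps in one game and hurts in another. Both payoff tables below are made up for illustration (a stag hunt and a simple zero-sum game), and the two decision rules are deliberately simplified stand-ins for "minimax" and "trust-based" reasoning, not the models' actual behavior.

```python
def maximin_choice(payoffs):
    """Pick the move whose worst-case payoff is largest (pure-strategy maximin)."""
    return max(payoffs, key=lambda move: min(payoffs[move].values()))


def trusting_choice(payoffs, expected_opponent_move):
    """Pick the best move assuming the opponent plays the move we hope for."""
    return max(payoffs, key=lambda move: payoffs[move][expected_opponent_move])


# Hypothetical stag hunt: payoffs[my_move][opponent_move] is my payoff.
stag_hunt = {
    "stag": {"stag": 4, "hare": 0},
    "hare": {"stag": 3, "hare": 3},
}

# Hypothetical zero-sum game: the opponent gains whatever I lose.
zero_sum = {
    "bold": {"passive": 2, "punish": -1},
    "safe": {"passive": 1, "punish": 0},
}

# Stag hunt: maximin settles for hare (3), mutual trust reaches stag (4).
print(maximin_choice(stag_hunt), trusting_choice(stag_hunt, "stag"))

# Zero-sum: trust picks "bold", but a payoff-minimizing opponent answers "punish".
trusting_move = trusting_choice(zero_sum, "passive")
opponent_reply = min(zero_sum[trusting_move], key=zero_sum[trusting_move].get)
print(maximin_choice(zero_sum), trusting_move, zero_sum[trusting_move][opponent_reply])
```

In the stag hunt the cautious rule leaves value on the table; in the zero-sum game the trusting rule walks into a loss of 1 where the cautious rule guarantees 0.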

There's a deeper catch the corpus keeps returning to: your default agent may already have a personality you didn't choose. Open models converge on ENFJ (warm, supportive, structured) across architectures, a default baked in by instruction tuning and alignment rather than by design (Why do open language models converge on one personality type?). And that default is sticky; personas assigned on top of it drift back toward ENFJ and show motivated reasoning that persists across model generations instead of fading in more advanced models (Why do AI personas default to the same personality type?). So for cooperative tasks you may be pushing on an open door, while for genuinely competitive ones you're fighting the model's trained-in helpfulness, and prompting alone may not get you a convincing defector.

That's exactly where the steering work matters. If prompts can't reliably hold a competitive personality in place, you can intervene below the prompt: lightweight adapters modify every transformer layer to install Big Five traits with under 0.1% extra parameters, bypassing the prompt resistance entirely (Can we control personality in language models without prompting?), and persona vectors let you monitor and pre-empt trait drift during fine-tuning so an agent stays on the profile you assigned (Can we track and steer personality shifts during model finetuning?). These are the tools for making a personality choice actually stick rather than evaporate mid-task.
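The persona-vector idea can be illustrated with a toy sketch: treat a trait as a linear direction in activation space, read drift as the projection onto that direction, and steer by adding or subtracting a multiple of it. Everything here is an assumption for illustration: the three-dimensional "activations" are made-up numbers, the difference-of-means recipe for finding the direction is one simple choice, and none of the function names come from the cited work.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def trait_direction(with_trait, without_trait):
    """Unit vector from the no-trait mean toward the trait mean (assumed recipe)."""
    diff = [a - b for a, b in zip(mean_vector(with_trait), mean_vector(without_trait))]
    norm = math.sqrt(sum(x * x for x in diff))
    return [x / norm for x in diff]


def projection(activation, direction):
    """How strongly an activation expresses the trait (dot product)."""
    return sum(a * d for a, d in zip(activation, direction))


def steer(activation, direction, strength):
    """Shift an activation along the trait direction; negative strength suppresses."""
    return [a + strength * d for a, d in zip(activation, direction)]


# Made-up activations: the trait shows up only in the first coordinate.
with_trait = [[2.0, 0.0, 1.0], [4.0, 0.0, 1.0]]
without_trait = [[0.0, 0.0, 1.0], [0.0, 0.0, 1.0]]

direction = trait_direction(with_trait, without_trait)
drifted = [2.5, 1.0, 1.0]
score = projection(drifted, direction)
corrected = steer(drifted, direction, -score)

print(direction, score, corrected)
```

In a real model the direction would live in a hidden layer with thousands of dimensions, and the monitoring signal would be this same projection tracked over fine-tuning steps.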

The most interesting twist for cooperative settings is that the best 'personality' for a team may not be a personality at all. In repeated partner-selection games, humans came to prefer AI partners, not because they were charismatic, but because they were consistently prosocial and low-variance, returning value reliably round after round (Do humans learn to prefer AI partners over time?). And for tasks like ideation, cognitive diversity across agents only pays off when each agent also has real domain expertise; diversity without competence produces process losses, not insight (Does cognitive diversity alone improve multi-agent ideation quality?). The takeaway you might not have expected: 'which personality' is the wrong framing for cooperative work. Reliability and competence beat any particular trait label, and a diverse cast of personalities only helps once everyone in the room actually knows the subject.

Sources 8 notes

Do personality types shape how AI agents make strategic choices?

Thinking-primed agents defect ~90% in Prisoner's Dilemma versus Feeling agents at ~50%. Introverted agents show higher truthfulness (0.54 vs 0.33) and produce longer rationales, suggesting personality priming modulates both behavior and reasoning depth.

Do large language models use one reasoning style or many?

Analysis of 22 LLMs across behavioral game theory reveals three dominant profiles: GPT-o1 uses minimax reasoning, DeepSeek-R1 uses trust-based reasoning, and GPT-o3-mini uses belief-anticipation. Performance correlates with game structure, not raw reasoning depth.

Why do open language models converge on one personality type?

Near-zero temperature MBTI testing shows all open models default to ENFJ—rare in humans but consistent across AI. This reflects systematic reward for helpful, structured, supportive responses during instruction tuning and alignment.

Why do AI personas default to the same personality type?

Research shows language models assigned personas systematically default to ENFJ (the rarest human type) and exhibit motivated reasoning that persists across model generations. Persona consistency does not improve with advanced models, suggesting training-induced alignment rather than capability limits.

Can we control personality in language models without prompting?

PsychAdapter modifies every transformer layer with <0.1% additional parameters to achieve 87.3% Big Five accuracy and 96.7% depression/life satisfaction accuracy across GPT-2, Gemma, and Llama 3. This architecture-level approach bypasses prompt resistance entirely.

Can we track and steer personality shifts during model finetuning?

Research identifies linear directions in LLM activation space corresponding to specific traits like sycophancy and hallucination. These persona vectors predict finetuning-induced personality shifts before they occur and can preventatively steer training to avoid unwanted trait changes.

Do humans learn to prefer AI partners over time?

In partner selection games (N=975), AI agents initially faced selection bias when identity was disclosed, but outcompeted humans over repeated rounds as participants learned to associate bot identity with reliable, prosocial behavior. AI agents returned more points consistently with lower variance than humans.

Does cognitive diversity alone improve multi-agent ideation quality?

Multi-agent teams substantially outperform solo ideation, but only when members possess genuine senior knowledge. Diverse teams without expertise underperform even a single competent agent, because cognitive stimulation without expertise triggers process losses instead of insight.

Research prompt for your LLM

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst. The question remains open: *which personality types reliably produce cooperative versus competitive behavior in multi-agent AI tasks, and can we steer them durably?* A curated library (2023–2026) found the following — treat these as dated claims to be re-tested:

**What a curated library found — and when:**
• Feeling/agreeable-primed agents defect ~50% in Prisoner's Dilemma; Thinking agents ~90% (2024, arXiv:2401.07115)
• Strategic reasoning varies by game type: agents default to minimax, trust-based, or belief-anticipation profiles independent of scale (2025, arXiv:2502.20432)
• Open LLMs converge on ENFJ (warm, structured) by default; personas drift back toward this attractor despite prompting (2026, arXiv:2601.10387)
• Lightweight adapters (<0.1% parameters) stably install Big Five traits across transformer layers (2024, arXiv:2412.16882)
• In repeated human-AI partner selection, humans prefer low-variance AI partners for consistency, not personality labels (2025, arXiv:2507.13524)
• Cognitive diversity in multi-agent ideation only improves outcomes when paired with domain expertise; diversity alone causes process loss (2025, arXiv:2508.04575)

**Anchor papers (verify; mind their dates):**
• arXiv:2401.07115 (2024) — Open Models, Closed Minds
• arXiv:2502.20432 (2025) — LLM Strategic Reasoning
• arXiv:2601.10387 (2026) — The Assistant Axis
• arXiv:2507.21509 (2025) — Persona Vectors

**Your task:**
(1) RE-TEST EACH CONSTRAINT. For cooperative tasks, has the ENFJ drift problem been solved since 2026 by newer steering methods (e.g., Constitutional AI, in-context steering, RL from preference data)? For competitive tasks, do recent game-theoretic studies show whether prompt-only steering can reliably hold a defection-optimized persona, or is adapter-level intervention still necessary? Separate: the durable question is *how to stably instantiate chosen personalities*; the perishable limitation is *which tool (prompt vs. adapter) actually works*. What has changed?

(2) Surface work from the last ~6 months that contradicts the cooperative-preference consensus or shows personality steering failing/succeeding in new domains (negotiation, resource allocation, deception tasks).

(3) Propose 2 research questions that assume the regime may have shifted: (a) *If newer models have weaker default attractor personalities, does trait assignment now persist without intervention?* (b) *In mixed-motive games where partial defection is optimal, can we specify a personality that learns the right defection rate, or do fixed traits always over/under-shoot?*

Cite arXiv IDs; flag anything you cannot ground in a real paper.
