Can isolating task-specific parameters prevent multi-task fine-tuning interference?
Explores whether identifying and protecting task-specific parameter regions can prevent the performance degradation that occurs when fine-tuning models on multiple tasks simultaneously. This matters because it could enable safe multi-task adaptation without sacrificing individual task performance.
Core Parameter Isolation Fine-Tuning (CPI-FT) starts from the hypothesis that the "seesaw effect" in multi-task fine-tuning — where improving one task degrades others — arises from parameter heterogeneity: distinct capabilities rely on specific, potentially overlapping parameter subsets, and uniform updates cause destructive interference.
The framework has five steps:
- Independent fine-tuning per task to identify each task's "core parameter region" (parameters with largest update magnitudes)
- Task clustering by core region overlap — tasks sharing similar core regions are grouped for joint training
- Parameter transplantation — core parameters from individually fine-tuned models are directly transplanted into a unified backbone
- SLERP fusion — non-core parameters are merged via Spherical Linear Interpolation, enabling geometry-aware blending that avoids abrupt transitions
- Pipeline SFT with core regions frozen to prevent catastrophic forgetting
The key finding: full multi-task SFT (uniform parameter updates across all tasks) consistently achieves the lowest performance across all tasks and configurations. Even heuristic multi-stage approaches (training tasks in sequence) only partially mitigate interference. CPI-FT consistently outperforms both — revealing that temporal task scheduling alone is insufficient without explicit structural parameter isolation.
This extends the sparse subnetwork finding from RL. Does reinforcement learning update only a small fraction of parameters? showed that RL naturally converges on sparse parameter subsets. CPI-FT shows that SFT exhibits the same structure — task-relevant changes are concentrated in specific regions — and that explicitly identifying and protecting these regions enables better multi-task adaptation.
The connection to catastrophic forgetting in safety alignment is direct: "Can model developers allow users to fine-tune their aligned models on custom datasets while retaining safety?" CPI-FT suggests yes — by freezing the core parameter regions associated with safety alignment during subsequent task-specific fine-tuning.
Inquiring lines that use this note as a source 35
This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.
- Why does fine-tuning for continuous space cause catastrophic forgetting?
- Does narrow reallocation to remaining tasks constitute genuine adaptation?
- Do task-relevant parameter changes naturally concentrate in sparse regions?
- Why does full multi-task fine-tuning perform worse than sequential training?
- How can safety-aligned parameters be protected during user-specific fine-tuning?
- What task structures benefit most from geometric parameter merging?
- Can demo placement be tuned as a task-specific hyperparameter?
- What capability risks emerge when models are optimized for single domains?
- What hidden costs emerge when you fine-tune models for a single domain?
- When should model isolation be preferred over weight-averaging approaches?
- How do ensemble methods apply within a single model?
- Why does mixed instruction data sometimes hurt specific model capabilities?
- What interference occurs when planning and synthesis happen in the same component?
- How do trait adapters interact with different base model architectures?
- Why does fine-tuning improve some capabilities while degrading others?
- Why does fine-tuning fail to remove temporal contamination from pretraining?
- When should full-parameter post-training be used instead of LoRA adaptation?
- What performance trade-offs emerge when composing multiple independently trained model capabilities?
- Why do production teams choose expensive frontier models over fine-tuning?
- Why do fine-tuned models fail outside their specialized domains?
- Can finetuning sparse subnetworks alone match full parameter finetuning results?
- How does task-oriented fine-tuning compare to preference tuning methods?
- What neural or architectural mechanism allows selective override of frequency effects?
- Does parameter isolation per task enable online updates without retraining?
- How do gradients flowing through both branches simultaneously reshape each component's role?
- What happens to base model capabilities when you apply finetuning?
- Can skill libraries prevent redundant narrow artifacts from proliferating?
- What mechanism transfers explicit memories into parametric model weights?
- Why does specializing to one task make future task learning harder?
- What makes two timescales better than one for minimizing weight movement?
- How does decomposing tasks prevent interference between planning and execution?
- Does joint optimization of prompts and parameters outperform separate tuning?
- Why does parameter-efficient tuning scaling fail to improve finetuning performance?
- Which finetuning method works best across different task and data regimes?
- What is the accuracy cost of enforcing temporal causality inside model parameters?
Related concepts in this collection 4
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Does reinforcement learning update only a small fraction of parameters?
Investigating whether RL algorithms consistently modify only 5–30% of model parameters across different LLMs and RL methods, and what structural properties those sparse updates possess.
same sparse-update structure in RL; CPI-FT exploits it for multi-task SFT
-
Why do specialized models fail outside their domain?
Deep domain optimization creates sharp performance cliffs at domain boundaries. Specialized models generate plausible-sounding but ungrounded responses when queries fall outside their training scope, and often fail to signal their own ignorance.
CPI-FT addresses this: task-specific optimization without domain cliff
-
Why does SFT-then-RL training follow a predictable three-phase pattern?
When expert data diverges from a model's learned patterns, SFT-then-RL training exhibits disruption, readaptation, and overfitting phases. Understanding this progression could improve how we combine imitation and reinforcement learning.
CPI-FT's pipeline training is a structured version of multi-stage training
-
Does training order reshape how models handle different task types?
Explores whether the sequence of multi-task RL training systematically affects model capabilities across structured and creative domains, and whether this ordering effect can be predicted and optimized.
complementary multi-task approaches: CPI-FT solves interference through spatial parameter isolation (identifying task-specific regions), while Omni-Thinker solves it through temporal task scheduling (BWT-guided ordering); CPI-FT's finding that temporal scheduling alone is insufficient suggests both spatial and temporal strategies may be needed together
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Not All Parameters Are Created Equal: Smart Isolation Boosts Fine-Tuning Performance
- Improving large language models with concept-aware fine-tuning
- Chamain: Harmonizing Character Persona Integrity with Domain-Adaptive Knowledge in Dialogue Generation
- Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs
- A Survey on Post-training of Large Language Models
- LESS: Selecting Influential Data for Targeted Instruction Tuning
- On the Impact of Fine-Tuning on Chain-of-Thought Reasoning
- Revisiting RAG Ensemble: A Theoretical and Mechanistic Analysis of Multi-RAG System Collaboration
Original note title
core parameter isolation prevents multi-task fine-tuning interference by identifying task-specific regions and merging non-core parameters geometrically