LoRA Security: The "Achilles' Heel" of Parameter-Efficient Fine-Tuning?
1. Background: LoRA Is Everywhere
LoRA (Low-Rank Adaptation) is now the de facto standard for fine-tuning LLMs—add a few low-rank matrices, keep the original weights frozen, and the model adapts to new tasks. It is extremely efficient.
But nobody had systematically studied: how secure is a LoRA-fine-tuned model?
2. Our Analytical Framework: An NTK Perspective
We used NTK (Neural Tangent Kernel) to model the kernel-level differences between LoRA and full fine-tuning. The beauty of NTK is that it approximates training as kernel regression, enabling analytical analysis of attack effects.
3. Two Counterintuitive Findings
- Against untargeted poisoning: LoRA is more vulnerable. Reason: LoRA's low-rank constraint limits the model's ability to "correct" the poisoning's influence. Full fine-tuning has more degrees of freedom to "overwrite" anomalous signals introduced by poisoning.
- Against backdoor attacks: LoRA is actually more robust. Reason: LoRA's low-rank nature makes backdoor trigger patterns hard to "survive" in the low-rank subspace. Backdoors are inherently high-rank signals—LoRA's constraint ironically weakens them.
We also found that LoRA's rank and initialization variance significantly affect robustness—higher rank helps, but initialization effects are non-monotonic.
4. Paper Info
- Title: Does Low Rank Adaptation Lead to Lower Robustness against Training-Time Attacks?
- Authors: Zi Liang, Haibo Hu, Qingqing Ye, Yaxin Xiao, Ronghua Li
- Status: ICML 2025
- Code: https://github.com/liangzid/LoRA-sSecurity