LoRA Security: The "Achilles' Heel" of Parameter-Efficient Fine-Tuning?

1. Background: LoRA Is Everywhere

LoRA (Low-Rank Adaptation) is now the de facto standard for fine-tuning LLMs—add a few low-rank matrices, keep the original weights frozen, and the model adapts to new tasks. It is extremely efficient.

But nobody had systematically studied: how secure is a LoRA-fine-tuned model?

2. Our Analytical Framework: An NTK Perspective

We used NTK (Neural Tangent Kernel) to model the kernel-level differences between LoRA and full fine-tuning. The beauty of NTK is that it approximates training as kernel regression, enabling analytical analysis of attack effects.

3. Two Counterintuitive Findings

Against untargeted poisoning: LoRA is more vulnerable. Reason: LoRA's low-rank constraint limits the model's ability to "correct" the poisoning's influence. Full fine-tuning has more degrees of freedom to "overwrite" anomalous signals introduced by poisoning.
Against backdoor attacks: LoRA is actually more robust. Reason: LoRA's low-rank nature makes backdoor trigger patterns hard to "survive" in the low-rank subspace. Backdoors are inherently high-rank signals—LoRA's constraint ironically weakens them.

We also found that LoRA's rank and initialization variance significantly affect robustness—higher rank helps, but initialization effects are non-monotonic.

4. Paper Info

Title: Does Low Rank Adaptation Lead to Lower Robustness against Training-Time Attacks?
Authors: Zi Liang, Haibo Hu, Qingqing Ye, Yaxin Xiao, Ronghua Li
Status: ICML 2025
Code: https://github.com/liangzid/LoRA-sSecurity