Are Smarter LLMs Safer? Exploring Safety-Reasoning Trade-offs in Prompting and Fine-Tuning

Published in arXiv 2025 (First to reveal the safety–reasoning capability trade-off), 2025

This preprint explores safety-reasoning trade-offs in prompting and fine-tuning of large language models.

Ang Li, Yichuan Mo, Mingjie Li, Yifei Wang, and Yisen Wang. (2025). "Are Smarter LLMs Safer? Exploring Safety-Reasoning Trade-offs in Prompting and Fine-Tuning." arXiv preprint arXiv:2502.09673.