Posts by Collection

portfolio

publications

Multi-Task Learning Improves Synthetic Speech Detection

Published in ICASSP 2022, 2022

This paper studies synthetic speech detection through the lens of multi-task learning.

Recommended citation: Yichuan Mo, and Shilin Wang. (2022). "Multi-Task Learning Improves Synthetic Speech Detection." ICASSP 2022.

When Adversarial Training Meets Vision Transformers: Recipes from Training to Architecture

Published in NeurIPS 2022 (Spotlight, Top 5%) (First work to improve adversarial robustness of ViTs), 2022

Spotlight paper at NeurIPS 2022. Professor John Hopcroft described this as the first work to improve adversarial robustness for vision transformers.

Recommended citation: Yichuan Mo, Dongxian Wu, Yifei Wang, Yiwen Guo, and Yisen Wang. (2022). "When Adversarial Training Meets Vision Transformers: Recipes from Training to Architecture." NeurIPS 2022.

TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors

Published in ICML 2024 (First backdoor input detection method for diffusion models), 2024

This paper introduces a unified framework for detecting and mitigating backdoors in diffusion models.

Recommended citation: Yichuan Mo, Hui Huang, Mingjie Li, Ang Li, and Yisen Wang. (2024). "TERD: A Unified Framework for Safeguarding Diffusion Models Against Backdoors." ICML 2024.

Fight Back Against Jailbreaking via Prompt Adversarial Tuning

Published in NeurIPS 2024, 2024

This paper proposes prompt adversarial tuning as a defense against jailbreak attacks.

Recommended citation: Yichuan Mo, Yuji Wang, Zeming Wei, and Yisen Wang. (2024). "Fight Back Against Jailbreaking via Prompt Adversarial Tuning." NeurIPS 2024.

Are Smarter LLMs Safer? Exploring Safety-Reasoning Trade-offs in Prompting and Fine-Tuning

Published in arXiv 2025 (First to reveal the safety–reasoning capability trade-off), 2025

This preprint explores safety-reasoning trade-offs in prompting and fine-tuning of large language models.

Recommended citation: Ang Li, Yichuan Mo, Mingjie Li, Yifei Wang, and Yisen Wang. (2025). "Are Smarter LLMs Safer? Exploring Safety-Reasoning Trade-offs in Prompting and Fine-Tuning." arXiv preprint arXiv:2502.09673.

On the Adversarial Transferability of Generalized “Skip Connections”

Published in TPAMI 2026 (Journal extension of SGM, original paper cited 400+ times on Google Scholar), 2026

This journal article studies adversarial transferability through the lens of generalized skip connections.

Recommended citation: Yisen Wang, Yichuan Mo, Dongxian Wu, Mingjie Li, Xingjun Ma, and Zhouchen Lin. (2026). "On the Adversarial Transferability of Generalized "Skip Connections"." TPAMI 2026.

Decoding Large Language Diffusion Models with Foreseeing Movement

Published in ICLR 2026 DeLTa Workshop, 2026

This workshop paper studies how to decode large language diffusion models with foreseeing movement.

Recommended citation: Yichuan Mo, Quan Chen, Mingjie Li, Zeming Wei, and Yisen Wang. (2026). "Decoding Large Language Diffusion Models with Foreseeing Movement." ICLR 2026 DeLTa Workshop.

TrustLDM: Benchmarking Trustworthiness in Language Diffusion Model

Published in ICLR 2026 Trustworthy Workshop (First benchmark for evaluating trustworthiness of language diffusion models), 2026

This workshop paper benchmarks trustworthiness in language diffusion model.

Recommended citation: Yichuan Mo, Yukun Jiang, Yanbo Shi, Mingjie Li, Michael Backes, Yang Zhang, and Yisen Wang. (2026). "TrustLDM: Benchmarking Trustworthiness in Language Diffusion Model." ICLR 2026 Trustworthy Workshop.

talks

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.