SelfCAD: Protecting Your Efficient Reasoning Capabilities via Self-Cautious Insertion
Preprint 2026
Preprint 2026
ICLR 2026 Trustworthy Workshop (First benchmark for evaluating trustworthiness of language diffusion models)
ICLR 2026 DeLTa Workshop
TPAMI 2026 (Adopted at scale by Anthropic)
TPAMI 2026 (Journal extension of SGM, original paper cited 400+ times on Google Scholar)
arXiv 2025
arXiv 2025 (First to reveal the safety–reasoning capability trade-off)
NeurIPS 2024
ICML 2024 (First backdoor input detection method for diffusion models)
ICML 2024
NeurIPS 2022 (Spotlight, Top 5%) (First work to improve adversarial robustness of ViTs)
NeurIPS 2022 (Spotlight, Top 5%)
SIGKDD 2022
ICASSP 2022