Publications (* Equal contribution)

See Google Scholar for a full list.

From Individual to Common: An Early Exploration of Consensus in Non-verifiable Data for Balanced Preference Optimization
Certified Training with Branch-and-Bound for Lyapunov-stable Neural Control
SoundnessBench: A Soundness Benchmark for Neural Network Verifiers
Neural Network Verification with Branch-and-Bound for General Nonlinearities
Lyapunov-stable Neural Control for State and Output Feedback: A Novel Formulation
Defending LLMs against Jailbreaking Attacks via Backtranslation
Red Teaming Language Model Detectors with Language Models
Effective Robustness against Natural Distribution Shifts for Models with Different Training Data
Towards Robustness Certification Against Universal Perturbations
Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation
On the Adversarial Robustness of Vision Transformers
On the Sensitivity and Stability of Model Interpretations in NLP
On the Convergence of Certified Robust Training with Interval Bound Propagation
Robust Text CAPTCHAs Using Adversarial Examples
Fast Certified Robust Training with Short Warmup
Automatic Perturbation Analysis for Scalable Certified Robustness and Beyond
Robustness Verification for Transformers