I am a Ph.D. candidate in the Computer Science Department at UCLA, advised by Prof. Cho-Jui Hsieh. Before that, I received my B.Eng. degree from the Department of Computer Science and Technology at Tsinghua University, where I worked with Prof. Minlie Huang.
My research interest is machine learning, with a primary focus on trustworthy machine learning and the robustness of machine learning models. My work includes:
General and scalable formal verification for neural networks: This line of work formally bounds and verifies the output of a neural network (NN) given uncertain inputs from a perturbation region. I developed formal verification methods for general neural-network-based models, from Transformers to general computational graphs and higher-order computational graphs. I am a main developer of the auto_LiRPA library (originally proposed in our NeurIPS 2020 paper) for perturbation analysis and verified bound computation on general computational graphs (including NNs with arbitrary architectures). Based on auto_LiRPA and complete verification algorithms (including branch-and-bound for general models), we also developed a complete verification toolbox named alpha-beta-CROWN, which won the International Verification of Neural Networks Competition (VNN-COMP) for three consecutive years, from 2021 to 2023. While the properties to be verified are often robustness under small perturbations, we have also extended formal verification for NNs to properties beyond robustness, such as monotonicity/fairness, Lyapunov stability of controllers in dynamical systems, and constraints in power systems.
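As a minimal sketch of how auto_LiRPA is typically used to compute verified output bounds (the toy model and perturbation radius below are illustrative placeholders):

```python
import torch
import torch.nn as nn
from auto_LiRPA import BoundedModule, BoundedTensor
from auto_LiRPA.perturbations import PerturbationLpNorm

# A toy classifier; auto_LiRPA accepts general PyTorch computational graphs.
model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))
x = torch.randn(1, 784)  # a concrete input

# Wrap the model and declare an L-infinity ball of radius eps around x.
bounded_model = BoundedModule(model, x)
ptb = PerturbationLpNorm(norm=float("inf"), eps=0.01)
bounded_x = BoundedTensor(x, ptb)

# Lower/upper bounds on every output logit, valid for all inputs in the region.
lb, ub = bounded_model.compute_bounds(x=(bounded_x,), method="CROWN")
```

If the lower bound of the true class exceeds the upper bounds of all other classes, robustness is verified over the entire perturbation region.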
Adversarial robustness in NLP: I proposed methods for generating adversarial examples for NLP models by leveraging modern language models. To generate word-substitution attacks with high-quality synonyms that are compatible with the context, I proposed leveraging a masked BERT model, an instruction-following model such as ChatGPT, or a text-completion model such as LLaMA. I also proposed red-teaming large language model detectors with LLM-generated word-substitution attacks or instructional prompts. More recently, we proposed defending against LLM jailbreaking attacks via backtranslation.
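To illustrate the core idea of using a masked language model to propose context-compatible substitutions, here is a minimal sketch with Hugging Face transformers (the sentence is a made-up example, and this is only the candidate-generation step, not the full attack pipeline):

```python
from transformers import pipeline

# A masked LM scores candidate words in context, so its top predictions for a
# masked position tend to be substitutions that fit the surrounding sentence.
fill_mask = pipeline("fill-mask", model="bert-base-uncased")

sentence = "The film was [MASK] and the audience loved it."
for candidate in fill_mask(sentence, top_k=5):
    print(f"{candidate['token_str']!r}  score={candidate['score']:.3f}")
```

An actual attack would further filter these candidates for synonymy with the original word and keep a substitution only if it flips the victim model's prediction.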
Out-of-distribution robustness: I proposed a new understanding and evaluation of the effective robustness of multimodal pre-trained models, especially CLIP models. I found that CLIP's pre-training data can interfere with previous evaluations of out-of-distribution (OOD) effective robustness rather than genuinely improve effective robustness, and suggested that CLIP is not effectively more robust than traditional models.
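Effective robustness is commonly measured as the gap between a model's OOD accuracy and the OOD accuracy predicted from its in-distribution (ID) accuracy by a trend fit on baseline models, typically after a logit transform. A minimal sketch of that computation, with hypothetical accuracy numbers:

```python
import numpy as np
from scipy.special import logit, expit

# Hypothetical (ID accuracy, OOD accuracy) pairs for baseline models.
baseline_id = np.array([0.70, 0.75, 0.80, 0.85])
baseline_ood = np.array([0.45, 0.52, 0.60, 0.68])

# Fit the ID -> OOD trend in logit space, as in standard
# effective-robustness evaluations.
slope, intercept = np.polyfit(logit(baseline_id), logit(baseline_ood), 1)

def effective_robustness(id_acc, ood_acc):
    """OOD accuracy above (or below) what the baseline trend predicts."""
    predicted_ood = expit(slope * logit(id_acc) + intercept)
    return ood_acc - predicted_ood

print(effective_robustness(0.82, 0.70))  # positive => above the trend line
```

The key question is which models define the baseline trend; choosing baselines whose training data differs from the evaluated model's can distort the measured effective robustness.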
Efficiently training robust neural networks: I developed methods for faster certified robust training based on improved interval bound propagation (IBP).
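A minimal sketch of plain IBP and the certified training loss it enables, assuming a simple feed-forward network of Linear and ReLU layers (this illustrates the standard baseline, not the improved method itself):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def ibp_bounds(layers, x, eps):
    """Propagate interval bounds through Linear/ReLU layers."""
    lb, ub = x - eps, x + eps
    for layer in layers:
        if isinstance(layer, nn.Linear):
            mid, rad = (lb + ub) / 2, (ub - lb) / 2
            mid = F.linear(mid, layer.weight, layer.bias)
            rad = F.linear(rad, layer.weight.abs())  # |W| keeps the radius valid
            lb, ub = mid - rad, mid + rad
        elif isinstance(layer, nn.ReLU):
            lb, ub = lb.clamp(min=0), ub.clamp(min=0)
    return lb, ub

def certified_loss(layers, x, y, eps):
    """Cross-entropy on worst-case logits: the lower bound for the true
    class and upper bounds for all other classes."""
    lb, ub = ibp_bounds(layers, x, eps)
    onehot = F.one_hot(y, lb.size(-1)).bool()
    worst_logits = torch.where(onehot, lb, ub)
    return F.cross_entropy(worst_logits, y)
```

Minimizing this loss pushes the network toward weights whose interval bounds certify correct classification; improvements to certified training largely come from making these bounds tighter and the optimization more stable.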
Teaching Assistant at UCLA: