Simin Chen

Postdoctoral Researcher at Columbia University

Computer Science Department at Columbia University

Biography

I am a postdoctoral researcher in the Computer Science Department at Columbia University, working with Prof. Baishakhi Ray on research related to large language models for code (LLM4Code). I earned my Ph.D. from the University of Texas at Dallas (UTD), where I was fortunate to be advised by Prof. Wei Yang and Prof. Cong Liu. Before joining UTD, I received my master's degree from Tongji University in May 2018. My research interests lie in machine learning, computer security, and program analysis.

Download my resumé.

Interests
  • Machine Learning
  • Computer Security
  • Software Engineering
Education
  • Ph.D., 2019 - 2024

    The University of Texas at Dallas

  • Master, 2015 - 2018

    Tongji University

  • Bachelor, 2011 - 2015

    Tongji University

Publications

(2024). DeciX: Explain Deep Learning Based Code Generation Applications. In ESEC/FSE 2024.

(2024). PPM: Automated Generation of Diverse Programming Problems for Benchmarking Code Generation Models. In ESEC/FSE 2024.

PDF Code

(2023). Dynamic Transformers Provide a False Sense of Efficiency. In ACL 2023.

(2023). The Dark Side of Dynamic Routing Neural Networks: Towards Efficiency Backdoor Injection. In CVPR 2023.

(2023). Sibling-Attack: Rethinking Transferable Adversarial Attacks against Face Recognition. In CVPR 2023.

(2023). DyCL: Dynamic Neural Network Compilation Via Program Rewriting and Graph Optimization. In ISSTA 2023.

(2022). NMTSloth: Understanding and Testing Efficiency Degradation of Neural Machine Translation Systems. In ESEC/FSE 2022.

PDF Code

(2022). Learning to Reverse DNNs from AI Programs Automatically. In IJCAI 2022.

PDF

(2022). NICGSlowDown: Evaluating the Efficiency Robustness of Neural Caption Generation Models. In CVPR 2022.

PDF Code

(2020). DENAS: Automated Rule Generation by Knowledge Extraction from Neural Networks. In ESEC/FSE 2020.

PDF Code DOI

Experience

Research Assistant
Amazon Web Services
May 2023 – Aug 2023 Arlington Area, VA
Applied large language models to the Cedar authorization policy language.
 
Research Assistant
Microsoft Research
May 2021 – Jul 2020 Seattle
Evaluated the model leakage risk of on-device DNNs.
 
Research Assistant
NEC Laboratories America
Jan 2020 – May 2020 New Jersey
Applied ML techniques to program analysis.

Contact