Ruizhe Shi / Personal Site

See my Google Scholar for a full list.

Understanding the Performance Gap in Preference Learning: A Dichotomy of RLHF and DPO

Ruizhe Shi*, Minhak Song*, Runlong Zhou, Zihan Zhang, Maryam Fazel, Simon S. Du

Working paper, a follow-up of our ICLR paper.
Decoding-Time Language Model Alignment with Multiple Objectives

Ruizhe Shi, Yifang Chen, Yushi Hu, Alisa Liu, Hannaneh Hajishirzi, Noah A. Smith, Simon S. Du

Conference on Neural Information Processing Systems (NeurIPS) 2024
Rethinking Transformers in Solving POMDPs

Chenhao Lu, Ruizhe Shi*, Yuyao Liu*, Kaizhe Hu, Simon S. Du, Huazhe Xu

International Conference on Machine Learning (ICML) 2024

Selected Publications