Publication

*: indicating equal contribution or alphabetic ordering.

Google Scholar

Sort by:
Publication Image

Enabling Scalable Oversight via Self-Evolving Critic

Zhengyang Tang*, Ziniu Li*, Zhenyang Xiao*, Tian Ding, Ruoyu Sun, Benyou Wang, Dayiheng Liu, Fei Huang, Tianyu Liu, Bowen Yu, Junyang Lin

arXiv:2501.05727

Publication Image

Pruning for Robust Concept Erasing in Diffusion Models

Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu

NeurIPS Workshop on Safe Generative AI, 2024

Publication Image

Mitigating Hallucination in Large Vision-Language Models via Modular Attribution and Intervention

Tianyun Yang, Ziniu Li, Juan Cao, Chang Xu

NeurIPS Workshop on Adaptive Foundation Models, 2024

Publication Image

Unlocking Black-Box Prompt Tuning Efficiency via Zeroth-Order Optimization

Heshen Zhan, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun

The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) (Findings), 2024

Publication Image

Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

Ziniu Li, Congliang Chen, Tian Xu, Zeyu Qin, Jiancong Xiao, Ruoyu Sun, Zhi-Quan Luo

arXiv: 2408.16673
(Oral presentation at NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning)

Publication Image

Sensing Jamming Strategy from Limited Observations: An Imitation Learning Perspective

Youlin Fan, Bo Jiu, Wenqiang Pu, Ziniu Li, Kang Li, Hongwei Liu

IEEE Transactions on Signal Processing (TSP)

Publication Image

Adam-mini: Use Fewer Learning Rates To Gain More

Yushun Zhang, Congliang Chen, Ziniu Li, Tian Ding, Chenwei Wu, Yinyu Ye, Zhi-Quan Luo, Ruoyu Sun

arXiv:2406.16793

Publication Image

BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation

Chengxing Jia, Pengyuan Wang, Ziniu Li, Yi-Chen Li, Zhilong Zhang, Nan Tang, Yang Yu

arXiv:2405.17039

Publication Image

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Ziniu Li, Tian Xu, Yushun Zhang, Zhihang Lin, Yang Yu, Ruoyu Sun, Zhi-Quan Luo

The 41st International Conference on Machine Learning (ICML), 2024

Publication Image

On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization

Jiancong Xiao, Ziniu Li, Xingyu Xie, Emily Getzen, Cong Fang, Qi Long, Weijie J. Su

arXiv:2405.16455

Publication Image

Why Transformers Need Adam: A Hessian Perspective

Yushun Zhang, Congliang Chen, Tian Ding, Ziniu Li, Ruoyu Sun, Zhi-Quan Luo

Conference on Neural Information Processing System (NeurIPS) 38, 2024

Publication Image

When is RL better than DPO in RLHF? A Representation and Optimization Perspective

Ziniu Li*, Tian Xu*, Yang Yu

The 12th International Conference on Learning Representations (ICLR) (Tiny Paper Track), 2024
(Oral presentation, with an early version at arXiv:2312.10584)

Publication Image

Imitation Learning from Imperfection: Theoretical Justifications and Algorithms

Ziniu Li*, Tian Xu*, Zeyu Qin, Yang Yu, Zhi-Quan Luo

Conference on Neural Information Processing System (NeurIPS) 37, 2023
(Spotlight presentation)

Publication Image

Provably Efficient Adversarial Imitation Learning with Unknown Transitions

Tian Xu*, Ziniu Li*, Yang Yu, Zhi-Quan Luo

The 39th Conference on Uncertainty in Artificial Intelligence (UAI), 2023
(Oral presentation, with an early version at arXiv:2106.10424v2)

Publication Image

Deploying Offline Reinforcement Learning with Human Feedback

Ziniu Li, Ke Xu, Liu Liu, Lanqing Li, Deheng Ye, Peilin Zhao

arXiv:2303.07046

Publication Image

Understanding Adversarial Imitation Learning in Small Sample Regime: A Stage-coupled Analysis

Tian Xu*, Ziniu Li*, Yang Yu, Zhi-Quan Luo

arXiv:2208.01899
(The early version of this work is at arXiv:2106.10424v3)

Publication Image

Rethinking ValueDice: Does It Really Improve Performance?

Ziniu Li*, Tian Xu*, Yang Yu, Zhi-Quan Luo

The 10th International Conference on Learning Representations (ICLR) (Blog Track), 2022

Publication Image
Publication Image

HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning

Ziniu Li, Yingru Li, Yushun Zhang, Tong Zhang, Zhi-Quan Luo

The 10th International Conference on Learning Representations (ICLR), 2022
(Oral presentation at Workshop on Ecological Theory of Reinforcement Learning at NeurIPS, 2021)

Publication Image

A Concise Introduction to Imitation Learning (In Chinese)

Tian Xu, Ziniu Li, Yang Yu

Online Available

Publication Image

Error Bounds of Imitating Policies and Environments for Reinforcement Learning

Tian Xu, Ziniu Li, Yang Yu

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021

Publication Image

Error Bounds of Imitating Policies and Environments

Tian Xu, Ziniu Li, Yang Yu

Conference on Neural Information Processing Systems 34 (NeurIPS), 2020

Publication Image

Efficient Exploration by Novelty-pursuit

Ziniu Li*, Xiong-Hui Chen*

The 2nd International Conference on Distributed Artificial Intelligence (DAI), 2020

Publication Image

Self-Guided Evolution Strategies with Historical Estimated Gradients

Fei-yu Liu, Ziniu Li, Chao Qian

The 29th International Conference on Joint Artificial Intelligence (IJCAI), 2020