The Foundations of AI Seminar Series is dedicated to topics of interest in artificial intelligence and machine learning, both empirical and theoretical, as well as related areas. Our goal is for these meetings to serve as a forum for discussion and the rapid dissemination of results. We invite anyone interested in the latest advancements in AI/ML to join us!

Next Seminar


Speaker: Dr. Chengchun Shi
Date: May 20, 2025, 2pm-3pm (BST)
Location: Mathematical Sciences Building, MB0.08, University of Warwick, Coventry, UK

Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Dr. Chengchun Shi

Abstract

Reinforcement learning from human feedback (RLHF) has emerged as a key technique for aligning the output of large language models (LLMs) with human preferences. To learn the reward function, most existing RLHF algorithms use the Bradley-Terry model, which relies on assumptions about human preferences that may not reflect the complexity and variability of real-world judgments. In this paper, we propose a robust algorithm to enhance the performance of existing approaches under such reward model misspecifications. Theoretically, our algorithm reduces the variance of reward and policy estimators, leading to improved regret bounds. Empirical evaluations on LLM benchmark datasets demonstrate that the proposed algorithm consistently outperforms existing methods, with 77-81% of responses being favored over baselines on the Anthropic Helpful and Harmless dataset.


About Dr. Chengchun Shi

Chengchun is an Associate Professor in the Department of Statistics at LSE. He works at the interface of RL, LLMs and statistics, with applications to ride-sharing and healthcare. His work brings to light the relevance and significance of statistical learning in RL, and demonstrates the usefulness of RL as a framework for policy evaluation and A/B testing in two-sided marketplaces. Chengchun has published approximately 70 papers, half of them in prestigious statistical journals (JRSSB, JASA, AoS) and top machine learning venues (NeurIPS, ICML, KDD, JMLR). His contributions have been recognized with awards such as the Peter Gavin Hall IMS Early Career Prize, the IMS Tweedie Award and the Royal Statistical Society Research Prize. He serves as an associate editor of JRSSB, JASA and AoAS, and as a reviewer for a range of machine learning conferences, including NeurIPS, ICML, ICLR, AAAI, AISTATS, and KDD.

Upcoming Events


Robust Reinforcement Learning from Human Feedback for Large Language Models Fine-Tuning

Chengchun Shi - Associate Professor, London School of Economics and Political Science, UK
Date: May 20, 2025 at 14:00
Location: Mathematical Sciences Building, MB0.08, University of Warwick, Coventry, UK

TBD

Yingzhen Li - Senior Lecturer, Imperial College London, UK
Date: May 27, 2025 at 14:00
Location: Mathematical Sciences Building, MB0.08, University of Warwick, Coventry, UK

Differentially private M-estimation via noisy optimization

Po-Ling Loh - Professor, University of Cambridge, UK
Date: May 28, 2025 at 14:00
Location: Mathematical Sciences Building, MB0.08, University of Warwick, Coventry, UK

TBD

Marco Mondelli - Assistant Professor, Institute of Science and Technology, Austria
Date: Jun 03, 2025 at 14:00
Location: Mathematical Sciences Building, MB0.07, University of Warwick, Coventry, UK

TBD

Dr. Krikamol Muandet - Chief Scientist and Tenure-track Faculty (fast track), CISPA
Date: Jun 17, 2025 at 14:00
Location: Department of Computer Science, CS1.01, University of Warwick, Coventry, UK

Organising Team

Fanghui Liu

Assistant Professor, CS Department, University of Warwick

Paris Giampouras

Assistant Professor, CS Department, University of Warwick

Long Tran-Thanh

Professor, CS Department, University of Warwick