The Foundations of AI Seminar Series is dedicated to topics of interest in artificial intelligence, machine learning, both empirically and theoretically, as well as related areas. Our goal is for these meetings to serve as a forum for discussions and quick dissemination of results. We invite anyone interested in the latest advancements in AI/ML to join us!

Next Seminar


Robust and Efficient AI Alignment

Speaker: Ilija Bogunovic (Assistant Professor, UCL, UK) Date: 22-10-2024, 2pm-3pm (BST) Location: Department of Computer Science, CS1.04, University of Warwick, Coventry, UK

Ilija Bogunovic

Download iCalendar File

Abstract

Aligning large language models (LLMs) with human values, ethical principles, and user intentions is critical to building AI systems like ChatGPT that drive transformative, human-centric progress. It is also a key step toward ensuring the sustainable and safe development of artificial general intelligence (AGI) that genuinely benefits humanity. Reinforcement learning from human feedback (RLHF) has become the leading method for fine-tuning LLMs, optimizing their responses by using human preferences to guide their behavior. However, RLHF faces pressing challenges—namely, inefficient data use and an overly simplified approach to the complex, pluralistic nature of human societies. RLHF-tuned models frequently exhibit bias, favoring majority perspectives while overlooking minority voices, highlighting the need for principled algorithms that address these shortcomings. In this talk, we present innovative, efficient, and pluralistic alignment algorithms that outperform standard RLHF methods. Our approach tackles data acquisition inefficiencies, improves bias resilience, and aligns LLMs with a wider range of human perspectives. By integrating RLHF with sequential decision-making frameworks, we significantly boost data efficiency and develop algorithms capable of navigating the complexities of aligning LLMs with diverse societal values. We showcase our algorithms’ effectiveness in fine-tuning popular LLM models.


About Ilija Bogunovic

Ilija Bogunovic is a lecturer at University College London (UCL) since 2022, where he leads a research group focused on the intersection of sequential decision-making and generative models. Prior to joining UCL, Ilija completed his PhD at EPFL and his postdoc at ETH Zurich. His recent achievements include receiving the prestigious EPSRC New Investigator Award and, for the first time at UCL, the Google Research Scholar Award.

Upcoming Events


Speaker Image

Robust and Efficient AI Alignment

Ilija Bogunovic - Assistant Professor, UCL, UK
Calendar Icon Oct 22, 2024 at 2:00PM
Location Icon Department of Computer Science, CS1.04, University of Warwick, Coventry, UK

More Info
Speaker Image

TBD

Ana-Andreea Stoica - Research Group Leader in the Social Foundations of Computation Department, Max Planck Institute, Germany
Calendar Icon Nov 12, 2024 at 12:00PM
Location Icon Zeeman building, MS.03, University of Warwick, Coventry, UK

More Info
Speaker Image

TBD

Patrick Rebeschini - Professor of Statistics and Machine Learning, University of Oxford, UK
Calendar Icon Jan 28, 2025 at 2:00PM
Location Icon Department of Computer Science, CS1.04, University of Warwick, Coventry, UK

More Info
Speaker Image

TBD

Marco Mondelli - Assistant Professor, Institute of Science and Technology, Austria
Calendar Icon Jun 03, 2025 at 14:00PM
Location Icon Mathematical Science Building, MB0.07, University of Warwick, Coventry, UK

More Info

Organising Team

Fanghui Liu

Fanghui Liu

Assistant Professor, CS Department, University of Warwick

Paris Giampouras

Paris Giampouras

Assistant Professor, CS Department, University of Warwick

Long Tran-Thanh

Long Tran-Thanh

Professor, CS Department, University of Warwick