Speakers
-
Pierre-Luc Bacon: is an Assistant Professor at the University of Montreal's DIRO, a CIFAR AI Chair, a core member of Mila, and affiliated with the Institute for Data Valorization (IVADO). Their research lies at the intersection of theory and application in reinforcement learning, focusing on real-world problems such as HVAC systems and molecular modeling. They work on improving RL through representation learning, neural differential equations, and transformer-based models, with a particular interest in addressing the curse of horizon in long-term planning. Recently, they have been exploring ways to leverage large language models to tackle specification challenges in RL, aiming to make RL systems more aligned and sample-efficient in practice.
-
Martin Riedmiller: is a research director and lead of the Controls team at Google DeepMind, previously serving as a university professor for machine learning and neuroinformatics from 2002 to 2015. His core scientific interest lies in intelligent machines that autonomously learn from scratch, particularly neural networks and their information storage and generalization capabilities. This fascination began with his master thesis on 'Rprop' (1992) and continued through pioneering work spanning neural forecasting for finance, self-driving vehicles, and brain-computer interfaces. His Brainstormers robotic soccer team won the RoboCup World Championship five times while pioneering reinforcement learning, and his algorithms including Neural Fitted Q Iteration and Deep Fitted Q established foundations for modern AI research. His career journey has included roles as a game programmer (1981-1986), computer science professor at German universities, co-founder of an early AI startup, before joining DeepMind in 2015.
-
Martha White: is an Associate Professor of Computing Science at the University of Alberta and a Fellow of Amii, which is one of the top machine learning centres in the world. She holds a Canada CIFAR AI Chair, a Tier 2 Canada Research Chair in Reinforcement Learning and received IEEE's "AIs 10 to Watch: The Future of AI" award in 2020. She has authored more than 70 papers in top journals and conferences. Martha is an associate editor for JMLR and TMLR, has served as co-program chair for ICLR and for RLC and regularly serves as an area chair for top AI conferences. Her research focus is on developing algorithms that learn to adapt continually, with a focus on more sustainable systems.
-
Akhil Bagaria: is a Research Scientist at Amazon, focusing on Reinforcement Learning applications for the Amazon Supply Chain. Previously, he completed his Computer Science PhD at Brown University under George Konidaris in the Intelligent Robot Lab and Brown BigAI initiative, including a summer internship with David Silver's RL team at DeepMind in 2022. His research centers on creating general-purpose agents that can learn from raw sensorimotor data, with particular emphasis on exploration and skill discovery problems. Before his doctoral studies, Akhil graduated from Harvey Mudd College in 2016, where he worked in the Lab for Autonomous and Intelligent Robotics under Professor Chris Clark. His professional experience also includes two years at Apple in Cupertino as part of the Multitouch Algorithms team, where he worked under the guidance of Nicole Wells and Wayne Westerman.
-
Jorge Montalvo: Jorge Montalvo is a Research engineer at Centrica, focusing on developing demand response solutions using state of the art machine learning models and operations research. He is also a core team member of the Climate Change AI organization where he manages the Power Systems Working Group.