Faculty Candidate Seminar

Reinforcement Learning from Static Datasets: Algorithms, Analysis and Applications

Aviral KumarPh.D. CandidateUniversity of California, Berkeley
WHERE:
3725 Beyster BuildingMap
SHARE:

Zoom link for remote participants, passcode:  348325

Abstract: Typically, reinforcement learning (RL) methods rely on trial-and-error interaction with the environment from scratch to discover effective behaviors. While this sort of paradigm has the potential to discover good strategies, this paradigm also inhibits RL methods from collecting enough experience or training data in real-world problems where active interaction is expensive (e.g., in drug design) or dangerous (e.g., for robots operating around humans). My work develops approaches to alleviate this limitation: how can we learn policies to effectively make decisions entirely from previously-collected, static datasets in an offline manner? In this talk, I will discuss challenges that appear in this kind of offline reinforcement learning (offline RL), and develop algorithms and techniques to address these challenges. I will then discuss how my approaches for offline RL and decision-making have enabled us to make progress in real-world problems such as hardware accelerator design, robotic manipulation, and computational chemistry. Finally, I will discuss how we can enable offline RL methods to benefit from generalization capabilities offered by large and expressive models, similar to supervised learning.
Bio: Aviral Kumar is a final year Ph.D. student at UC Berkeley, advised by Sergey Levine. His research focuses on developing effective, reliable, and easy-to-use approaches for (sequential) decision-making. Towards this goal, he focuses on designing reinforcement learning techniques to static datasets and on understanding and applying these methods in practice. Before his Ph.D., Aviral obtained his B.Tech. in Computer Science from IIT Bombay in India. He is a recipient of the C.V. & Daulat Ramamoorthy Distinguished Research Award, given to a PhD student in EECS at Berkeley for outstanding contributions to a new area of research in computer science and engineering, Facebook Ph.D. Fellowship in Machine Learning and Apple Scholars in AI/ML Ph.D. Fellowship.

Organizer

Cindy Estell

Student Host

Shengpu Tang

Faculty Host

Wei Hu