Talks

Building Embodied Intelligence Through Reinforcement Learning

Younggyo Seo

IRB 4105 or https://umd.zoom.us/j/94340703410?pwd=rrXaGSXSpabcMTtDNmeCNf2Ih2fQYE.1

Tuesday, March 4, 2025, 11:00 am-12:00 pm

You are subscribed to this talk through .
You are watching this talk through .
You are subscribed to this talk. (unsubscribe, watch)
You are watching this talk. (unwatch, subscribe)
You are not subscribed to this talk. (watch, subscribe)

Abstract

Impressive successes of large language models are based on a unified pipeline: pre-training on large datasets and fine-tuning with reinforcement learning (RL). However, this “recipe” does not directly work for robotics, because (i) robotics have a data scarcity problem, (ii) deploying robots to interact with the physical world is expensive, and (iii) RL for robotics is not data-efficient. In this talk, I will present my research on addressing these challenges to develop a pipeline towards building embodied intelligence through RL. I will first present data-efficient RL algorithms for robotics, which can solve robotic tasks that were not possible to solve with previous RL algorithms in a reasonable amount of time. Then, I will describe how to effectively train world models from videos so that we can use synthetic data for training robots with RL. Finally, I will describe my research on offline-to-online RL that identifies the challenges of fine-tuning RL agents trained on a narrow data distribution and addresses them. I will conclude by discussing future directions for robustifying this pipeline by incorporating large generative models trained on internet-scale data.

Bio

Younggyo Seo is a postdoctoral scholar at UC Berkeley, working with Pieter Abbeel. He received his Ph.D. from KAIST in 2023 advised by Jinwoo Shin, where he was recognized as AI/CS/EE Rising Stars. His research focuses on reinforcement learning, computer vision, and robotics, with the goal of developing intelligent agents that can continually improve themselves by interacting with the physical world. He also spent time as a research scientist at Dyson, developing reinforcement learning algorithms for robots.

This talk is organized by Samuel Malede Zewdu