Seungho Baek

I'm an M.S. student at Sungkyunkwan University (SKKU), advised by Prof. Yusung Kim. My research focuses on reinforcement learning (RL), specifically offline RL and hierarchical RL. My early work aimed to improve an agent's long-horizon reasoning and stitching ability by formulating subgoal selection as a graph search problem rather than learning an explicit high-level policy. I also developed a framework that leverages temporal distance representations to improve offline hierarchical RL, particularly in settings with sparse rewards and suboptimal data. Recently, I've expanded into online RL, investigating how to enable efficient policy learning from a small amount of expert data combined with dense rewards shaped by a temporal distance representation.

Ultimately, my goal is to address the following key challenges in offline RL:

Research Keywords: Offline RL · Unsupervised RL · Goal-Conditioned RL · Hierarchical RL
GitHub · Google Scholar · YouTube · Facebook · Instagram
Email (academic) · Email (personal)