Yunhai Han

Welcome! My name is Yunhai Han and I am a Robotics PhD student at Georgia Institute of Technology, advised under Prof. Harish Ravichandar. My research focus at Gatech is about structured robot learning. I am also honoured to be an awardee of Robotics PhD fellowship from Georgia Tech's Institute for Robotics and Intelligent Machines (IRIM). Before coming to Georgia Tech, I received my M.S. / B.S. in Mechanical Engineering from UCSD, 2021 and Yanshan University, 2019, respectively. I'm open to discussions and collaborations, so feel free to drop me an email if you are interested.

Email  /  CV(Jan 2025)  /  Github  /  Google Scholar  /  Linkdin

profile photo
Research

I have a broad interest in robotics, especially robot learning for manipulation tasks under complex environments, i.e., at home. I enjoy developing algorithms for robotic applications and I would like to explore how robots could learn and perceive the world to achieve high-level automation skills in dynamical environments.

My love to robotics stems from a famous Japanese anime series:Code Geass(コードギアス 反逆のルルーシュ). I was dreaming of building a macha when I was a high school student.

Mentored students

Kelin Yu (CS M.S., 2022-2024, now PhD at UMD CS)
Zhenyang Chen (Robotics M.S., 2023-2024, now PhD at GT Robotics)
Hanyao Guo (Robotics M.S., 2024-)
Pratik Shah (CS M.S., 2024-)
Linhao Bai (CS M.S., 2024-)
Yuqian Zheng (CSE M.S., 2024-)

Publication
      2024
On the Surprising Effectiveness of Spectrum Clipping in Learning Stable Linear Dynamics
Hanyao Guo*, Yunhai Han*, Harish Ravichandar (* Equal contribution; Guo is my master student advisee)
Under review by L4DC 2025
When learning stable linear dynamical systems from data, three important properties are desirable: i) predictive accuracy, ii) provable stability, and iii) computational efficiency. Unconstrained minimization of reconstruction errors leads to high accuracy and efficiency but cannot guarantee stability. Existing methods to remedy this focus on enforcing stability while also ensuring accuracy, but do so only at the cost of increased computation. In this work, we investigate if a straightforward approach can simultaneously offer all three desiderata of learning stable linear systems. Specifically, we consider a post-hoc approach that manipulates the spectrum of the learned system matrix after it is learned in an unconstrained fashion. We call this approach spectrum clipping (SC) as it involves eigen decomposition and subsequent reconstruction of the system matrix after clipping all of its eigenvalues that are larger than one to one (without altering the eigenvectors). Through detailed experiments involving two different applications and publicly available benchmark datasets, we demonstrate that this simple technique can simultaneously learn highly accurate linear systems that are provably stable. Notably, we demonstrate that SC can achieve similar or better performance than strong baselines while being orders-of-magnitude faster. We also show that SC can be readily combined with Koopman operators to learn stable nonlinear dynamics, such as those underlying complex dexterous manipulation skills involving multi-fingered robotic hands.
[Arxiv][Webpage]

AsymDex: Leveraging Asymmetry and Relative Motion in Learning Bimanual Dexterity
Zhaodong Yang, Yunhai Han, Harish Ravichandar
Poster at CoRL Whole-body Control and Bimanual Manipulation Workshop, 2024
We present Asymmetric Dexterity (AsymDex), a novel reinforcement learning (RL) framework that can efficiently learn asymmetric bimanual skills for multi-fingered hands without relying on demonstrations, which can be cumbersome to collect. Two crucial ingredients enable AsymDex to reduce the observation and action space dimensions and improve sample efficiency. First, AsymDex leverages the natural asymmetry found in human bimanual manipulation and assigns specific and interdependent roles to each hand: a facilitating hand that moves and reorients the object, and a dominant hand that performs complex manipulations on said object. Second, AsymDex defines and operates over relative observation and action spaces, facilitating responsive coordination between the two hands. Further, AsymDex can be easily integrated with recent advances in grasp learning to handle both the object acquisition phase and the interaction phase of bimanual dexterity. Unlike existing RL-based methods for bimanual dexterity, which are tailored to a specific task, AsymDex can be used to learn a wide variety of bimanual tasks that exhibit asymmetry. Detailed experiments on four simulated asymmetric bimanual dexterous manipulation tasks reveal that AsymDex consistently outperforms strong baselines that challenge its design choices, in terms of success rate and sample efficiency.
[OpenReview][Webpage]

Learning Koopman Dynamics for Safe Legged Locomotion with Reinforcement Learning-based Controller
Jeonghwan Kim, Yunhai Han, Harish Ravichandar, Sehoon Ha
Under review by ICRA 2025
Learning-based algorithms have demonstrated impressive performance in agile locomotion of legged robots. However, learned policies are often complex and opaque due to the black-box nature of learning algorithms, which hinders predictability and precludes guarantees on performance or safety. In this work, we develop a novel safe navigation framework that combines Koopman operators and model-predictive control (MPC) frameworks. Our method adopts Koopman operator theory to learn the linear evolution of dynamics of the underlying locomotion policy, which can be effectively learned with Dynamic Mode Decomposition (DMD). Given that our learned model is linear, we can readily leverage the standard MPC algorithm. Our framework is easy to implement with less prior knowledge because it does not require access to the underlying dynamical systems or control-theoretic techniques. We demonstrate that the learned linear dynamics can better predict the trajectories of legged robots than baselines. In addition, we showcase that the proposed navigation framework can achieve better safety with less collisions in challenging and dense environments with narrow passages.
[arXiv]

KOROL: Learning Visualizable Object Feature with Koopman Operator Rollout for Manipulation
Hongyi Chen, Abulikemu Abuduweili*, Aviral Agrawal*, Yunhai Han*, Harish Ravichandar, Changliu Liu, and Jeffrey Ichnowski (* Equal contribution)
Accepted by CoRL 2024
Learning dexterous manipulation skills presents significant challenges due to complex nonlinear dynamics that underlie the interactions between objects and multi-fingered hands. Koopman operators have emerged as a robust method for modeling such nonlinear dynamics within a linear framework. However, current methods rely on runtime access to ground-truth (GT) object states, making them unsuitable for vision-based practical applications. Unlike image-to-action policies that implicitly learn visual features for control, we use a dynamics model, specifically the Koopman operator, to learn visually interpretable object features critical for robotic manipulation within a scene. We construct a Koopman operator using object features predicted by a feature extractor and utilize it to auto-regressively advance system states. We train the feature extractor to embed scene information into object features, thereby enabling the accurate propagation of robot trajectories. We evaluate our approach on simulated and real-world robot tasks, with results showing that it outperformed the model-based imitation learning NDP by 1.08× and the image-to-action Diffusion Policy by 1.16×.
[arXiv][OpenReview][Code]

Learning Prehensile Dexterity by Imitating and Emulating State-only Observations
Yunhai Han, Zhenyang Chen, Kyle A Williams, and Harish Ravichandar
Accepted by RA-L
When human acquire physical skills (e.g., tennis) from experts, we tend to first learn from merely observing the expert. But this is often insufficient. We then engage in practice, where we try to emulate the expert and ensure that our actions produce similar effects on our environment. Inspired by this observation, we introduce Combining IMitation and Emulation for Motion Refinement (CIMER) -- a two-stage framework to learn dexterous prehensile manipulation skills from state-only observations. CIMER's first stage involves imitation: simultaneously encode the complex interdependent motions of the robot hand and the object in a structured dynamical system. This results in a reactive motion generation policy that provides a reasonable motion prior, but lacks the ability to reason about contact effects due to the lack of action labels. The second stage involves emulation: learn a motion refinement policy via reinforcement that adjusts the robot hand's motion prior such that the desired object motion is reenacted. CIMER is both task-agnostic (no task-specific reward design or shaping) and intervention-free (no additional teleoperated or labeled demonstrations).
[arXiv][Webpage][Code]

MimicTouch: Leveraging Multi-modal Human Tactile Demonstrations for Contact-rich Manipulation
Kelin Yu*, Yunhai Han*, Qixian Wang, Vaibhav Saxena, Danfei Xu, and Ye Zhao (* Equal contribution; Yu is my master student advisee)
Accepted by CoRL 2024 & Best Paper at NeurIPS Touch Processing Workshop & Poster at CoRL Deployable Workshop, 2023
Tactile sensing is critical to fine-grained, contact-rich manipulation tasks, such as insertion and assembly. An effective strategy to learn tactile-guided policy is to train neural networks that map tactile signal to control based on teleoperated demonstration data. However, to provide the teleoperated demonstration, human users often rely on visual feedback to control the robot. This creates a gap between the sensing modality used for controlling the robot (visual) and the modality of interest (tactile). To bridge this gap, we introduce MimicTouch, a novel framework for learning policies directly from demonstrations provided by human users with their hands. The key innovations are i) a human tactile data collection system which collects multi-modal tactile dataset for learning human's tactile-guided control strategy, ii) an imitation learning-based framework for learning human's tactile-guided control strategy through such data, and iii) an online residual RL framework to bridge the embodiment gap between the human hand and the robot gripper.
[arXiv][OpenReview][Webpage]

      2023
On the Utility of Koopman Operator Theory in Learning Dexterous Manipulation Skills
Yunhai Han, Mandy Xie, Ye Zhao, and Harish Ravichandar
Accepted by CoRL 2023 (Oral)
This work enables to learn dexterous manipulation skills from a few pieces of demonstration data. Dexterous manipulation is known to be very complex due to the absence of well-studied analytical models and high DoFs. Most existing work addresses this challenge via training a Reinforcement learning (RL) agent with fine-tuned shaped rewards. Motivated by the fact that complex nonlinear dynamics underlie dexterous manipulation and Koopman operators are simple yet powerful control-theoretic structures that help represent complex nonlinear dynamics as linear systems in higher-dimensional spaces, we propose a Koopman operator-based framework (KODex) and demonstrate that it is surprisingly effective for learning dexterous manipulation tasks and offers a number of unique benefits.
[arXiv][OpenReview][Webpage][Code]

      2022
Leveraging Heterogeneous Capabilities in Multi-Agent Systems for Environmental Conflict Resolution
Michael E. Cao⋆, Jonas Warnke⋆, Yunhai Han, Xinpei Ni, Ye Zhao, and Samuel Coogan (* Equal contributions)
Accepted by SSRR, 2022
This work introduces a high-level controller synthesis framework that enables teams of heterogeneous agents to assist each other in resolving environmental conflicts that appear at runtime. This conflict resolution method is built upon temporal-logic-based reactive synthesis to guarantee safety and task completion under specific environment assumptions. Additionally, we implement the proposed framework on a physical multi-agent robotic system to demonstrate its viability for real world applications.
[arXiv][Video]

      2021
A Numerical Verification Framework for Differential Privacy in Estimation
Yunhai Han and Sonia Martínez
Accepted by L-CSS and ACC, 2022
This work proposes a numerical method to verify differential privacy in estimation with performance guarantees. To achieve differential privacy, a mechanism (estimator) is turned into a stochastic mapping; which makes it hard to distinguish outputs produced by close inputs. While obtaining theoretical conditions that guarantee privacy may be possible, verifying these in practice can be hard.
[arXiv][slides]

Autonomous Vehicles for Micro-mobility
Henrik Christensen, David Paz, Hengyuan Zhang, Dominique Meyer, Hao Xiang, Yunhai Han, Yuhan Liu, Andrew Liang, Zheng Zhong, Shiqi Tang
Accepted by Autonomous Intelligent System, Springer, 2021
My teammates at AVL lab and I warpped up the technical details when we explored and developed robust autonomous car systems and architectures for mail-delivery and micro-transit applications on UCSD campus. In this paper, we first tell the details of the initial design experiments and the main lessons/short-comings from the deployments. Then, we present the various approaches to address these challenges. Finally, we outline our future work. My work for this journal paper mainly involves the auto-calibration system for urban autonomous driving.
[Springer]

Learning Generalizable Vision-Tactile Robotic Grasping Strategy for Deformable Objects via Transformer
Yunhai Han*, Kelin Yu*, Rahul Batra, Nathan Boyd, Chaitanya Mehta, Tuo Zhao, Yu She, Seth Hutchinson, and Ye Zhao
Accepted by T-MECH
Reliable robotic grasping with deformable objects remains a challenging task due to underactuated contact interactions with a gripper, unknown object dynamics, and variable object geometries. In this study, we propose a Transformer-based robot grasping framework for rigid grippers that leverage tactile information from a GelSight sensor for safe object grasping. The Transformer network learns physical feature embeddings from visual & tactile feedback and predict a final grasp through a multilayer perceptron (MLP) with grasping strength. Using these predictions, the gripper is commanded with an optimal force for safe grasping tasks.
[arXiv] [Webpage] [Video]

      2020
A 2D Surgical Simulation Framework for Tool-Tissue Interaction
Yunhai Han, Fei Liu and Michael C. Yip
Spotlight presentation at IROS Workshop, 2020
This framework continuously tracks the motion of manipulator and simulates the tissue deformation in presence of collision detection. The deformation energy can be computed for the control and planning task.
[arXiv] [slides][Talk]

Auto-calibration Method Using Stop Signs for Urban Autonomous Driving Applications
Yunhai Han*, Yuhan Liu*, David, paz and Henrik Christensen (* Equal contributions)
Accepted by ICRA, 2021
For use of cameras on an intelligent vehicle,driving over a major bump could challenge the calibration. It isthen of interest to do dynamic calibration. What structures canbe used for calibration? How about using traffic signs that yourecognize? In this paper an approach is presented for dynamic camera calibration based on recognition of stop signs.
[arXiv] [Dataset]

Real-to-Sim Registration of Deformable Soft Tissue with Position-Based Dynamics for Surgical Robot Autonomy
Fei Liu*, Zihan Li*, Yunhai Han, Jingpei Lu, Florian Richter and Michael C. Yip (* Equal contributions)
Accepted by ICRA, 2021
Autonomy in robotic surgery is very challenging in unstructured environments, especially when interacting with soft tissues. In this work, we propose an online, continuous, registration method to bridge from 3D visual perception to position-based dynamics modeling of tissues.
[arXiv][Video]

Group Projects

RoboMaster The RoboMaster program is a platform for robotic competitions and academic exchange founded by Da-Jiang Innovations (DJI). Each team has to design and build a squad of multiple-purpose robots for skirmish combats. I was the vision group leader of YSU Eagle. My group were mainly responsible for the system design of visual components (including object tracking, range estimation and serial port communication) and the PID stability adjustment of the gimbal unit on the mobile tank (to prevent bumps and collisions during movement).
[RoboMaster Overview][Our robots]



Professional Service

GT RoboGrads
Research Vice President

ICRA-2021
Program Committee Member(Reviewer)

AIM-2021
Program Committee Member(Reviewer)

ICRA-2022
Program Committee Member(Reviewer)

IROS-2022
Program Committee Member(Reviewer)

ACC-2022
Program Committee Member(Session Chair)

SSRR-2022
Program Committee Member(Reviewer)

ICRA-2023
Program Committee Member(Reviewer)

Teaching Experience

MAE145, Robotic Planning & Estimation, UCSD
21Winter, Teaching Assistant

MAE146, Introduction to ML Algorithms, UCSD
21Spring, Teaching Assistant

Working Experience

Research Assistant, Georgia Institute of Technology
21Summer ~ 22Spring

Misc

Running: Once finished half-marathon (21km) within 2 hours
Soccer and Badminton
Photograpy
Gaming: League of Legend
Anime, Manga and Movie


Template from this handsome guy.