I am an assistant professor at UC San Diego in the ECE department. I am affiliated with the CSE department, Center for Visual Computing, Contextual Robotics Institute, and Artificial Intelligence Group. I am a member of the Robotics team in the TILOS NSF AI Institute.
I was a postdoctoral fellow at UC Berkeley with Alexei Efros and Trevor Darrell. I received a Ph.D. in robotics from Carnegie Mellon University, where I worked with Abhinav Gupta. Here is my PhD Thesis.
I am hiring a postdoc! There are PhD openings for Fall 2024 (Visual Representation Learning, 3D Vision, Generative AI, Robotics, Robot Learning).
PhD applicants can apply through both the CSE and ECE departments. For applications to the ECE department, please apply to the ISRC/SIP track.
I am also taking self-motivated PhD, master's, and undergraduate interns starting Fall 2023.
Our group has broad interests across Computer Vision, Machine Learning, and Robotics. Our focus is on learning 3D and dynamics representations from videos and physical robotic interaction data. We explore various sources of supervision: the data itself, language, and common sense knowledge. We leverage these comprehensive representations to facilitate the learning of robot skills, with the goal of enabling robots to generalize and interact effectively with a wide range of objects and environments in the real physical world. Please check out our individual research topics: Self-Supervised Learning, Video Understanding, Common Sense Reasoning, RL and Robotics, 3D Interaction, Dexterous Hand.
I gave a talk at the TTI/Vanguard: [next] Workshop on Human-Centric Robot Learning (2023, Dec).
I gave a talk at the UPenn GRASP seminar on Generalizable Geometric Robot Learning (2023, Nov).
I gave a talk at the UC San Diego Contextual Robotics Institute's forum (2023, Nov).
I gave a talk at the CoRL 2023 Deployable Workshop on Generalizable Geometric Robot Learning (2023, Oct).
I gave a talk at the NSF SAIL Workshop on Human-Centric Robot Learning (2023, Oct).
I gave a talk at the Workshop on 3D Scene Understanding for Vision, Graphics, and Robotics in CVPR 2023 on 3D Scene Understanding for Locomotion Control.
I gave a talk at the Workshop on 3D Vision and Robotics in CVPR 2023 on Geometric Robot Learning for Generalizable Dexterous Manipulation.
I gave talks at the MIT Embodied Intelligence Seminar, the Stanford SVL Seminar, and UC Berkeley on Geometric Robot Learning for Generalizable Skills Acquisition.
I gave a talk at the Workshop on Neural Fields across Fields in ICLR 2023, on Generalization in Neural Radiance Fields. Also see our recent releases on Generalizable NeRFs: TUVF, ActorsNeRF, FeatureNeRF, MonoNeRF.
I am co-organizing the Workshop on Learning Dexterous Manipulation in RSS 2023.
I am co-organizing the Workshop on 4D Hand Object Interaction in CVPR 2023.
I am co-organizing the Tutorial on Building and Working in Environments for Embodied AI in CVPR 2022. Here is the code base for the tutorial.
I gave a talk at the Workshop on Generalizable Policy Learning in the Physical World in ICLR 2022, on Generalizing Dexterous Manipulation by Learning from Humans.
I gave a talk at the Tutorial on Large Scale Holistic Video Understanding in ICCV 2021, on Learning to Perceive Videos for Embodiment.
I gave a talk at the Large-scale Video Object Segmentation Challenge Workshop in CVPR 2021, on Self-Supervised Representation Learning with Videos.
I am co-organizing the 3rd Tutorial on Learning Representations via Graph-structured Networks in CVPR 2021. Here is the recorded video.
I am co-organizing the Comprehensive Tutorial on Video Modeling in CVPR 2021.
I gave a talk at Nvidia on Self-Supervised Learning. Here is the recorded video.
I am co-organizing the Workshop on Sensing, Understanding and Synthesizing Humans in ECCV 2020.
I am co-organizing the 2nd Tutorial on Learning Representations via Graph-structured Networks in CVPR 2020. Here is the recorded video.
I am co-organizing the Tutorial on Learning Representations via Graph-structured Networks in CVPR 2019.
I am co-organizing the Workshop on Multi-Modal Learning from Videos in CVPR 2019.