Selected Papers
Shiqi Yang, Minghuan Liu, Yuzhe Qin, Runyu Ding, Jialong Li, Xuxin Cheng, Ruihan Yang, Sha Yi, Xiaolong Wang.
ACE: A Cross-Platform Visual-Exoskeletons System for Low-Cost Dexterous Teleoperation.
Conference on Robot Learning (CoRL), 2024.
Jun Wang*, Ying Yuan*, Haichuan Che*, Haozhi Qi*, Yi Ma, Jitendra Malik, Xiaolong Wang.
Lessons from Learning to Spin “Pens”.
Conference on Robot Learning (CoRL), 2024.
Xuxin Cheng*, Jialong Li*, Shiqi Yang, Ge Yang, Xiaolong Wang.
Open-TeleVision: Teleoperation with Immersive Active Visual Feedback.
Conference on Robot Learning (CoRL), 2024.
Minghuan Liu*, Zixuan Chen*, Xuxin Cheng, Yandong Ji, Ruihan Yang, Xiaolong Wang.
Visual Whole-Body Control for Legged Loco-Manipulation.
Conference on Robot Learning (CoRL), 2024.
Oral Presentation
Adrian Remonda, Nicklas Hansen, Ayoub Raji, Nicola Musiu, Marko Bertogna, Eduardo E. Veas, Xiaolong Wang.
A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data.
Conference on Neural Information Processing Systems (NeurIPS), 2024.
An-Chieh Cheng, Hongxu Yin, Yang Fu, Qiushan Guo, Ruihan Yang, Jan Kautz, Xiaolong Wang, Sifei Liu.
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models.
Conference on Neural Information Processing Systems (NeurIPS), 2024.
Yu Sun*, Xinhao Li*, Karan Dalal*, Jiarui Xu, Arjun Vikram, Genghan Zhang, Yann Dubois, Xinlei Chen†, Xiaolong Wang†, Sanmi Koyejo†, Tatsunori Hashimoto†, Carlos Guestrin†.
Learning to (Learn at Test Time): RNNs with Expressive Hidden States.
arXiv, 2024.
Runyu Ding*, Yuzhe Qin*, Jiyue Zhu*, Chengzhe Jia, Shiqi Yang, Ruihan Yang, Xiaojuan Qi, Xiaolong Wang.
Bunny-VisionPro: Real-Time Bimanual Dexterous Teleoperation for Imitation Learning.
arXiv, 2024.
Isabella Liu, Hao Su†, Xiaolong Wang†.
Dynamic Gaussians Mesh: Consistent Mesh Reconstruction from Monocular Videos.
arXiv, 2024.
Ri-Zhao Qiu, Ge Yang, Weijia Zeng, Xiaolong Wang.
Feature Splatting: Language-Driven Physics-Based Scene Synthesis and Editing.
European Conference on Computer Vision (ECCV), 2024.
Jiteng Mu, Michaël Gharbi, Richard Zhang, Eli Shechtman, Nuno Vasconcelos, Xiaolong Wang, Taesung Park.
Editable Image Elements for Controllable Synthesis.
European Conference on Computer Vision (ECCV), 2024.
Ri-Zhao Qiu, Yafei Hu, Ge Yang, Yuchen Song, Yang Fu, Jianglong Ye, Jiteng Mu, Ruihan Yang, Nikolay Atanasov, Sebastian Scherer, Xiaolong Wang.
Learning Generalizable Feature Fields for Mobile Manipulation.
arXiv, 2024.
Xuxin Cheng*, Yandong Ji*, Junming Chen, Ruihan Yang, Ge Yang, Xiaolong Wang.
Expressive Whole-Body Control for Humanoid Robots.
Robotics: Science and Systems (RSS), 2024.
Ruihan Yang, Yejin Kim, Aniruddha Kembhavi, Xiaolong Wang, Kiana Ehsani.
Harmonic Mobile Manipulation.
International Conference on Intelligent Robots and Systems (IROS), 2024.
Oral Presentation
Best Paper Award on Mobile Manipulation
Yinbo Chen, Oliver Wang, Richard Zhang, Eli Shechtman, Xiaolong Wang†, Michael Gharbi†.
Image Neural Field Diffusion Models.
Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Highlight
Yang Fu, Sifei Liu, Amey Kulkarni, Jan Kautz, Alexei A. Efros, Xiaolong Wang.
COLMAP-Free 3D Gaussian Splatting.
Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Highlight
Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid.
Pixel Aligned Language Models.
Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Ying Yuan*, Haichuan Che*, Yuzhe Qin*, Binghao Huang, Zhao-Heng Yin, Kang-Won Lee, Yi Wu, Soo-Chul Lim, Xiaolong Wang.
Robot Synesthesia: In-Hand Manipulation with Visuotactile Sensing.
International Conference on Robotics and Automation (ICRA), 2024.
Nicklas Hansen, Hao Su*, Xiaolong Wang*.
TD-MPC2: Scalable, Robust World Models for Continuous Control.
International Conference on Learning Representations (ICLR), 2024.
Spotlight
Binghao Huang*, Yuanpei Chen*, Tianyu Wang, Yuzhe Qin, Yaodong Yang, Nikolay Atanasov, Xiaolong Wang.
Dynamic Handover: Throw and Catch with Bimanual Hands .
Conference on Robot Learning (CoRL), 2023.
Yanjie Ze*, Ge Yan*, Yueh-Hua Wu*, Annabella Macaluso, Yuying Ge, Jianglong Ye, Nicklas Hansen, Li Erran Li, Xiaolong Wang.
GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields.
Conference on Robot Learning (CoRL), 2023.
Oral Presentation
Yunhai Feng*, Nicklas Hansen*, Ziyan Xiong*, Chandramouli Rajagopalan, Xiaolong Wang.
Finetuning Offline World Models in the Real World.
Conference on Robot Learning (CoRL), 2023.
Oral Presentation
Yuzhe Qin, Wei Yang, Binghao Huang, Karl Van Wyk, Hao Su, Xiaolong Wang, Yu-Wei Chao, Dieter Fox.
AnyTeleop: A General Vision-Based Dexterous Robot Arm-Hand Teleoperation System.
Robotics: Science and Systems (RSS), 2023.
Jianglong Ye, Naiyan Wang, Xiaolong Wang.
FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models.
International Conference on Computer Vision (ICCV), 2023.
Zhao-Heng Yin*, Binghao Huang*, Yuzhe Qin, Qifeng Chen, Xiaolong Wang.
Rotating without Seeing: Towards In-hand Dexterity through Touch.
Robotics: Science and Systems (RSS), 2023.
Ruihan Yang, Ge Yang, Xiaolong Wang.
Neural Volumetric Memory for Visual Locomotion Control.
Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
Highlight
Jiarui Xu, Sifei Liu*, Arash Vahdat*, Wonmin Byeon, Xiaolong Wang, Shalini De Mello.
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models.
Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
Highlight
Yuzhe Qin*, Binghao Huang*, Zhao-Heng Yin, Hao Su, Xiaolong Wang.
DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation.
Conference on Robot Learning (CoRL), 2022.
Sateesh Kumar, Jonathan Zamora*, Nicklas Hansen*, Rishabh Jangir, Xiaolong Wang.
Graph Inverse Reinforcement Learning from Diverse Videos.
Conference on Robot Learning (CoRL), 2022.
Oral Presentation
Yang Fu, Xiaolong Wang.
Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset.
Conference on Neural Information Processing Systems (NeurIPS), 2022.
Yuzhe Qin*, Yueh-Hua Wu*, Shaowei Liu, Hanwen Jiang, Ruihan Yang, Yang Fu, Xiaolong Wang.
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos.
European Conference on Computer Vision (ECCV), 2022.
Yuzhe Qin, Hao Su*, Xiaolong Wang*.
From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation.
Robotics and Automation Letters (RA-L), 2022.
International Conference on Intelligent Robots and Systems (IROS), 2022.
Nicklas Hansen, Xiaolong Wang*, Hao Su*.
Temporal Difference Learning for Model Predictive Control.
International Conference on Machine Learning (ICML), 2022.
Xuanchi Ren, Xiaolong Wang.
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image.
Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
Jiarui Xu, Shalini De Mello, Sifei Liu, Wonmin Byeon, Thomas Breuel, Jan Kautz, Xiaolong Wang.
GroupViT: Semantic Segmentation Emerges from Text Supervision.
Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
Ruihan Yang*, Minghao Zhang*, Nicklas Hansen, Huazhe Xu, Xiaolong Wang.
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers.
International Conference on Learning Representations (ICLR), 2022.
Spotlight Presentation
Zihang Lai, Sifei Liu, Alexei A. Efros, Xiaolong Wang.
Video Autoencoder: self-supervised disentanglement of static 3D structure and motion.
International Conference on Computer Vision (ICCV), 2021.
Oral Presentation
Jiarui Xu, Xiaolong Wang.
Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective.
International Conference on Computer Vision (ICCV), 2021.
Oral Presentation
Hanwen Jiang*, Shaowei Liu*, Jiashun Wang, Xiaolong Wang.
Hand-Object Contact Consistency Reasoning for Human Grasps Generation.
International Conference on Computer Vision (ICCV), 2021.
Oral Presentation
Yinbo Chen, Sifei Liu, Xiaolong Wang.
Learning Continuous Image Representation with Local Implicit Image Function.
Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
Oral Presentation
Qiang Zhang, Tete Xiao, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang.
Learning Cross-domain Correspondence for Control with Dynamics Cycle-consistency.
International Conference on Learning Representations (ICLR), 2021.
Oral Presentation
Nicklas Hansen, Rishabh Jangir, Yu Sun, Guillem Alenyà, Pieter Abbeel, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang.
Self-Supervised Policy Adaptation during Deployment.
International Conference on Learning Representations (ICLR), 2021.
Spotlight Presentation
Yu Sun, Xiaolong Wang, Zhuang Liu, John Miller, Alexei A. Efros, Moritz Hardt.
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts.
International Conference on Machine Learning (ICML), 2020.
Xiaolong Wang*, Allan Jabri* and Alexei A. Efros.
Learning Correspondence from the Cycle-consistency of Time.
Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
Oral Presentation
[arXiv] [BibTeX] [code]
Xiaolong Wang*, Rohit Girdhar*, and Abhinav Gupta.
Binge Watching: Scaling Affordance Learning from Sitcoms.
Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
Spotlight Presentation
Xiaolong Wang and Abhinav Gupta.
Unsupervised Learning of Visual Representations using Videos.
International Conference on Computer Vision (ICCV), 2015