Xiaolong Wang

Assistant Professor, UC San Diego [GitHub] [Google Scholar] [CV]

Selected Papers


Chenhongyi Yang*, Jiarui Xu*, Shalini De Mello, Elliot J. Crowley, Xiaolong Wang.
GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation.
International Conference on Learning Representations (ICLR), 2023.
Spotlight Presentation

[arXiv] [code]

Kaifeng Zhang, Yang Fu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang.
Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild.
International Conference on Learning Representations (ICLR), 2023.

[arXiv] [project page]

Yuzhe Qin*, Binghao Huang*, Zhao-Heng Yin, Hao Su, Xiaolong Wang.
DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation.
Conference on Robot Learning (CoRL), 2022.

[arXiv] [project page] [video]

Sateesh Kumar, Jonathan Zamora*, Nicklas Hansen*, Rishabh Jangir, Xiaolong Wang.
Graph Inverse Reinforcement Learning from Diverse Videos.
Conference on Robot Learning (CoRL), 2022.
Oral Presentation

[arXiv] [project page] [video] [code]

Yang Fu, Xiaolong Wang.
Category-Level 6D Object Pose Estimation in the Wild: A Semi-Supervised Learning Approach and A New Dataset.
Conference on Neural Information Processing Systems (NeurIPS), 2022.

[arXiv] [project page] [dataset] [Wild6D code]

Yinbo Chen, Xiaolong Wang.
Transformers as Meta-Learners for Implicit Neural Representations.
European Conference on Computer Vision (ECCV), 2022.

[arXiv] [project page] [code]

Yuzhe Qin*, Yueh-Hua Wu*, Shaowei Liu, Hanwen Jiang, Ruihan Yang, Yang Fu, Xiaolong Wang.
DexMV: Imitation Learning for Dexterous Manipulation from Human Videos.
European Conference on Computer Vision (ECCV), 2022.

[arXiv] [project page] [video] [code]

Yuzhe Qin, Hao Su*, Xiaolong Wang*.
From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation.
Robotics and Automation Letters (RA-L), 2022.
International Conference on Intelligent Robots and Systems (IROS), 2022.

[arXiv] [project page] [video]

Chieko Sarah Imai*, Minghao Zhang*, Yuchen Zhang*, Marcin Kierebiński, Ruihan Yang, Yuzhe Qin, Xiaolong Wang.
Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization.
International Conference on Intelligent Robots and Systems (IROS), 2022.

[arXiv] [project page] [video] [code]

Nicklas Hansen, Xiaolong Wang*, Hao Su*.
Temporal Difference Learning for Model Predictive Control.
International Conference on Machine Learning (ICML), 2022.

[arXiv] [project page] [video] [code]

Jiteng Mu, Shalini De Mello, Zhiding Yu, Nuno Vasconcelos, Xiaolong Wang, Jan Kautz, Sifei Liu.
CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs.
Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

[arXiv] [project page] [video]

Xuanchi Ren, Xiaolong Wang.
Look Outside the Room: Synthesizing A Consistent Long-Term 3D Scene Video from A Single Image.
Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

[arXiv] [project page] [video] [code]

Jiarui Xu, Shalini De Mello, Sifei Liu, Wonmin Byeon, Thomas Breuel, Jan Kautz, Xiaolong Wang.
GroupViT: Semantic Segmentation Emerges from Text Supervision.
Conference on Computer Vision and Pattern Recognition (CVPR), 2022.

[arXiv] [project page] [video] [code] [huggingface colab] [huggingface demo]

Ruihan Yang*, Minghao Zhang*, Nicklas Hansen, Huazhe Xu, Xiaolong Wang.
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers.
International Conference on Learning Representations (ICLR), 2022.
Spotlight Presentation

[arXiv] [project page] [video] [code]

Zihang Lai, Sifei Liu, Alexei A. Efros, Xiaolong Wang.
Video Autoencoder: Self-Supervised Disentanglement of Static 3D Structure and Motion.
International Conference on Computer Vision (ICCV), 2021.
Oral Presentation

[arXiv] [project page] [code] [video]

Jiarui Xu, Xiaolong Wang.
Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective.
International Conference on Computer Vision (ICCV), 2021.
Oral Presentation

[arXiv] [project page] [code]

Hanwen Jiang*, Shaowei Liu*, Jiashun Wang, Xiaolong Wang.
Hand-Object Contact Consistency Reasoning for Human Grasps Generation.
International Conference on Computer Vision (ICCV), 2021.
Oral Presentation

[arXiv] [project page] [code]

Jiteng Mu, Weichao Qiu, Adam Kortylewski, Alan Yuille, Nuno Vasconcelos, Xiaolong Wang.
A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation.
International Conference on Computer Vision (ICCV), 2021.

[arXiv] [project page] [code]

Yinbo Chen, Sifei Liu, Xiaolong Wang.
Learning Continuous Image Representation with Local Implicit Image Function.
Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
Oral Presentation

[arXiv] [project page] [code]

Qiang Zhang, Tete Xiao, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang.
Learning Cross-domain Correspondence for Control with Dynamics Cycle-consistency.
International Conference on Learning Representations (ICLR), 2021.
Oral Presentation

[arXiv] [project page] [code]

Nicklas Hansen, Rishabh Jangir, Yu Sun, Guillem Alenyà, Pieter Abbeel, Alexei A. Efros, Lerrel Pinto, Xiaolong Wang.
Self-Supervised Policy Adaptation during Deployment.
International Conference on Learning Representations (ICLR), 2021.
Spotlight Presentation

[arXiv] [project page] [code] [bair blog post]

Yu Sun, Xiaolong Wang, Zhuang Liu, John Miller, Alexei A. Efros, Moritz Hardt.
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts.
International Conference on Machine Learning (ICML), 2020.

[arXiv] [code and project page] [BibTeX]

Xiaolong Wang*, Allan Jabri*, and Alexei A. Efros.
Learning Correspondence from the Cycle-consistency of Time.
Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
Oral Presentation

[project page] [slides] [result video] [oral talk]
[arXiv] [BibTeX] [code]

Xiaolong Wang, Ross Girshick, Abhinav Gupta, and Kaiming He.
Non-local Neural Networks.
Conference on Computer Vision and Pattern Recognition (CVPR), 2018.

[arXiv] [BibTeX] [code]

Xiaolong Wang*, Rohit Girdhar*, and Abhinav Gupta.
Binge Watching: Scaling Affordance Learning from Sitcoms.
Conference on Computer Vision and Pattern Recognition (CVPR), 2017.
Spotlight Presentation

[pdf] [BibTeX] [dataset] [project page] [spotlight video]

Xiaolong Wang and Abhinav Gupta.
Unsupervised Learning of Visual Representations using Videos.
International Conference on Computer Vision (ICCV), 2015.

[pdf] [BibTeX] [code] [model] [mined_patches] [project page] [spotlight video]