CVPR 2020, The 2nd Tutorial on

Learning Representations via Graph-structured Networks

Slides and recorded videos are provided in this webpage.
Sunday afternoon (1PM - 4:30PM PDT), June 14, 2020



Recent years have seen a dramatic rise in the adoption of convolutional neural networks (ConvNets) for a myriad of computer vision tasks. The structure of convolution is proved to be powerful in numerous tasks to capture correlations and abstract conceptions out of image pixels. However, ConvNets are also shown to be deficient in modeling quite a few properties when computer vision works towards more difficult AI tasks. These properties include pairwise relation, global context and the ability to process irregular data beyond spatial grids.

An effective direction is to reorganize the data to be processed with graphs according to the task at hand, while constructing network modules that relate and propagate information across the visual elements within the graphs. We call these networks with such propagation modules as graph-structured networks. In this tutorial, we will introduce a series of effective graph-structured networks, including non-local neural networks, spatial generalized propagation networks, relation networks for objects and multi-agent behavior modeling, graph networks for videos and data of 3D domain. We will also discuss how to utilize graph-structured neural architectures to study the network connectivity patterns. Lastly, we will discuss related open challenges that still exist in many vision problems.


1:00 - 1:30 PM . Learning Long-term Visual Dynamics. Xiaolong Wang. [slides] [recorded video]

1:30 - 2:00 PM . Structured Representations of the Visual World. Chen Sun . [slides] [recorded video]

2:00 - 2:30 PM . Self-Attention Modeling for Visual Recognition. Han Hu. [slides] [recorded video]

2:30 - 3:00 PM . Break.

3:00 - 3:30 PM . Self-supervised Learning for Temporal Correspondence Sifei Liu . [slides] [recorded video]

3:30 - 4:00 PM . Graph Structure of Neural Networks. Saining Xie . [slides] [recorded video]

4:00 - 4:30 PM . Object-centric Physical and Spatial Inference. Shubham Tulsiani . [slides] [recorded video]

Please contact Xiaolong Wang if you have question. The webpage template is by the courtesy of awesome Georgia.