Talks

PhD Proposal: Towards Effective Temporal Modeling for Video Understanding and Beyond

Bo He

4105 Brendan Iribe Center for Computer Science and Engineering (IRB)

Wednesday, January 18, 2023, 3:00-5:00 pm

You are subscribed to this talk through .
You are watching this talk through .
You are subscribed to this talk. (unsubscribe, watch)
You are watching this talk. (unwatch, subscribe)
You are not subscribed to this talk. (watch, subscribe)

Abstract

ideo understanding is a fundamental research topic in computer vision, which requires extracting and analyzing information from videos automatically. Compared to the image modality, the video modality significantly differs from it with additional temporal dependencies, which provide crucial clues to help understand what happens across time. Therefore, how to effectively model temporal relationships for videos is of vital importance for video understanding.

This thesis is divided into two parts. In the first part, we mainly concentrate on how to model the temporal dependencies for different video understanding tasks, including action recognition, temporal action localization, and video summarization. In the second part, beyond the pure understanding of video content, we focus on how to represent large-scale videos compactly and efficiently. We propose to encode diverse videos into a single neural network and explore its superior advantages in various video downstream tasks.

Examining Committee

Chair:	Dr. Abhinav Shrivastava
Department Representative:	Dr. Ramani Duraiswami
Members:	Dr. Furong Huang
	Dr. Chen Sun

Bio

Bo He is a PhD student at the University of Maryland, College Park, advised by Prof. Abhinav Shrivastava. His research interests lie in video understanding, video neural representation and multimodal learning. He received the B.Eng. degree in Computer Science and Technology from the University of Chinese Academy of Sciences.

This talk is organized by Tom Hurst