学术报告

more
您当前所在位置: 首页 > 通知公告 > 学术报告 > 正文

用于目标检测的变形深度卷积神经网络--王晓刚教授

发布时间:2015-05-19点击量:

                                       DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection

Abstract: In this talk, I will introduce the deep learning based framework for general object detection on ImageNet. It significantly outperforms well-known object detection works such as GoogleNet, VGG and RCNN with large margins on the ILSVRC2014 detection test set. The proposed pipeline integrates region proposal, bounding box rejection, a new pre-training strategy based on object-level annotations, feature learning, part-deformation learning, contextual modeling, bounding box regression, and model averaging. Detailed component-wise analysis will be provided through extensive experimental evaluation, which provides a global view for people to understand the deep learning object detection pipeline.  In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty.

Through the application of object detection, I would also like to highlight two key points on deep learning. (1) In order to learn feature representation with high discriminative power and good generalization capability, it is better to use challenging supervision tasks with high dimensional prediction to train deep models. Once these features are learned with challenging tasks, they can be well applied to easier tasks. (2) Instead of treating deep learning as black box, one could build the connection between the layers of deep models and the key components of existing vision systems. The research experience from existing vision systems can help us proposed new layers and new training strategies.

Xiaogang Wang received his Bachelor degree in Electrical Engineering and Information Science from the Special Class of Gifted Young at the University of Science and Technology of China in 2001, M. Phil. degree in Information Engineering from the Chinese University of Hong Kong in 2004, and PhD degree in Computer Science from Massachusetts Institute of Technology in 2009. He is an assistant professor in the Department of Electronic Engineering at the Chinese University of Hong Kong since August 2009. He received the Outstanding Young Researcher in Automatic Human Behaviour Analysis Award in 2011, Hong Kong RGC Early Career Award in 2012, and Young Researcher Award of the Chinese University of Hong Kong. He is the associate editor of the Image and Visual Computing Journal. He was the area chair of ICCV 2011, ECCV 2014, ACCV 2014, and ICCV 2015. His research interests include computer vision, deep learning, crowd video surveillance, object detection, and face recognition. 

上一篇:“互联网+”与大数据的产业发展趋势--张良杰校友报告
下一篇:图匹配方式解析街景数据--王瑞胜博士