s Shanghaitech Vision and Intelligent Perception(SVIP) LAB

Selected Conference Papers


TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding
Shuo Wang, Jing Li, Zibo Zhao, Dongze Lian, Binbin Huang, Xiaomei Wang, Zhengxin Li, Shenghua Gao

Accepted by WACV 2024
[Paper]

RoomDesigner: Encoding Anchor-latents for Style-consistent and Shape-compatible Indoor Scene Generation
Yiqun Zhao, Zibo Zhao, Jing Li, Sixun Dong, Shenghua Gao

Accepted by 3DV 2024
[Paper]

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao

Accepted by Neurips 2023
[Project] [Paper]

LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation
Yihao Zhi, Xiaodong Cun, Xuelin Chen, Xi Shen, Wen Guo, Shaoli Huang, Shenghua Gao

Accepted by ICCV 2023
[Project] [Paper]

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
Jiale Xu, Xintao Wang, Weihao Cheng, Yan-Pei Cao, Ying Shan, Xiaohu Qie, Shenghua Gao

Accepted by CVPR 2023
[Project] [Paper]

Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
Sixun Dong*, Huazhang Hu*, Dongze Lian, Weixin Luo, Yicheng Qian, Shenghua Gao

Accepted by CVPR 2023
[Project] [Paper]

PlaneDepth: Self-supervised Depth Estimation via Orthogonal Planes
Ruoyu Wang, Zehao Yu, Shenghua Gao

Accepted by CVPR 2023
[Project] [Paper]

Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces
Yihao Zhi*, Shenhan Qian*, Xinhao Yan*, Shenghua Gao

Accepted by 3DV 2022
[Project] [Paper] [Video] [Code]

UNIF: United Neural Implicit Functions for Clothed Human Reconstruction and Animation
Shenhan Qian, Jiale Xu, Ziwei Liu, Liqian Ma, Shenghua Gao

Accepted by ECCV 2022
[Project] [Paper] [Video] [Code]

SVIP: Sequence VerIfication for Procedures in Videos
Yicheng Qian, Weixin Luo, Dongze Lian, Xu Tang, Peilin Zhao, Shenghua Gao

Accepted by CVPR 2022
[Paper] [Code]

DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
Xianing Chen, Qiong Cao, Yujie Zhong, Jing Zhang, Shenghua Gao, Dacheng Tao

Accepted by CVPR 2022
[Paper]

TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting
Huazhang Hu*, Sixun Dong*, Yiqun Zhao, Dongze Lian, Zhengxin Li, Shenghua Gao

Accepted by CVPR 2022 Oral Presentation
[Paper] [Code]

AS-MLP: An Axial Shifted MLP Architecture for Vision
Dongze Lian*, Zehao Yu*, Xing Sun, Shenghua Gao

Accepted by ICLR 2022
[Paper] [Code]

Crowd Counting With Partial Annotations in an Image
Yanyu Xu*, Ziming Zhong*, Dongze Lian, Jin Li, Zhengxin Li, Xinxing Xu, Shenghua Gao

Accepted by ICCV 2021

Speech Drives Templates: Co-Speech Gesture Synthesis with Learned Templates
Shenhan Qian*, Zhi Tu*, Yihao Zhi*, Wen Liu, Shenghua Gao

Accepted by ICCV 2021
[Project] [Paper] [Video] [Code]

Partially-Supervised Learning for Vessel Segmentation in Ocular Images
Yanyu Xu, Xinxing Xu, Lei Jin, Shenghua Gao, Rick Siow Mong Goh

Accepted by MICCAI 2021

Accurate depth estimation from a hybrid event-RGB stereo setup
Yi-Fan Zuo, Li Cui, Xin Peng, Yanyu Xu, Shenghua Gao, Xia Wang, Laurent Kneip

Accepted by IROS 2021

Look Before You Leap: Learning Landmark Features For One-Stage Visual Grounding
Binbin Huang, Dongze Lian, Weixin Luo, Shenghua Gao

Accepted by CVPR 2021
[Paper] [Code]

Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
Zhaoyuan Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao

Accepted by CVPR 2021
[Paper] [Code]

Layout-Guided Novel View Synthesis from a Single Indoor Panorama
Jiale Xu, Jia Zheng, Yanyu Xu, Rui Tang, Shenghua Gao

Accepted by CVPR 2021
[Paper] [Project]

Prior Based Human Completion
Zibo Zhao, Wen Liu, Yanyu Xu, Xianing Chen, Weixin Luo, Lei Jin, Bohui Zhu, Tong Liu, Binqiang Zhao, Shenghua Gao

Accepted by CVPR 2021
[Paper] [Code]

Appearance-Motion Memory Consistency Network for Video Anomaly Detection
Ruichu Cai, Hao Zhang, Wen Liu, Shenghua Gao, Zhifeng Hao

Accepted by AAAI 2021

KGDet: Keypoint-Guided Fashion Detection
Shenhan Qian*, Dongze Lian*, Binqiang Zhao, Tong Liu, Bohui Zhu, Hai Li, Shenghua Gao

Accepted by AAAI 2021
[Paper] [Code]

Amodal Segmentation Based on Visible Region Segmentation and Shape Prior
Yuting Xiao, Yanyu Xu, Ziming Zhong, Weixin Luo, Jiawei Li, Shenghua Gao

Accepted by AAAI 2021
[Paper] [Code]

SIRI: Spatial Relation Induced Network For Spatial Description Resolution
Peiyao Wang*, Weixin Luo*, Yanyu Xu, Haojie Li, Shugong Xu, Jianyu Yang, Shenghua Gao

Accepted by NeurIPS 2020
[Paper]

P2Net: Patch-match and Plane-regularization for Unsupervised Indoor Depth Estimation
Zehao Yu*, Lei Jin*, Shenghua Gao

Accepted by ECCV 2020
[Paper] [Code]

Encoding Structure-Texture Relation with P-Net for Anomaly Detection in Retinal Images
Kang Zhou*, Yuting Xiao*, Jianlong Yang, Jun Cheng, Wen Liu, Weixin Luo, Zaiwang Gu, Jiang Liu, Shenghua Gao.

Accepted by ECCV 2020
[Code] [Video]

Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
Jia Zheng*, Junfei Zhang*, Jing Li, Rui Tang, Shenghua Gao, Zihan Zhou

Accepted by ECCV 2020
[Paper] [Supplemental Material] [Code] [Website]



Towards Fast Adaptation of Neural Architectures with Meta Learning
Dongze Lian, Yin Zheng, Yintao Xu, Yanxiong Lu, Leyu Lin, Peilin Zhao, Junzhou Huang, Shenghua Gao

Accepted by ICLR 2020
[Paper]



Fast-MVSNet: Sparse-to-Dense Multi-View Stereo With Learned Propagation and Gauss-Newton Refinement
Zehao Yu, Shenghua Gao

Accepted by CVPR 2020
[Paper] [Supplemental Material] [Code]

Geometric Structure Based and Regularized Depth Estimation From 360 Indoor Imagery
Lei Jin*, Yanyu Xu*, Jia Zheng, Junfei Zhang, Rui Tang, Shugong Xu, Jingyi Yu, Shenghua Gao

Accepted by CVPR 2020
[Paper] [Dataset]

Sparse-GAN: Sparsity-constrained Generative Adversarial Network for Retinal OCT Image Anomaly Detection
Kang Zhou, Shenghua Gao, Jun Cheng, Zaiwang Gu, Huazhu Fu, Zhi Tu, Jianlong Yang, Yitian Zhao, Jiang Liu

Accepted by ISBI 2020

Perceptual-assisted Adversarial Adaptation for Choroid Segmentation in Optical Coherence Tomography
Zhenjie Chai*, Kang Zhou*, Jianlong Yang, Yuhui Ma, Zhi Chen, Shenghua Gao, Jiang Liu

Accepted by ISBI 2020

SUNet: A Lesion Regularized Model for Simultaneous Diabetic Retinopathy and Diabetic Macular Edema Grading
Zhi Tu, Shenghua Gao, Kang Zhou, Xianing Chen, Huazhu Fu, Zaiwang Gu, Jun Cheng, Zehao Yu, Jiang Liu

Accepted by ISBI 2020

Open-Set OCT Image Recognition with Synthetic Learning
Yuting Xiao, Shenghua Gao, Zhenjie Chai, Kang Zhou, Tianyang Zhang, Yitian Zhao, Jun Cheng, Jiang Liu

Accepted by ISBI 2020

Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis
Wen Liu*, Zhixin Piao*, Jie Min, Wenhan Luo, Lin Ma, Shenghua Gao

Accepted by ICCV 2019
[Paper] [Supplemental Material] [Code] [Website]

Ki-GAN: Knowledge Infusion Generative Adversarial Network for Photoacoustic Image Reconstruction in vivo
Hengrong Lan*, Kang Zhou*, Changchun Yang, Jun Cheng, Jiang Liu, Shenghua Gao^, Fei Gao^

Accepted by MICCAI 2019

Learning Semantics-aware Distance Map with Semantics Layering Network for Amodal Instance Segmentation
Ziheng Zhang*, Anpei Chen*, Ling Xie, Jingyi Yu, Shenghua Gao

Accepted by ACM MM 2019
[Paper]

Margin Learning Embedded Prediction for Video Anomaly Detection with A Few Anomalies
Wen Liu*, Weixin Luo*, Zhengxin Li, Peilin Zhao, Shenghua Gao

Accepted by IJCAI 2019
[Paper]

FPN++: A SIMPLE BASELINE FOR PEDESTRIAN DETECTION
Junhao Hu*, Lei Jin*, Shenghua Gao

Accepted by ICME 2019
[Paper]

PPGNet: Learning Point-Pair Graph for Line Segment Detection
Ziheng Zhang*, Zhengxin Li*, Ning Bi, Jia Zheng, Jinlei Wang, Kun Huang, Weixin Luo, Yanyu Xu, Shenghua Gao

Accepted by CVPR 2019
[Paper] [Code]

Local to Global Learning: Gradually Adding Classes for Training Deep Neural Networks
Hao Cheng*, Dongze Lian*, Bowen Deng, Shenghua Gao, Tao Tan, Yanlin Geng

Accepted by CVPR 2019
[Paper] [Code]

Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding
Zehao Yu*, Jia Zheng*, Dongze Lian, Zihan Zhou, Shenghua Gao

Accepted by CVPR 2019
[Paper] [Code]

Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization
Dongze Lian*, Jing Li*, Jia Zheng, Weixin Luo, Shenghua Gao

Accepted by CVPR 2019
[Paper] [Code]

RGBD Based Gaze Estimation via Multi-task CNN
Dongze Lian*, Ziheng Zhang*, Weixin Luo, Lina Hu, Minye Wu, Zechao Li, Jingyi Yu, Shenghua Gao

Accepted by AAAI 2019
[Paper] [Code]

Believe It or Not, We Know What You Are Looking at!
Dongze Lian*, Zehao Yu*, Shenghua Gao

Accepted as ACCV 2018 Oral Presentation (Accepted Rate: 4.6%)
[Paper] [Code]

Evaluating Capability of Deep Neural Networks for Image Classication via Information Plane
Hao Cheng, Dongze Lian, Shenghua Gao, Yanlin Geng

Accepted by ECCV 2018
[Paper] [Code]

Saliency Detection in 360 Videos
Ziheng Zhang *, Yanyu Xu * , Jingyi Yu, Shenghua Gao

Accepted by ECCV 2018
[Paper] [Code]

Future Frame Prediction for Anomaly Detection - A New Baseline
Wen Liu *, Weixin Luo *, Dongze Lian, Shenghua Gao

Accepted by CVPR 2018
[Paper] [Code]

Encoding Crowd Interaction with Deep Neural Network for Pedestrian Trajectory Prediction
Yanyu Xu *, Zhixin Piao *, Shenghua Gao

Accepted by CVPR 2018
[Paper] [Code]


Gaze Prediction in Dynamic 360 Immersive Videos
Yanyu Xu, Yanbing Dong, Junru Wu, Zhengzhong Sun, Zhiru Shi, Jingyi Yu, Shenghua Gao

Accepted by CVPR 2018
[Paper] [Code]


Face Aging With Identity-Preserved Conditional Generative Adversarial Networks
Zongwei Wang, Xu Tang, Weixin Luo, Shenghua Gao

Accepted by CVPR 2018
[Paper] [Code]


Learning to Parse Wireframes in Images of Man-Made Environments
Kun Huang, Yifan Wang, Zihan Zhou, Tianjiao Ding, Shenghua Gao, Yi Ma

Accepted by CVPR 2018
[Paper] [Code]


A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework
Weixin Luo *, Wen Liu *, Shenghua Gao

Accepted by ICCV 2017
[Paper] [Code]


Remembering History with Convolutional LSTM for Anomaly Detection
Weixin Luo *, Wen Liu *, Shenghua Gao

Accepted by ICME 2017 (Oral, 15% accept rate)
[Paper] [Code]


Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN
Yanyu Xu, Nianyi Li, Junru Wu, Jingyi Yu, Shenghua Gao

Accepted by IJCAI 2017 (Best Student Paper Run-upper)
[Paper]


Bi-Level Multi-Column Convolutional Neural Networks for Facial Landmark Point Detection
Yanyu Xu, Shenghua Gao

Accepted by ECCV 2016 Workshop

Single-Image Crowd Counting via Multi-Column Convolutional Neural Network
Yingying Zhang, Desen Zhou, Siqin Chen, Shenghua Gao, Yi Ma

Accepted by CVPR 2016
[Paper]


Selected Journal Papers


Feature Re-Representation and Reliable Pseudo Label Retraining for Cross-Domain Semantic Segmentation
Jing Li, Kang Zhou, Shenhan Qian, Wen Li, Lixin Duan, Shenghua Gao

Accepted by TPAMI 2022


Locating and Counting Heads in Crowds With a Depth Prior
Dongze Lian, Xianing Chen, Jing Li, Weixin Luo, Shenghua Gao

Accepted by TPAMI 2022


Future Frame Prediction Network for Video Anomaly Detection
Weixin Luo, Wen Liu, Dongze Lian, Shenghua Gao

Accepted by TPAMI 2022


Proxy-bridged Image Reconstruction Network for Anomaly Detection in Medical Images
Kang Zhou, Jing Li, Weixin Luo, Zhengxin Li, Jianlong Yang, Huazhu Fu, Jun Cheng, Jiang Liu, Shenghua Gao

Accepted by TMI 2022


Memorizing Structure-Texture Correspondence for Image Anomaly Detection
Kang Zhou *, Jing Li *, Yuting Xiao, Jianlong Yang, Jun Cheng, Wen Liu, Weixin Luo, Jiang Liu, Shenghua Gao

Accepted by TNNLS 2021


Spherical DNNs and Their Applications in 360 Images and Videos
Yanyu Xu *, Ziheng Zhang *, Shenghua Gao

Accepted by TPAMI 2021


Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis
Wen Liu *, Zhixin Piao *, Zhi Tu, Wenhan Luo, Lin Ma, Shenghua Gao

Accepted by TPAMI 2021

Video Anomaly Detection with Sparse Coding Inspired Deep Neural Networks
Weixin Luo *, Wen Liu *, Shenghua Gao, Dongze Lian

Accepted by TPAMI 2019

Multi-view Multi-task Gaze Prediction with Deep Convolutional Neural Networks
Dongze Lian, Shenghua Gao, Lina Hu, Weixin Luo, Yanyu Xu, Lixin Duan, Jingyi Yu

Accepted by TNNLS 2018

Multi-column CNN and its Applications for Crowd Counting and Face Alignment
Yanyu Xu, Shenghua Gao, Yingying Zhang, Yi Ma

Submitted to IJCV 2018 (under review)

Personalized Saliency and its Prediction
Yanyu Xu, Junru Wu, Nianyi Li, Shenghua Gao, Jingyi Yu

Accepted by TPAMI 2018