Title | Authors |
Pre-training without Natural Images | Hirokatsu Kataoka (National Institute of Advanced Industrial Science and Technology (AIST))*; Kazushige Okayasu (National Institute of Advanced Industrial Science and Technology (AIST)); Asato Matsumoto (National Institute of Advanced Industrial Science and Technology (AIST)); Eisuke Yamagata (Tokyo Institute of Technology); Ryosuke Yamada (Tokyo Denki University); Nakamasa Inoue (Tokyo Institute of Technology); Akio Nakamura (Tokyo Denki University (TDU)); Yutaka Satoh (National Institute of Advanced Industrial Science and Technology (AIST)) |
In-sample Contrastive Learning and Consistent Attention for Weakly Supervised Object Localization | Minsong Ki (Yonsei University)*; Youngjung Uh (Yonsei University); Wonyoung Lee (Yonsei University); Hyeran Byun (Yonsei University) |
Backbone Based Feature Enhancement for Object Detection | Haoqin Ji (Shenzhen University); Weizeng Lu (Shenzhen University); Linlin Shen (Shenzhen University)* |
Part-aware Attention Network for Person Re-Identification | Wangmeng Xiang (The Hong Kong Polytechnic University); Jianqiang Huang (Damo Academy, Alibaba Group); Xian-Sheng Hua (Alibaba Group); Lei Zhang (“Hong Kong Polytechnic University, Hong Kong, China”)* |
RF-GAN: A Light and Reconfigurable Network for Unpaired Image-to-Image Translation | Ali Koksal (Nanyang Technological University); Shijian Lu (Nanyang Technological University)* |
SpotPatch: Parameter-Efficient Transfer Learning for Mobile Object Detection | Keren Ye (University of Pittsburgh)*; Adriana Kovashka (University of Pittsburgh); Mark Sandler (Google); Menglong Zhu (UPenn); Andrew Howard (Google); Marco Fornoni (Google) |
Sketch-to-Art: Synthesizing Stylized Art Images From Sketches | Bingchen Liu (Rutgers, The State University of New Jersey)*; Kunpeng Song (Rutgers University); Yizhe Zhu (Rutgers University ); Ahmed Elgammal (-) |
An Efficient Group Feature Fusion Residual Network for Image Super-Resolution | Pengcheng Lei (University of Shanghai for Science and Technology); Cong Liu (University of Shanghai for Science and Technology)* |
3D Human Motion Estimation via Motion Compression and Refinement | Zhengyi Luo (Carnegie Mellon University)*; S. Alireza Golestaneh (Carnegie Mellon University); Kris M. Kitani (Carnegie Mellon University) |
Dense Pixel-wise Micro-motion Estimation of Object Surface by using Low Dimensional Embedding of Laser Speckle Pattern | Ryusuke Sagawa (“AIST, Japan”)*; Yusuke Higuchi (Kyushu University); Hiroshi Kawasaki (Kyushu univ.); Ryo Furukawa (Hiroshima city univ.); Takahiro Ito (AIST) |
Bidirectional Pyramid Networks for Semantic Segmentation | Dong Nie (UNC)*; Jia Xue (Rutgers University); Xiaofeng Ren (Alibaba group) |
Self-supervised Learning of Orc-Bert Augmentator for Recognizing Few-Shot Oracle Characters | Wenhui Han (Fudan University); Xinlin Ren (Fudan University); Hangyu Lin (Fudan University); Yanwei Fu (Fudan University)*; Xiangyang Xue (Fudan University) |
Title | Authors |
RealSmileNet: A Deep End-To-End Network for Spontaneous and Posed Smile Recognition | Yan Yang (Australian National University)*; Md Zakir Hossain (The Australian National University ); Tom Gedeon (The Australian National University); Shafin Rahman (North South University) |
IAFA: Instance-Aware Feature Aggregation for 3D Object Detection from a Single Image | Dingfu Zhou (Baidu)*; Xibin Song (Baidu); Yuchao Dai (Northwestern Polytechnical University); Junbo Yin (Beijing Institute of Technology); Feixiang Lu (Baidu); Miao Liao (Baidu); Jin Fang (Baidu ); Liangjun Zhang (Baidu) |
Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild | weijia wu (Zhejiang University)*; Ning Lu (Tencent Cloud Product Department); Enze Xie (The University of Hong Kong); Yuxing Wang (Zhejiang University); Wenwen Yu (Xuzhou Medical University); Cheng Yang (Zhejiang University); HONG ZHOU (Zhejiang University) |
Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses | Miao Liao (Baidu)*; Sibo Zhang (Baidu); Peng Wang (Baidu USA LLC.); Hao Zhu (Nanjing University); Xinxin Zuo (University of Kentucky); Ruigang Yang (University of Kentucky, USA) |
DeepVoxels++: Enhancing the Fidelity of Novel View Synthesis from 3D Voxel Embeddings | Tong He (UCLA)*; John Collomosse (Adobe Research); Hailin Jin (Adobe Research); Stefano Soatto (UCLA) |
MLIFeat: Multi-level information fusion based deep local features | Yuyang Zhang (Institute of Automation, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Jinge Wang (Megvii); Shibiao Xu (Institute of Automation, Chinese Academy of Sciences)*; Xiao Liu (Megvii Inc); Xiaopeng Zhang (Institute of Automation, Chinese Academy of Sciences) |
Quantum Robust Fitting | Tat-Jun Chin (University of Adelaide); David Suter (Edith Cowan University); Shin-Fang Ch’ng (The University of Adelaide)*; James Quach (The University of Adelaide) |
Do We Need Sound for Sound Source Localization? | Takashi Oya (Waseda University)*; Shohei Iwase (Waseda University ); Ryota Natsume (Waseda University); Takahiro Itazuri (Waseda University); Shugo Yamaguchi (Waseda University); Shigeo Morishima (Waseda Research Institute for Science and Engineering) |
Second Order enhanced Multi-glimpse Attention in Visual Question Answering | Qiang Sun (Fudan University)*; Binghui Xie (Fudan University); Yanwei Fu (Fudan University) |
Restoring Spatially-Heterogeneous Distortions using Mixture of Experts Network | Sijin Kim (Ajou University); Namhyuk Ahn (Ajou University); Kyung-Ah Sohn (Ajou University)* |
Augmentation Network for Generalised Zero-Shot Learning | RAFAEL FELIX (The University of Adelaide)*; Michele Sasdelli (The University of Adelaide); Ian Reid (“University of Adelaide, Australia”); Gustavo Carneiro (University of Adelaide) |
Single-Image Camera Response Function Using Prediction Consistency and Gradual Refinement | Aashish Sharma (National University of Singapore)*; Robby T. Tan (Yale-NUS College); Loong-Fah Cheong (NUS) |
Channel Recurrent Attention Networks for Video Pedestrian Retrieval | Pengfei Fang (The Australian National University)*; Pan Ji (OPPO US Research Center); Jieming Zhou (The Australian National University); Lars Petersson (Data61/CSIRO); Mehrtash Harandi (Monash University) |
Scale-Aware Polar Representation for Arbitrarily-Shaped Text Detection | Yanguang Bi (SenseTime Research); Zhiqiang Hu (SenseTime Research)* |
Background Learnable Cascade for Zero-Shot Object Detection | Ye Zheng (Institute of Computing Technology, Chinese Academy of Sciences)*; Ruoran Huang (Institute of Computing Technology, Chinese Academy of Sciences); Chuanqi Han (Institute of Computing Technology, Chinese Academy of Sciences); Xi Huang (Institute of computing technology of the Chinese Academy of Sciences); Li Cui ( Institute of computing technology of the Chinese Academy of Sciences) |
Feature Variance Ratio-Guided Channel Pruning for Deep Convolutional Network Acceleration | Junjie He (Zhejiang University)*; Bohua Chen (Zhejiang University); Yinzhang Ding (Zhejiang University); Dongxiao Li (Zhejiang University) |
Faster Self-adaptive Deep Stereo | Haiyang Wang (Zhejiang University)*; Xinchao Wang (Stevens Institute of Technology); Jie Song (Zhejiang University); Jie Lei (Zhejiang University); Mingli Song (Zhejiang University) |
Color Enhancement using Global Parameters and Local Features Learning | Enyu Liu (Tencent)*; Songnan Li (Tencent); Shan Liu (Tencent America) |
Weakly-supervised Reconstruction of 3D Objects with Large Shape Variation from Single In-the-Wild Images | Shichen Sun (Sichuan University); Zhengbang Zhu (Sichuan University); Xiaowei Dai (Sichuan University); Qijun Zhao (Sichuan University)*; Jing Li (Sichuan University) |
A Day on Campus – An Anomaly Detection Dataset for Events in a Single Camera | Mantini Pranav (University of Houston)*; Li Zhenggang (University of Houston); Shah Shishir K (University of Houston) |
OpenGAN: Open Set Generative Adversarial Networks | Luke Ditria (Monash University); Benjamin J. Meyer (Monash University)*; Tom Drummond (Monash University) |
HPGCNN: Hierarchical Parallel Group Convolutional Neural Networks for Point Clouds Processing | Jisheng Dang (Lanzhou Jiaotong University); Jun Yang (School of Electronic and Information Engineering, Lanzhou Jiaotong University)* |
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network | Yuxuan Wu (The University of Tokyo)*; Hideki Nakayama (The University of Tokyo) |
Hierarchical X-Ray Report Generation via Pathology tags and Multi Head Attention | Preethi Srinivasan (IIT Mandi); Daksh Thapar (Indian Institute of Technology, Mandi)*; Arnav Bhavsar (IIT Mandi); Aditya Nigam (IIT mandi) |
Uncertainty Estimation and Sample Selection for Crowd Counting | Viresh Ranjan (Stony Brook University)*; Boyu Wang (Stony Brook University); Mubarak Shah (University of Central Florida); Minh Hoai (Stony Brook University) |
Visualizing Color-wise Saliency of Black-Box Image Classification Models | Yuhki Hatakeyama (SenseTime Japan)*; Hiroki Sakuma (SenseTime Japan); Yoshinori Konishi (SenseTime Japan); Kohei Suenaga (Kyoto University) |
CS-MCNet:A Video Compressive Sensing Reconstruction Network with Interpretable Motion Compensation | Bowen Huang (Fudan University)*; Jinjia Zhou (Hosei University); Xiao Yan (Fudan University); Ming’e Jing (Fudan University); Rentao Wan (Fudan University); Yibo Fan (Fudan University) |
Attention-Aware Feature Aggregation for Real-time Stereo Matching on Edge Devices | Jia-Ren Chang (National Chiao Tung University; aetherAI); Pei-Chun Chang (National Chiao Tung University); Yong-Sheng Chen (National Chiao Tung University)* |
MCGKT-Net: Multi-level Context Gating Knowledge Transfer Network for Single Image Deraining | Kohei Yamamichi (Yamaguchi University)*; Xian-Hua Han (Yamaguchi University) |
Lightweight Single-Image Super-Resolution Network with Attentive Auxiliary Feature Learning | Xuehui Wang (School of Data and Computer Science, Sun Yat-sen University); qing wang (School of Data and Computer Science, Sun Yat-sen University); Yuzhi Zhao (City University of Hong Kong); Junchi Yan (Shanghai Jiao Tong University); Lei Fan (Northwestern University); long chen (School of Data and Computer Science, Sun Yat-sen University)* |
Mask-Ranking Network for Semi-Supervised Video Object Segmentation | Wenjing Li (University of Electronic Science & Technology of China)*; Xiang Zhang (University of Electronic Science & Technology of China); Yujie Hu (University of Electronic Science & Technology of China); Yingqi Tang (University of Electronic Science & Technology of China) |
Few-Shot Object Detection by Second-order Pooling | Shan Zhang (ANU, Beijing Union University)*; Dawei Luo (Beijing Key Laboratory of Information Service Engineering, Beijing Union University ); Lei Wang (“University of Wollongong, Australia”); Piotr Koniusz (Data61/CSIRO, ANU) |
RE-Net: A Relation Embedded Deep Model for AU Occurrence and Intensity Estimation | Huiyuan Yang (Binghamton University-SUNY)*; Lijun Yin (State University of New York at Binghamton) |
COMET: Context-Aware IoU-Guided Network for Small Object Tracking | Seyed Mojtaba Marvasti-Zadeh (University of Alberta)*; Javad Khaghani (University of Alberta); Hossein Ghanei-Yakhdan (Yazd University); Shohreh Kasaei (Sharif University of Technology); Li Cheng (ECE dept., University of Alberta) |
Title | Authors |
Meta-Learning with Context-Agnostic Initialisations | Toby Perrett (University of Bristol)*; Alessandro Masullo (University of Bristol); Tilo Burghardt (University of Bristol); Majid Mirmehdi (University of Bristol); Dima Damen (University of Bristol) |
D2D: Keypoint Extraction with Describe to Detect Approach | Yurun Tian (Imperial College London)*; Vassileios Balntas (Scape Technologies); Tony Ng (Imperial College London); Axel Barroso-Laguna (Imperial College London); Yiannis Demiris (Imperial College London); Krystian Mikolajczyk (Imperial College London) |
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network | Lingyu Zhu (Tampere University)*; Esa Rahtu (Tampere University) |
Efficient Large-Scale Semantic Visual Localization in 2D Maps | Tomas Vojir (CMP CTU)*; Ignas Budvytis (Department of Engineering, University of Cambridge); Roberto Cipolla (University of Cambridge) |
Encode the Unseen: Predictive Video Hashing for Scalable Mid-Stream Retrieval | Tong Yu (University of Strasbourg)*; Nicolas Padoy (University of Strasbourg) |
Self-Supervised Multi-View Synchronization Learning for 3D Pose Estimation | Simon Jenni (Universität Bern)*; Paolo Favaro (University of Bern) |
Raw-Guided Enhancing Reprocess of Low-Light Image via Deep Exposure Adjustment | Haofeng Huang (Peking University)*; Wenhan Yang (Peking University); Yueyu Hu (Peking University); Jiaying Liu (Peking University) |
Generic Image Segmentation in Fully Convolutional Networks by Superpixel Merging Map | Jin-Yu Huang (National Taiwan University); Jian-Jiun Ding (National Taiwan University)* |
Webly Supervised Semantic Embeddings for Large Scale Zero-Shot Learning | Yannick Le Cacheux (CEA LIST)*; Adrian Popescu (CEA LIST); Herve Le Borgne (CEA LIST) |
Sketch-to-Art: Synthesizing Stylized Art Images From Sketches | Bingchen Liu (Rutgers, The State University of New Jersey)*; Kunpeng Song (Rutgers University); Yizhe Zhu (Rutgers University ); Ahmed Elgammal (-) |
Title | Authors |
Transforming Multi-Concept Attention into Video Summarization | Yen-Ting Liu (National Taiwan University)*; Yu-Jhe Li (Carnegie Mellon University); Yu-Chiang Frank Wang (National Taiwan University) |
Exploiting Transferable Knowledge for Fairness-aware Image Classification | sunhee hwang (Yonsei university)*; Sungho Park (Yonsei University); Pilhyeon Lee (Yonsei University); seogkyu jeon (Yonsei university); Dohyung Kim (Yonsei University); Hyeran Byun (Yonsei University) |
CPTNet: Cascade Pose Transform Network for Single Image Talking Head Animation | Jiale Zhang (Huazhong University of Science and Technology); Ke Xian (Huazhong University of Science and Technology); Chengxin Liu (Huazhong University of Science and Technology)*; Yinpeng Chen (Huazhong University of Science and Technology); Zhiguo Cao (Huazhong Univ. of Sci.&Tech.); Weicai Zhong (Huawei CBG Consumer Cloud Service Big Data Platform Dept.) |
Modular Graph Attention Network for Complex Visual Relational Reasoning | Yihan Zheng (South China University of Technology); Zhiquan Wen (South China University of Technology); Mingkui Tan (South China University of Technology)*; Runhao Zeng (South China University of Technology); Qi Chen (South China University of Technology); Yaowei Wang (PengCheng Laboratory); Qi Wu (University of Adelaide) |
Modeling Cross-Modal interaction in a Multi-detector, Multi-modal Tracking Framework | Yiqi Zhong (University of Southern California)*; Suya You (US Army Research Laboratory); Ulrich Neumann (USC) |
HDD-Net: Hybrid Detector Descriptor with Mutual Interactive Learning | Axel Barroso-Laguna (Imperial College London)*; Yannick Verdie (Huawei Noah’s Ark Lab); Benjamin Busam (Technical University of Munich); Krystian Mikolajczyk (Imperial College London) |
Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation | Hala Lamdouar (University of Oxford)*; Charig Yang (University of Oxford); Weidi Xie (University of Oxford); Andrew Zisserman (University of Oxford) |
Sequential View Synthesis with Transformer | Phong Nguyen-Ha (University of Oulu)*; Lam Huynh ( University of Oulu); Esa Rahtu (Tampere University); Janne Heikkila (University of Oulu, Finland) |
Novel-View Human Action Synthesis | Mohamed Ilyes Lakhal (Queen Mary University of London)*; Davide Boscaini (Fondazione Bruno Kessler); Fabio Poiesi (Fondazione Bruno Kessler); Oswald Lanz (Fondazione Bruno Kessler, Italy); Andrea Cavallaro (Queen Mary University of London, UK) |
Leveraging Tacit Information Embedded in CNN Layers for Visual Tracking | Kourosh Meshgi (RIKEN AIP)*; Maryam Sadat Mirzaei (Riken AIP / Kyoto University); Shigeyuki Oba (Kyoto University) |
Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose | Ardhendu Behera (Edge Hill University)*; Zachary Wharton (Edge Hill University); Pradeep Hewage (Edge Hill University); Swagat Kumar (Edge Hill University) |
SGNet: Semantics Guided Deep Stereo Matching | Shuya Chen (Zhejiang University); Zhiyu Xiang (Zhejiang University)*; Chengyu Qiao (Zhejiang University); Yiman Chen (Zhejiang University); Tingming Bai (Zhejiang University) |
Reconstructing Human Body Mesh from Point Clouds by Adversarial GP Network | Boyao Zhou (Inria)*; Jean-Sebastien Franco (INRIA); Federica Bogo (Microsoft); Bugra Tekin (Microsoft); Edmond Boyer (Inria) |
Semi-supervised Facial Action Unit Intensity Estimation with Contrastive Learning | Enrique Sanchez (Samsung AI Centre)*; Adrian Bulat (Samsung AI Center, Cambridge); Anestis Zaganidis (Samsung); Georgios Tzimiropoulos (Queen Mary University of London) |
Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks | Christoph Raab (FHWS)*; Philipp Väth (FHWS); Peter Meier (FHWS); Frank-Michael Schleif (FHWS) |
Active Learning for Video Description With Cluster-Regularized Ensemble Ranking | David M. Chan (University of California, Berkeley)*; Sudheendra Vijayanarasimhan (Google research); David A. Ross (Google); John F. Canny (UC Berkeley) |
Video-Based Crowd Counting Using a Multi-Scale Optical Flow Pyramid Network | Mohammad Asiful Hossain (HUAWEI Technologies Co, LTD.)*; Kevin Cannons (Huawei Technologies Canada Co., Ltd ); Daesik Jang (Personal Research); Fabio Cuzzolin (Oxford Brookes University); Zhan Xu (Huawei Canada) |
Few-Shot Zero-Shot Learning: Knowledge Transfer with Less Supervision | Nanyi Fei (Renmin University of China); Jiechao Guan (Renmin University of China); Zhiwu Lu (Renmin University of China)*; Yizhao Gao (Renmin University of China) |
Synthetic-to-real domain adaptation for lane detection | Noa Garnett (GM); Roy Uziel (Ben-Gurion University); Netalee Efrat (General Motors); Dan Levi (General Motors)* |
Localin Reshuffle Net: Toward Naturally and Efficiently Facial Image Blending | Chengyao Zheng (Southeast Univeristy); Siyu Xia (Southeast University, China); Joseph Robinson (Northeastern University)*; Changsheng Lu (Shanghai Jiao Tong University); Wayne Wu (Tsinghua University); Chen Qian (SenseTime); Ming Shao (University of Massachusetts Dartmouth) |
RGB-D Co-attention Network for Semantic Segmentation | Hao Zhou (Harbin Engineering University)*; Lu Qi (The Chinese University of Hong Kong); Zhaoliang Wan (Harbin Engineering University); Hai Huang (Harbin Engineering University); Xu Yang (Chinese Academy of Sciences) |
Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection | Erli Ouyang (Fudan University)*; Li Zhang (University of Oxford); Mohan Chen (Fudan University); Anurag Arnab (University of Oxford); Yanwei Fu (Fudan University) |
Learning Multi-Instance Sub-pixel Point Localization | Julien Schroeter (Cardiff University)*; Tinne Tuytelaars (KU Leuven); Kirill Sidorov (Cardiff University); David Marshall (Cardiff University) |
OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets | Javad Amirian (Inria, Rennes, France)*; Bingqing Zhang (UCL); Francisco Valente Castro (Cimat); Juan Jose Baldelomar (Cimat); Jean-Bernard Hayet (CIMAT); Julien Pettré (INRIA Rennes – Bretagne Atlantique) |
Compact and Fast Underwater Segmentation Network for Autonomous Underwater Vehicles | Jiangtao Wang (Loughborough University); Baihua Li (Loughborough University)*; Yang Zhou (Loughborough University); Emanuele Rocco (Witted Srl); Qinggang Meng (Computer Science Department Loughborough University) |
RGB-T Crowd Counting from Drone: A Benchmark and MMCCN Network | Tao Peng (Tianjin University); Qing Li (Northwestern Polytechnical University); Pengfei Zhu (Tianjin university)* |
Adversarially Robust Deep Image Super-Resolution using Entropy Regularization | Jun-Ho Choi (Yonsei University); Huan Zhang (UCLA); Jun-Hyuk Kim (Yonsei University); Cho-Jui Hsieh (UCLA); Jong-Seok Lee (“Yonsei University, Korea”)* |
MMD based Discriminative Learning for Face Forgery Detection | Jian Han (University of Amsterdam)*; Theo Gevers (University of Amsterdam) |
Rotation Equivariant Orientation Estimation for Omnidirectional Localization | Chao Zhang (Toshiba Europe Limited)*; Ignas Budvytis (Department of Engineering, University of Cambridge); Stephan Liwicki (Toshiba Europe Limited); Roberto Cipolla (University of Cambridge) |
Dense-Scale Feature Learning in Person Re-Identification | Li Wang (Inspur); Baoyu Fan (Inspur Electronic Information Industry Co.,Ltd.)*; Zhenhua Guo (Inspur Electronic Information Industry Co.,Ltd.); Yaqian Zhao (Inspur); Runze Zhang (Inspur Electronic Information Industry Co.,Ltd.); Rengang Li (Inspur); Weifeng Gong ( Inspur Electronic Information Industry Co.,Ltd.) |
Feedback Recurrent Autoencoder for Video Compression | Adam Golinski (University of Oxford)*; Reza Pourreza (Qualcomm); Yang Yang (Qualcomm Inc.); Guillaume Sautiere (Qualcomm AI Research); Taco S. Cohen (Qualcomm) |
Discovering Multi-Label Actor-Action Association in a Weakly Supervised Setting | Sovan Biswas (University of Bonn)*; Juergen Gall (University of Bonn) |
MLIFeat: Multi-level information fusion based deep local features | Yuyang Zhang (Institute of Automation, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Jinge Wang (Megvii); Shibiao Xu (Institute of Automation, Chinese Academy of Sciences)*; Xiao Liu (Megvii Inc); Xiaopeng Zhang (Institute of Automation, Chinese Academy of Sciences) |
MCGKT-Net: Multi-level Context Gating Knowledge Transfer Network for Single Image Deraining | Kohei Yamamichi (Yamaguchi University)*; Xian-Hua Han (Yamaguchi University) |
Title | Authors |
Fast and Differentiable Message Passing on Pairwise Markov Random Fields | Zhiwei Xu (Australian National University)*; Thalaiyasingam Ajanthan (ANU); RICHARD HARTLEY (Australian National University, Australia) |
Accurate and Efficient Single Image Super-Resolution with Matrix Channel Attention Network | Hailong Ma (Xiaomi); Xiangxiang Chu (Xiaomi); Bo Zhang (Xiaomi)* |
Domain Adaptation Gaze Estimation by Embedding with Prediction Consistency | Zidong Guo (Xi’an Jiaotong university)*; Zejian Yuan (Xi‘an Jiaotong University); Chong Zhang (Tencent Robotics X); Wanchao Chi (Tencent Robotics X); Yonggen Ling (Tencent); shenghao zhang (Tencent) |
DoFNet: Depth of Field Difference Learning for Detecting Image Forgery | Yonghyun Jeong (Samsung SDS)*; Jongwon Choi (Chung-Ang University); Doyeon Kim (SamsungSDS); Sehyeon Park (Samsung SDS); Minki Hong (Samsung SDS); Changhyun Park (Samsung SDS); Seungjai Min (Samsung SDS); Youngjune Gwon (Samsung SDS) |
Adversarial Refinement Network for Human Motion Prediction | Xianjin Chao (The City University of Hong Kong)*; Yanrui Bin (HUST); Wenqing Chu (Tencent); Xuan Cao (Tencent); Yanhao Ge (Tencent); Chengjie Wang (Tencent); Jilin Li (Tencent); Feiyue Huang (Tencent); Howard Leung (City University of Hong Kong) |
Sparse Convolutions on Continuous Domains for Point Cloud and Event Stream Networks | Dominic Jack (Queensland University of Technology)*; Frederic Maire (Queensland University of Technology); SIMON DENMAN (Queensland University of Technology, Australia); Anders Eriksson (University of Queensland ) |
In-sample Contrastive Learning and Consistent Attention for Weakly Supervised Object Localization | Minsong Ki (Yonsei University)*; Youngjung Uh (Yonsei University); Wonyoung Lee (Yonsei University); Hyeran Byun (Yonsei University) |
RF-GAN: A Light and Reconfigurable Network for Unpaired Image-to-Image Translation | Ali Koksal (Nanyang Technological University); Shijian Lu (Nanyang Technological University)* |
Dense Pixel-wise Micro-motion Estimation of Object Surface by using Low Dimensional Embedding of Laser Speckle Pattern | Ryusuke Sagawa (“AIST, Japan”)*; Yusuke Higuchi (Kyushu University); Hiroshi Kawasaki (Kyushu univ.); Ryo Furukawa (Hiroshima city univ.); Takahiro Ito (AIST) |
Title | Authors |
Anatomy and Geometry Constrained One-Stage Framework for 3D Human Pose Estimation | Xin Cao (Shanghai JiaoTong University); Xu Zhao (Shanghai Jiao Tong University)* |
Imbalance Robust Softmax for Deep Embedding Learning | Hao Zhu (Australian National University)*; Yang Yuan (AnyVision); Guosheng Hu (AnyVision); Xiang Wu (Reconova); Neil Robertson (Queen’s University Belfast) |
Frequency Attention Network: Blind Noise Removal for Real Images | Hongcheng Mo (Shanghai Jiao Tong University); Jianfei Jiang (Shanghai Jiao Tong University); Qin Wang (Shanghai Jiao Tong University)*; Dong Yin (Fullhan); Pengyu Dong (Fullhan); Jingjun Tian (Fullhan) |
Learning End-to-End Action Interaction by Paired-Embedding Data Augmentation | Ziyang Song (Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University.)*; Zejian Yuan (Xi‘an Jiaotong University); Chong Zhang (Tencent Robotics X); Wanchao Chi (Tencent Robotics X); Yonggen Ling (Tencent); Shenghao Zhang (Tencent) |
Horizontal Flipping Assisted Disentangled Feature Learning for Semi-Supervised Person Re-Identification | Gehan Hao ( University of Electronic Science and Technology of China); Yang Yang (Institute of Automation, Chinese Academy of Sciences); Xue Zhou (University of Electronic Science and Technology of China)*; Guanan Wang (CASIA); Zhen Lei (NLPR, CASIA, China) |
Dense Dual-Path Network for Real-time Semantic Segmentation | Xinneng Yang (Tongji University)*; Yan Wu (Tongji University); Junqiao Zhao (Tongji University); Feilin Liu (Tongji University) |
Channel Pruning for Accelerating Convolutional Neural Networks via Wasserstein Metric | Haoran Duan (University of Science and Technology of China (USTC))*; Hui Li (University of Science and Technology of China (USTC)) |
Human Motion Deblurring using Localized Body Prior | Jonathan Samuel Lumentut (Inha University); Joshua Santoso (Inha University); In Kyu Park (Inha University)* |
Compensating for the Lack of Extra Training Data by Learning Extra Representation | Hyeonseong Jeon (Sungkyunkwan University)*; Siho Han (Sungkyunkwan University); Sangwon Lee (SKKU); Simon S. Woo (SKKU) |
Second-order Camera-aware Color Transformation for Cross-domain Person Re-identification | Wangmeng Xiang (The Hong Kong Polytechnic University); Hongwei Yong (The Hong Kong Polytechnic University); Jianqiang Huang (Damo Academy, Alibaba Group); Xian-Sheng Hua (Alibaba Group); Lei Zhang (“Hong Kong Polytechnic University, Hong Kong, China”)* |
DEAL: Difficulty-aware Active Learning for Semantic Segmentation | Shuai Xie (Zhejiang University); Zunlei Feng (Zhejiang University); Ying chen (Zhejiang University); Songtao Sun (Zhejiang University); Chao Ma (Zhejiang University); Mingli Song (Zhejiang University)* |
Gaussian Vector: An Efficient Solution for Facial Landmark Detection | Yilin Xiong (Central South University)*; Zijian Zhou (Horizon); yuhao dou (Horizon); ZHIZHONG SU (Horizon Robotics) |
Homography-based Egomotion Estimation Using Gravity and SIFT Features | Yaqing Ding (Nanjing University of Science and Technology)*; Daniel Barath (MTA SZTAKI, CMP Prague); Zuzana Kukelova (Czech Technical University in Prague) |
COG: COnsistent data auGmentation for object perception | Zewen He (Casia)*; Rui Wu (Horizon Robotics); Dingqian Zhang (Horizon Robotics) |
Branch Interaction Network for Person Re-identification | Zengming Tang (Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China)*; Jun Huang (Shanghai Advanced Research Institute, Chinese Academy of Sciences) |
Over-exposure Correction via Exposure and Scene Information Disentanglement | Yuhui Cao (SECE, Shenzhen Graduate School, Peking University)*; Yurui Ren (Shenzhen Graduate School, Peking University); Thomas H. Li (Advanced Institute of Information Technology, Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University) |
Synthesizing the Unseen for Zero-shot Object Detection | Nasir Hayat (IIAI); Munawar Hayat (IIAI)*; Shafin Rahman (North South University); Salman Khan (Australian National University (ANU)); Syed Waqas Zamir (IIAI); Fahad Shahbaz Khan (Inception Institute of Artificial Intelligence) |
SDP-Net: Scene Flow Based Real-time Object Detection and Prediction from Sequential 3D Point Clouds | Yi Zhang (Zhejiang University); Yuwen Ye (Zhejiang University); Zhiyu Xiang (Zhejiang University)*; Jiaqi Gu (Zhejiang University) |
Jointly Discriminating and Frequent Visual Representation Mining | Qiannan Wang (Xidian university); Ying Zhou (Xidian University); ZhaoYan Zhu (Xidian university); Xuefeng Liang (Xidian University)*; Yu Gu (School of Artificial Intelligence, Xi’dian University) |
Show, Conceive and Tell: Image Captioning with Prospective Linguistic Information | Yiqing Huang (Tsinghua University); Jiansheng Chen (Tsinghua University)* |
SAUM: Symmetry-Aware Upsampling Module for Consistent Point Cloud Completion | Hyeontae Son (Seoul National University)*; Young Min Kim (Seoul National University) |
Online Knowledge Distillation via Multi-branch Diversity Enhancement | Zheng Li (Institute of Virtual Reality and Intelligent System, Hangzhou Normal University)*; YING HUANG (Hangzhou Normal University); Defang Chen (Zhejiang University); Tianren Luo (Institute of Virtual Reality and Intelligent System,Hangzhou Normal University); Ning Cai (Institute of Virtual Reality and Intelligent System,Hangzhou Normal University); Zhigeng Pan (Institute of Virtual Reality and Intelligent System,Hangzhou Normal University) |
3D Object Detection from Consecutive Monocular Images | Chia-Chun Cheng (National Tsing Hua University)*; Shang-Hong Lai (Microsoft) |
MBNet: A Multi-Task Deep Neural Network for Semantic Segmentation and Lumbar Vertebra Inspection on X-ray Images | Van Luan Tran (National Chung Cheng University)*; Huei-Yung Lin (National Chung Cheng University); Hsiao-Wei Liu (Industrial Technology Research Institute (ITRI)) |
Do We Need Sound for Sound Source Localization? | Takashi Oya (Waseda University)*; Shohei Iwase (Waseda University ); Ryota Natsume (Waseda University); Takahiro Itazuri (Waseda University); Shugo Yamaguchi (Waseda University); Shigeo Morishima (Waseda Research Institute for Science and Engineering) |
Single-Image Camera Response Function Using Prediction Consistency and Gradual Refinement | Aashish Sharma (National University of Singapore)*; Robby T. Tan (Yale-NUS College); Loong-Fah Cheong (NUS) |
Color Enhancement using Global Parameters and Local Features Learning | Enyu Liu (Tencent)*; Songnan Li (Tencent); Shan Liu (Tencent America) |
OpenGAN: Open Set Generative Adversarial Networks | Luke Ditria (Monash University); Benjamin J. Meyer (Monash University)*; Tom Drummond (Monash University) |
HPGCNN: Hierarchical Parallel Group Convolutional Neural Networks for Point Clouds Processing | Jisheng Dang (Lanzhou Jiaotong University); Jun Yang (School of Electronic and Information Engineering, Lanzhou Jiaotong University)* |
Title | Authors |
To Filter Prune, or to Layer Prune, That Is The Question | Sara Elkerdawy (University of Alberta)*; Mostafa Elhoushi (Huawei Technologies); Abhineet Singh (University of Alberta); Hong Zhang (University of Alberta); Nilanjan Ray (University of Alberta) |
Long-Term Cloth-Changing Person Re-identification | Xuelin Qian (Fudan University); Wenxuan Wang (Fudan University); Li Zhang (University of Oxford); Fangrui Zhu (Fudan University); Yanwei Fu (Fudan University)*; Tao Xiang (University of Surrey); Yu-Gang Jiang (Fudan University); Xiangyang Xue (Fudan University) |
A cost-effective method for improving and re-purposing large, pre-trained GANs by fine-tuning their class-embeddings | Qi Li (Auburn University); Long Mai (Adobe Research); Michael A. Alcorn (Auburn University); Anh Nguyen (Auburn University)* |
A Benchmark and Baseline for Language-Driven Image Editing | Jing Shi (University of Rochester)*; Ning Xu (Adobe Research); Trung Bui (Adobe Research); Franck Dernoncourt (Adobe Research); Zheng Wen (DeepMind); Chenliang Xu (University of Rochester) |
GAN-based Noise Model for Denoising Real Images | Linh Duy Tran (Teikyo University)*; Son Minh Nguyen (Teikyo University); Masayuki Arai (Teikyo Univ.) |
Depth-Adapted CNN for RGB-D cameras | Zongwei WU (Univ. Bourgogne Franche-Comte, France)*; Guillaume Allibert (Université Côte d’Azur, CNRS, I3S, France ); Christophe Stolz (Univ. Bourgogne Franche-Comte, France); Cedric Demonceaux (Univ. Bourgogne Franche-Comte, France) |
Unified Application of Style Transfer for Face Swapping and Reenactment | Le Minh Ngo (University of Amsterdam)*; Christian aan de Wiel (3DUniversum); Sezer Karaoglu (University of Amsterdam); Theo Gevers (University of Amsterdam) |
Efficient Large-Scale Semantic Visual Localization in 2D Maps | Tomas Vojir (CMP CTU)*; Ignas Budvytis (Department of Engineering, University of Cambridge); Roberto Cipolla (University of Cambridge) |
Self-Supervised Multi-View Synchronization Learning for 3D Pose Estimation | Simon Jenni (Universität Bern)*; Paolo Favaro (University of Bern) |
3D Human Motion Estimation via Motion Compression and Refinement | Zhengyi Luo (Carnegie Mellon University)*; S. Alireza Golestaneh (Carnegie Mellon University); Kris M. Kitani (Carnegie Mellon University) |
Raw-Guided Enhancing Reprocess of Low-Light Image via Deep Exposure Adjustment | Haofeng Huang (Peking University)*; Wenhan Yang (Peking University); Yueyu Hu (Peking University); Jiaying Liu (Peking University) |
Webly Supervised Semantic Embeddings for Large Scale Zero-Shot Learning | Yannick Le Cacheux (CEA LIST)*; Adrian Popescu (CEA LIST); Herve Le Borgne (CEA LIST) |
Title | Authors |
CLASS: Cross-Level Attention and Supervision for Salient Objects Detection | Lv Tang (Nanjing University)*; Bo Li (Nanjing University) |
Tracking-by-Trackers with a Distilled and Reinforced Model | Matteo Dunnhofer (University of Udine)*; Niki Martinel (University of Udine); CHRISTIAN MICHELONI (University of Udine, Italy) |
Adaptive Spatio-Temporal Regularized Correlation Filters for UAV-based Tracking | Libin Xu (Shandong University of Technology); Qilei Li (Sichuan University); Jun Jiang ( Southwest Petroleum University;Sichuan University of Science & Engineering); Guofeng Zou (Shandong University of Technology); Zheng Liu (University of British Columbia); Mingliang Gao (Shandong University of Technology)* |
Towards Fast and Robust Adversarial Training for Image Classification | Erh-Chung Chen (National Tsing Hua University)*; Che-Rung Lee (National Tsing Hua University ) |
A Calibration Method for the Generalized Imaging Model with Uncertain Calibration Target Coordinates | David Uhlig (Karlsruhe Institute of Technology)*; Michael Heizmann (Karlsruher Institut fuer Technologie) |
MIX’EM: Unsupervised Image Classification using a Mixture of Embeddings | Ali Varamesh (KU Leuven)*; Tinne Tuytelaars (KU Leuven) |
Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection | Heng Zhang (Univ Rennes 1)*; Elisa Fromont (Université Rennes 1, IRISA/INRIA rba); Sébastien Lefèvre (Université de Bretagne Sud / IRISA); Bruno Avignon (Atermes) |
Road Obstacle Detection Method Based on an Autoencoder with Semantic Segmentation | Toshiaki Ohgushi (TOYOTA); Kenji Horiguchi (TOYOTA); Masao Yamanaka (TOYOTA)* |
Decoupled Spatial-Temporal Attention Network for Skeleton-Based Action-Gesture Recognition | Lei Shi (Institute of Automation,Chinese Academy of Sciences )*; Yifan Zhang (Institute of Automation, Chinese Academy of Sciences); Jian Cheng (“Chinese Academy of Sciences, China”); Hanqing Lu (NLPR, Institute of Automation, CAS) |
Real-Time Segmentation Networks should be Latency Aware | Evann Courdier (Idiap Research Institute)*; François Fleuret (University of Geneva) |
Reconstructing Creative Lego Models | George Tattersall (University of York)*; Dizhong Zhu (University of York); William A. P. Smith (University of York); Sebastian Deterding (University of York); Patrik Huber (University of York) |
Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation | Siqi Yang (University of Queensland)*; Lin Wu (University of Queensland); Arnold Wiliem (the University of Queensland); Brian C. Lovell (University of Queensland) |
A Two-Stage Minimum Cost Multicut Approach to Self-Supervised Multiple Person Tracking | Kalun Ho (Fraunhofer ITWM)*; Amirhossein Kardoost (University of Mannheim); Franz-Josef Pfreundt (Fraunhofer ITWM); Janis Keuper (hs-offenburg); Margret Keuper (University of Mannheim) |
Cascaded Transposed Long-range Convolutions for Monocular Depth Estimation | Go Irie (NTT Corporation)*; Daiki Ikami (NTT Corporation); Takahito Kawanishi (NTT Corporation); Kunio Kashino (NTT Communication Science Laboratories) |
Learning to Adapt to Unseen Abnormal Activities under Weak Supervision | JaeYoo Park (Seoul National University)*; Junha Kim (Seoul National University); Bohyung Han (Seoul National University) |
Bi-Directional Attention for Joint Instance and Semantic Segmentation in Point Clouds | guangnan wu (Shandong university)*; Zhiyi Pan (Shandong University); Peng Jiang (Shandong University); Changhe Tu (Shandong University) |
FootNet: An efficient convolutional network for multiview 3D foot reconstruction | Felix Kok (Cambridge University)*; James Charles (Cambridge University); Roberto Cipolla (University of Cambridge) |
FAN: Feature Adaptation Network for Surveillance Face Recognition and Normalization | Xi Yin (Microsoft Cloud & AI)*; Ying Tai (Tencent YouTu); Yuge Huang (Tencent YouTu); Xiaoming Liu (Michigan State University) |
Large-Scale Cross-Domain Few-Shot Learning | Jiechao Guan (Renmin University of China); Manli Zhang (Renmin University of China); Zhiwu Lu (Renmin University of China)* |
Best Buddies Registration for Point Clouds | Amnon Drory (Tel-Aviv University)*; Tal Shomer (Tel-Aviv University); Shai Avidan (Tel Aviv University); Raja Giryes (Tel Aviv University) |
Discrete Spatial Importance-Based Deep Weighted Hashing | Yang Shi (Shandong University); Xiushan Nie (Shandong Jianzhu University)*; Quan Zhou (Shandong University); Xiaoming Xi (Shandong Jianzhu University ); Yilong Yin (Shandong University) |
Overwater Image Dehazing via Cycle-Consistent Generative Adversarial Network | Shunyuan Zheng (Harbin Institute of Technology)*; Jiamin Sun (Harbin Institute of Technology); Qinglin Liu (Harbin Institute of Technology); Yuankai Qi (Harbin Institute of Technology); Shengping Zhang (Harbin Institute of Technology) |
ERIC: Extracting Relations Inferred from Convolutions | Joe Townsend (Fujitsu Laboratories of Europe LTD)*; Theodoros Kasioumis (Fujitsu Laboratories of Europe LTD); Hiroya Inakoshi (Fujitsu Laboratories of Europe) |
Self-supervised Sparse to Dense Motion Segmentation | Amirhossein Kardoost (University of Mannheim)*; Kalun Ho (Fraunhofer ITWM); Peter Ochs (Saarland University); Margret Keuper (University of Mannheim) |
Multi-task Learning with Future States for Vision-based Autonomous Driving | Inhan Kim (POSTECH)*; Hyemin Lee (POSTECH); Joonyeong Lee (POSTECH); Eunseop Lee (POSTECH); Daijin Kim (Pohang University of Science and Technology) |
CPTNet: Cascade Pose Transform Network for Single Image Talking Head Animation | Jiale Zhang (Huazhong University of Science and Technology); Ke Xian (Huazhong University of Science and Technology); Chengxin Liu (Huazhong University of Science and Technology)*; Yinpeng Chen (Huazhong University of Science and Technology); Zhiguo Cao (Huazhong Univ. of Sci.&Tech.); Weicai Zhong (Huawei CBG Consumer Cloud Service Big Data Platform Dept.) |
Modular Graph Attention Network for Complex Visual Relational Reasoning | Yihan Zheng (South China University of Technology); Zhiquan Wen (South China University of Technology); Mingkui Tan (South China University of Technology)*; Runhao Zeng (South China University of Technology); Qi Chen (South China University of Technology); Yaowei Wang (PengCheng Laboratory); Qi Wu (University of Adelaide) |
Modeling Cross-Modal interaction in a Multi-detector, Multi-modal Tracking Framework | Yiqi Zhong (University of Southern California)*; Suya You (US Army Research Laboratory); Ulrich Neumann (USC) |
Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose | Ardhendu Behera (Edge Hill University)*; Zachary Wharton (Edge Hill University); Pradeep Hewage (Edge Hill University); Swagat Kumar (Edge Hill University) |
A Day on Campus – An Anomaly Detection Dataset for Events in a Single Camera | Mantini Pranav (University of Houston)*; Li Zhenggang (University of Houston); Shah Shishir K (University of Houston) |
RGB-D Co-attention Network for Semantic Segmentation | Hao Zhou (Harbin Engineering University)*; Lu Qi (The Chinese University of Hong Kong); Zhaoliang Wan (Harbin Engineering University); Hai Huang (Harbin Engineering University); Xu Yang (Chinese Academy of Sciences) |
Uncertainty Estimation and Sample Selection for Crowd Counting | Viresh Ranjan (Stony Brook University)*; Boyu Wang (Stony Brook University); Mubarak Shah (University of Central Florida); Minh Hoai (Stony Brook University) |
RE-Net: A Relation Embedded Deep Model for AU Occurrence and Intensity Estimation | Huiyuan Yang (Binghamton University-SUNY)*; Lijun Yin (State University of New York at Binghamton) |
Feedback Recurrent Autoencoder for Video Compression | Adam Golinski (University of Oxford)*; Reza Pourreza (Qualcomm); Yang Yang (Qualcomm Inc.); Guillaume Sautiere (Qualcomm AI Research); Taco S. Cohen (Qualcomm) |
Title | Authors |
Class-Wise Difficulty-Balanced Loss for Solving Class-Imbalance | Saptarshi Sinha (Hitachi CRL)*; Hiroki Ohashi (Hitachi Ltd); Katsuyuki Nakamura (Hitachi Ltd.) |
Descriptor-Free Multi-View Region Matching for Instance-Wise 3D Reconstruction | Takuma Doi (Osaka University); Fumio Okura (Osaka University)*; Toshiki Nagahara (Osaka University); Yasuyuki Matsushita (Osaka University); Yasushi Yagi (Osaka University) |
Dehazing Cost Volume for Deep Multi-view Stereo in Scattering Media | Yuki Fujimura (Kyoto University)*; Motoharu Sonogashira (Kyoto University); Masaaki Iiyama (Kyoto University) |
Domain-transferred Face Augmentation Network | Hao-Chiang Shao (Fu Jen Catholic University); Kang-Yu Liu (National Tsing Hua University); Chia-Wen Lin (National Tsing Hua University)*; Jiwen Lu (Tsinghua University) |
Image Inpainting with Onion Convolutions | Shant Navasardyan (Picsart Inc.)*; Marianna Ohanyan (Picsart Inc.) |
Chromatic Aberration Correction Using Cross-Channel Prior in Shearlet Domain | Kunyi Li (Tsinghua University); Xin Jin (Tsinghua University)* |
Mapping of Sparse 3D Data using Alternating Projection | Siddhant Ranade (University of Utah); Xin Yu (University of Utah); Shantnu Kakkar (Trimble); Pedro Miraldo (Instituto Superior Técnico, Lisboa); Srikumar Ramalingam (University of Utah)* |
Fast and Differentiable Message Passing on Pairwise Markov Random Fields | Zhiwei Xu (Australian National University)*; Thalaiyasingam Ajanthan (ANU); RICHARD HARTLEY (Australian National University, Australia) |
To Filter Prune, or to Layer Prune, That Is The Question | Sara Elkerdawy (University of Alberta)*; Mostafa Elhoushi (Huawei Technologies); Abhineet Singh (University of Alberta); Hong Zhang (University of Alberta); Nilanjan Ray (University of Alberta) |
Adversarial Refinement Network for Human Motion Prediction | Xianjin Chao (The City University of Hong Kong)*; Yanrui Bin (HUST); Wenqing Chu (Tencent); Xuan Cao (Tencent); Yanhao Ge (Tencent); Chengjie Wang (Tencent); Jilin Li (Tencent); Feiyue Huang (Tencent); Howard Leung (City University of Hong Kong) |
A cost-effective method for improving and re-purposing large, pre-trained GANs by fine-tuning their class-embeddings | Qi Li (Auburn University); Long Mai (Adobe Research); Michael A. Alcorn (Auburn University); Anh Nguyen (Auburn University)* |
Title | Authors |
End-to-end Model-based Gait Recognition | Xiang Li (Nanjing University of Science and Technology)*; Yasushi Makihara (“””Osaka University, Japan”””); Chi Xu (Nanjing University of Science and Technology); Yasushi Yagi (Osaka University); Shiqi Yu (Southern University of Science and Technology, China); Mingwu Ren (Nanjing University of Science and Technology) |
Learning Global Pose Features in Graph Convolutional Networks for 3D Human Pose Estimation | Kenkun Liu ( University of Illinois at Chicago); Zhiming Zou (University of Illinois at Chicago); Wei Tang (University of Illinois at Chicago)* |
Unpaired Multimodal Facial Expression Recognition | Bin Xia (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)* |
Hyperparameter-Free Out-of-Distribution Detection Using Cosine Similarity | Engkarat Techapanurak (Tohoku University)*; Masanori Suganuma (RIKEN AIP / Tohoku University); Takayuki Okatani (Tohoku University/RIKEN AIP) |
Utilizing Transfer Learning and a Customized Loss Function for Optic Disc Segmentation from Retinal Images | Abdullah Sarhan (University of Calgary)*; Ali Al-Khaz’Aly (University of Calgary); Adam Gorner (University of Calgary); Andrew Swift (University of Calgary); Jon Rokne (University of Calgary); Reda Alhajj (University of Calgary); Andrew Crichton (University of Calgary) |
Unified Density-Aware Image Dehazing and Object Detection in Real-World Hazy Scenes | Zhengxi Zhang (Nanjing University of Science & Technology); Liang Zhao (Nanjing University of Science & Technology); Yunan Liu (Nanjing University of Science & Technology); Shanshan Zhang (Max Planck Institute for Informatics)*; Jian Yang (Nanjing University of Science and Technology) |
MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network | Yi Wei (University at Albany – SUNY)*; Zhe Gan (Microsoft); Wenbo Li (Samsung Research America); Siwei Lyu (University at Albany); Ming-Ching Chang (University at Albany – SUNY); Lei Zhang (Microsoft); Jianfeng Gao (Microsoft Research); Pengchuan Zhang (Microsoft Research AI) |
dpVAEs: Fixing Sample Generation for Regularized VAEs | Riddhish Bhalodia (Scientific Computing and Imaging Institute); Iain Lee (Scientific computing and Imaging Institute, University of Utah); Shireen Elhabian (Scientific Computing and Imaging Institute, University of Utah)* |
RAF-AU Database: In-the-Wild Facial Expressions with Subjective Emotion Judgement and Objective AU Annotations | Wen-Jing Yan (JD Digits)*; Shan Li (Beijing University of Posts and Telecommunications); Chengtao Que (JD Digits); Jiquan Pei (JD Digits); Weihong Deng (Beijing University of Posts and Telecommunications) |
EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention Fusion | Chia-Yuan Chang (National Taiwan University)*; Shuo-En Chang (National Taiwan University); Pei-Yung Hsiao (National University of Kaohsiung); Li-Chen Fu (National Taiwan University) |
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation | Haisheng Su (SenseTime Group Limited)* |
Multi-label X-ray Imagery Classification via Bottom-up Attention and Meta Fusion | Benyi Hu (Xi’an Jiaotong University)*; Chi Zhang (Xi’an Jiaotong Univiersity); Le Wang (Xi’an Jiaotong University); Qilin Zhang (HERE Technologies); Yuehu Liu (Xi’an Jiaotong University) |
Regularizing Meta-Learning via Gradient Dropout | Hung-Yu Tseng (University of California, Merced)*; Yi-Wen Chen (University of California, Merced); Yi-Hsuan Tsai (NEC Labs America); Sifei Liu (NVIDIA); Yen-Yu Lin (National Chiao Tung University); Ming-Hsuan Yang (University of California at Merced) |
Addressing Class Imbalance in Scene Graph Parsing by Learning to Contrast and Score | He Huang (University of Illinois at Chicago)*; Shunta Saito (Preferred Networks, Inc.); Yuta Kikuchi (Preferred Networks, Inc.); Eiichi Matsumoto (Preferred Networks, Inc.); Wei Tang (University of Illinois at Chicago); Philip S. Yu (UIC) |
Point Proposal based Instance Segmentation with Rectangular Masks for Robot Picking Task | Satoshi Ito (Toshiba Corporation)*; Susumu Kubota (Toshiba Corporation) |
Vax-a-Net: Training-time Defence Against Adversarial Patch Attacks | Thomas Gittings (University of Surrey); Steve Schneider (University of Surrey); John Collomosse (Adobe Research)* |
Local Context Attention for Salient Object Segmentation | Jing Tan (Megvii(face++) Research); Pengfei Xiong (Megvii(face++) Research)*; Zhengyi Lv (Megvii(face++) Research); Kuntao Xiao (Megvii(face++) Research); Yuwen He (Megvii(face++) Research) |
Explaining image classifiers by removing input features using generative models | Chirag Agarwal (UIC); Anh Nguyen (Auburn University)* |
Contrastively Smoothed Class Alignment for Unsupervised Domain Adaptation | Shuyang Dai (Duke University)*; Yu Cheng (Microsoft); Yizhe Zhang (Microsoft Research); Zhe Gan (Microsoft); Jingjing Liu (Microsoft); Lawrence Carin (CS) |
Emotional Landscape Image Generation Using Generative Adversarial Networks | Chanjong Park (Yonsei University); In-Kwon Lee (Yonsei University)* |
Anatomy and Geometry Constrained One-Stage Framework for 3D Human Pose Estimation | Xin Cao (Shanghai JiaoTong University); Xu Zhao (Shanghai Jiao Tong University)* |
Human Motion Deblurring using Localized Body Prior | Jonathan Samuel Lumentut (Inha University); Joshua Santoso (Inha University); In Kyu Park (Inha University)* |
Compensating for the Lack of Extra Training Data by Learning Extra Representation | Hyeonseong Jeon (Sungkyunkwan University)*; Siho Han (Sungkyunkwan University); Sangwon Lee (SKKU); Simon S. Woo (SKKU) |
DEAL: Difficulty-aware Active Learning for Semantic Segmentation | Shuai Xie (Zhejiang University); Zunlei Feng (Zhejiang University); Ying chen (Zhejiang University); Songtao Sun (Zhejiang University); Chao Ma (Zhejiang University); Mingli Song (Zhejiang University)* |
Gaussian Vector: An Efficient Solution for Facial Landmark Detection | Yilin Xiong (Central South University)*; Zijian Zhou (Horizon); yuhao dou (Horizon); ZHIZHONG SU (Horizon Robotics) |
Homography-based Egomotion Estimation Using Gravity and SIFT Features | Yaqing Ding (Nanjing University of Science and Technology)*; Daniel Barath (MTA SZTAKI, CMP Prague); Zuzana Kukelova (Czech Technical University in Prague) |
COG: COnsistent data auGmentation for object perception | Zewen He (Casia)*; Rui Wu (Horizon Robotics); Dingqian Zhang (Horizon Robotics) |
Synthesizing the Unseen for Zero-shot Object Detection | Nasir Hayat (IIAI); Munawar Hayat (IIAI)*; Shafin Rahman (North South University); Salman Khan (Australian National University (ANU)); Syed Waqas Zamir (IIAI); Fahad Shahbaz Khan (Inception Institute of Artificial Intelligence) |
SDP-Net: Scene Flow Based Real-time Object Detection and Prediction from Sequential 3D Point Clouds | Yi Zhang (Zhejiang University); Yuwen Ye (Zhejiang University); Zhiyu Xiang (Zhejiang University)*; Jiaqi Gu (Zhejiang University) |
Jointly Discriminating and Frequent Visual Representation Mining | Qiannan Wang (Xidian university); Ying Zhou (Xidian University); ZhaoYan Zhu (Xidian university); Xuefeng Liang (Xidian University)*; Yu Gu (School of Artificial Intelligence, Xi’dian University) |
RGB-T Crowd Counting from Drone: A Benchmark and MMCCN Network | Tao Peng (Tianjin University); Qing Li (Northwestern Polytechnical University); Pengfei Zhu (Tianjin university)* |
SAUM: Symmetry-Aware Upsampling Module for Consistent Point Cloud Completion | Hyeontae Son (Seoul National University)*; Young Min Kim (Seoul National University) |
Online Knowledge Distillation via Multi-branch Diversity Enhancement | Zheng Li (Institute of Virtual Reality and Intelligent System, Hangzhou Normal University)*; YING HUANG (Hangzhou Normal University); Defang Chen (Zhejiang University); Tianren Luo (Institute of Virtual Reality and Intelligent System,Hangzhou Normal University); Ning Cai (Institute of Virtual Reality and Intelligent System,Hangzhou Normal University); Zhigeng Pan (Institute of Virtual Reality and Intelligent System,Hangzhou Normal University) |
3D Object Detection from Consecutive Monocular Images | Chia-Chun Cheng (National Tsing Hua University)*; Shang-Hong Lai (Microsoft) |
Title | Authors |
Introspective Learning by Distilling Knowledge from Online Self-explanation | Jindong Gu (University of Munich)*; Zhiliang Wu (Siemens AG and Ludwig Maximilian University of Munich); Volker Tresp (Siemens AG and Ludwig Maximilian University of Munich ) |
Progressive Batching for Efficient Non-linear Least Squares | Huu Le (Chalmers University of Technology)*; Christopher Zach (Chalmers University); Edward Rosten (Snap Inc.); Oliver J. Woodford (Snap Inc) |
DeepSEE: Deep Disentangled Semantic Explorative Extreme Super-Resolution | Marcel C. Bühler (ETH Zürich)*; Andrés Romero (ETH Zürich); Radu Timofte (ETH Zurich) |
A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking | Manuel Stoiber (German Aerospace Center (DLR))*; Martin Pfanne (German Aerospace Center); Klaus H. Strobl (DLR); Rudolph Triebel (German Aerospace Center (DLR)); Alin Albu-Schaeffer (Robotics and Mechatronics Center (RMC), German Aerospace Center (DLR)) |
FreezeNet: Full Performance by Reduced Storage Costs | Paul Wimmer (Luebeck University / Robert Bosch GmbH)*; Jens Mehnert (Robert Bosch GmbH); Alexandru Condurache (Bosch) |
Project to Adapt: Domain Adaptation for Depth Completion from Noisy and Sparse Sensor Data | Adrian Lopez-Rodriguez (Imperial College London)*; Benjamin Busam (Technical University of Munich); Krystian Mikolajczyk (Imperial College London) |
Lossless Image Compression Using a Multi-Scale Progressive Statistical Model | Honglei Zhang (Nokia Technologies)*; Francesco Cricri (Nokia Technologies); Hamed R. Tavakoli (Nokia Technologies); Nannan Zou (Tampere University); Emre Aksu (Nokia Technologies); Miska M. Hannuksela (Nokia Technologies) |
GAN-based Noise Model for Denoising Real Images | Linh Duy Tran (Teikyo University)*; Son Minh Nguyen (Teikyo University); Masayuki Arai (Teikyo Univ.) |
Depth-Adapted CNN for RGB-D cameras | Zongwei WU (Univ. Bourgogne Franche-Comte, France)*; Guillaume Allibert (Université Côte d’Azur, CNRS, I3S, France ); Christophe Stolz (Univ. Bourgogne Franche-Comte, France); Cedric Demonceaux (Univ. Bourgogne Franche-Comte, France) |
Unified Application of Style Transfer for Face Swapping and Reenactment | Le Minh Ngo (University of Amsterdam)*; Christian aan de Wiel (3DUniversum); Sezer Karaoglu (University of Amsterdam); Theo Gevers (University of Amsterdam) |
Self-supervised Learning of Orc-Bert Augmentator for Recognizing Few-Shot Oracle Characters | Wenhui Han (Fudan University); Xinlin Ren (Fudan University); Hangyu Lin (Fudan University); Yanwei Fu (Fudan University)*; Xiangyang Xue (Fudan University) |
Title | Authors |
Visual Tracking by TridentAlign and Context Embedding | Janghoon Choi (Seoul National University)*; Junseok Kwon (Chung-Ang Univ., Korea); Kyoung Mu Lee (Seoul National University) |
Multi-View Consistency Loss for Improved Single-Image 3D Reconstruction of Clothed People | Akin Caliskan (Center for Vision Speech and Signal Processing – University of Surrey)*; Armin Mustafa (University of Surrey); Evren Imre (Vicon); Adrian Hilton (University of Surrey) |
Multiple Exemplars-based Hallucination for Face Super-resolution and Editing | Kaili Wang (KU Leuven, UAntwerpen)*; Jose Oramas (UAntwerp, imec-IDLab); Tinne Tuytelaars (KU Leuven) |
Audiovisual Transformer with Instance Attention for Audio-Visual Event Localization | Yan-Bo Lin (National Taiwan University)*; Yu-Chiang Frank Wang (National Taiwan University) |
Towards Robust Fine-grained Recognition by Maximal Separation of Discriminative Features | Krishna Kanth Nakka (EPFL)*; Mathieu Salzmann (EPFL) |
Semantic Synthesis of Pedestrian Locomotion | Maria Priisalu (Lund University)*; Ciprian Paduraru (IMAR); Aleksis Pirinen (Lund University); Cristian Sminchisescu (Lund University) |
Image Captioning through Image Transformer | Sen He (University of Exeter)*; Wentong Liao (Leibniz University Hannover); Hamed R. Tavakoli (Nokia Technologies); Michael Yang (University of Twente); Bodo Rosenhahn (Leibniz University Hannover); Nicolas Pugeault (University of Glasgow) |
Learn more, forget less: Cues from human brain | Arijit Patra (University of Oxford); Tapabrata Chakraborti (University of Oxford)* |
Learning Local Feature Descriptors for Multiple Object Tracking | Dmytro Borysenko (Samsung R&D Institute Ukraine); Dmytro Mykheievskyi (Samsung R&D Institute Ukraine); Viktor Porokhonskyy (Samsung Research&Development Institute Ukraine (SRK))* |
Towards Optimal Filter Pruning with Balanced Performance and Pruning Speed | Dong Li (Nuctech)*; Sitong Chen (Nuctech); Xudong Liu (Nuctech); Yunda Sun (Nuctech); Li Zhang (Nuctech) |
Fully Supervised and Guided Distillation for One-Stage Detectors | Deyu Wang (Canon Information Technology (Beijing) Co., LTD)*; Dongchao Wen (Canon Information Technology (Beijing) Co., LTD); Junjie Liu (Canon Information Technology (Beijing) Co., LTD); Wei Tao (Canon Information Technology (Beijing) Co., LTD); Tse-Wei Chen (Canon Inc.); Kinya Osa (Canon Inc.); Masami Kato (Canon Inc.) |
V2A – Vision to Action: Learning robotic arm actions based on vision and language | Michal Nazarczuk (Imperial College London)*; Krystian Mikolajczyk (Imperial College London) |
Data-Efficient Ranking Distillation for Image Retrieval | Zakaria Laskar (Aalto University)*; Juho Kannala (Aalto University, Finland) |
3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings | Giorgia Pitteri (Université de Bordeaux, LaBRI)*; Aureélie Bugeau (University of Bordeaux); Slobodan Ilic (Siemens AG); Vincent Lepetit (Ecole des Ponts ParisTech) |
Contextual Semantic Interpretability | Diego Marcos (Wageningen University)*; Ruth Fong (University of Oxford); Sylvain Lobry (Wageningen University and Research); Rémi Flamary (Université Côte d’Azur); Nicolas Courty (UBS); Devis Tuia (Wageningen University and Research) |
MatchGAN: A Self-Supervised Semi-Supervised Conditional Generative Adversarial Network | Jiaze Sun (Imperial College London)*; Binod Bhattarai (Imperial College London); Tae-Kyun Kim (Imperial College London) |
CLASS: Cross-Level Attention and Supervision for Salient Objects Detection | Lv Tang (Nanjing University)*; Bo Li (Nanjing University) |
Tracking-by-Trackers with a Distilled and Reinforced Model | Matteo Dunnhofer (University of Udine)*; Niki Martinel (University of Udine); CHRISTIAN MICHELONI (University of Udine, Italy) |
Adaptive Spatio-Temporal Regularized Correlation Filters for UAV-based Tracking | Libin Xu (Shandong University of Technology); Qilei Li (Sichuan University); Jun Jiang ( Southwest Petroleum University;Sichuan University of Science & Engineering); Guofeng Zou (Shandong University of Technology); Zheng Liu (University of British Columbia); Mingliang Gao (Shandong University of Technology)* |
A Calibration Method for the Generalized Imaging Model with Uncertain Calibration Target Coordinates | David Uhlig (Karlsruhe Institute of Technology)*; Michael Heizmann (Karlsruher Institut fuer Technologie) |
Localize to Classify and Classify to Localize: Mutual Guidance in Object Detection | Heng Zhang (Univ Rennes 1)*; Elisa Fromont (Université Rennes 1, IRISA/INRIA rba); Sébastien Lefèvre (Université de Bretagne Sud / IRISA); Bruno Avignon (Atermes) |
Decoupled Spatial-Temporal Attention Network for Skeleton-Based Action-Gesture Recognition | Lei Shi (Institute of Automation,Chinese Academy of Sciences )*; Yifan Zhang (Institute of Automation, Chinese Academy of Sciences); Jian Cheng (“Chinese Academy of Sciences, China”); Hanqing Lu (NLPR, Institute of Automation, CAS) |
Reconstructing Creative Lego Models | George Tattersall (University of York)*; Dizhong Zhu (University of York); William A. P. Smith (University of York); Sebastian Deterding (University of York); Patrik Huber (University of York) |
Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation | Siqi Yang (University of Queensland)*; Lin Wu (University of Queensland); Arnold Wiliem (the University of Queensland); Brian C. Lovell (University of Queensland) |
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation | Haisheng Su (SenseTime Group Limited)* |
A Two-Stage Minimum Cost Multicut Approach to Self-Supervised Multiple Person Tracking | Kalun Ho (Fraunhofer ITWM)*; Amirhossein Kardoost (University of Mannheim); Franz-Josef Pfreundt (Fraunhofer ITWM); Janis Keuper (hs-offenburg); Margret Keuper (University of Mannheim) |
Cascaded Transposed Long-range Convolutions for Monocular Depth Estimation | Go Irie (NTT Corporation)*; Daiki Ikami (NTT Corporation); Takahito Kawanishi (NTT Corporation); Kunio Kashino (NTT Communication Science Laboratories) |
Learning to Adapt to Unseen Abnormal Activities under Weak Supervision | JaeYoo Park (Seoul National University)*; Junha Kim (Seoul National University); Bohyung Han (Seoul National University) |
Bi-Directional Attention for Joint Instance and Semantic Segmentation in Point Clouds | guangnan wu (Shandong university)*; Zhiyi Pan (Shandong University); Peng Jiang (Shandong University); Changhe Tu (Shandong University) |
Weakly-supervised Reconstruction of 3D Objects with Large Shape Variation from Single In-the-Wild Images | Shichen Sun (Sichuan University); Zhengbang Zhu (Sichuan University); Xiaowei Dai (Sichuan University); Qijun Zhao (Sichuan University)*; Jing Li (Sichuan University) |
Large-Scale Cross-Domain Few-Shot Learning | Jiechao Guan (Renmin University of China); Manli Zhang (Renmin University of China); Zhiwu Lu (Renmin University of China)* |
Overwater Image Dehazing via Cycle-Consistent Generative Adversarial Network | Shunyuan Zheng (Harbin Institute of Technology)*; Jiamin Sun (Harbin Institute of Technology); Qinglin Liu (Harbin Institute of Technology); Yuankai Qi (Harbin Institute of Technology); Shengping Zhang (Harbin Institute of Technology) |
Self-supervised Sparse to Dense Motion Segmentation | Amirhossein Kardoost (University of Mannheim)*; Kalun Ho (Fraunhofer ITWM); Peter Ochs (Saarland University); Margret Keuper (University of Mannheim) |
Multi-task Learning with Future States for Vision-based Autonomous Driving | Inhan Kim (POSTECH)*; Hyemin Lee (POSTECH); Joonyeong Lee (POSTECH); Eunseop Lee (POSTECH); Daijin Kim (Pohang University of Science and Technology) |
Title | Authors |
Class-incremental Learning with Rectified Feature-Graph Preservation | Cheng-Hsun Lei (National Chiao Tung University); Yi-Hsin Chen (National Chiao Tung University); Wen-Hsiao Peng (National Chiao Tung University); Wei-Chen Chiu (National Chiao Tung University)* |
Patch SVDD: Patch-level SVDD for Anomaly Detection and Segmentation | Jihun Yi (Seoul National University); Sungroh Yoon (Seoul National University)* |
Learning More Accurate Features for Semantic Segmentation in CycleNet | Linzi Qu (Xidian University)*; Lihuo He (Xidian University); JunJie Ke (Xidian University); Xinbo Gao (Xidian University); Wen Lu (Xidian University) |
AFN: Attentional Feedback Network based 3D Terrain Super-Resolution | Ashish Kubade (International Institute Of Information Technology, Hyderabad)*; Diptiben Patel (IIIT Hyderabad); Avinash Sharma (CVIT, IIIT-Hyderabad); K. S. Rajan (IIIT Hyderabad) |
Backbone Based Feature Enhancement for Object Detection | Haoqin Ji (Shenzhen University); Weizeng Lu (Shenzhen University); Linlin Shen (Shenzhen University)* |
Part-aware Attention Network for Person Re-Identification | Wangmeng Xiang (The Hong Kong Polytechnic University); Jianqiang Huang (Damo Academy, Alibaba Group); Xian-Sheng Hua (Alibaba Group); Lei Zhang (“Hong Kong Polytechnic University, Hong Kong, China”)* |
Class-Wise Difficulty-Balanced Loss for Solving Class-Imbalance | Saptarshi Sinha (Hitachi CRL)*; Hiroki Ohashi (Hitachi Ltd); Katsuyuki Nakamura (Hitachi Ltd.) |
Dehazing Cost Volume for Deep Multi-view Stereo in Scattering Media | Yuki Fujimura (Kyoto University)*; Motoharu Sonogashira (Kyoto University); Masaaki Iiyama (Kyoto University) |
Chromatic Aberration Correction Using Cross-Channel Prior in Shearlet Domain | Kunyi Li (Tsinghua University); Xin Jin (Tsinghua University)* |
Title | Authors |
TTPLA: An Aerial-Image Dataset for Detection and Segmentation of Transmission Towers and Power Lines | Rabab Abdelfattah (University of South Carolina)*; XIAOFENG Wang (USC); Song Wang (University of South Carolina) |
Accurate Arbitrary-Shaped Scene Text Detection via Iterative Polynomial Parameter Regression | Jiahao Shi (Nanjing University); Long Chen (Nanjing University); Feng Su (Nanjing University)* |
DiscFace: Minimum Discrepancy Learning for Deep Face Recognition | Insoo Kim (Samsung Advanced Institute of Technology)*; Seungju Han (Samsung Advanced Institute of Technology); Seong-Jin Park (Samsung Advanced Institute of Technology); Ji-won Baek (Samsung Advanced Institute of Technology); Jinwoo Shin (KAIST); Jae-Joon Han (Samsung); Changkyu Choi (Samsung) |
Synergistic Saliency and Depth Prediction for RGB-D Saliency Detection | Yue Wang (Dalian University of Technology); Yuke Li (UC Berkeley); James H. Elder (York University); Runmin Wu (Dalian University of Technology ); Huchuan Lu (Dalian University of Technology)*; Lu Zhang (Dalian University of Technology) |
Query by Strings and Return Ranking Word Regions with Only One Look | Peng Zhao (Beijing Jiaotong University); Wenyuan Xue (Beijing Jiaotong University); Qingyong Li (Beijing Jiaotong University)*; Siqi Cai (Beijing Jiaotong University) |
Reweighted Non-convex Non-smooth Rank Minimization based Spectral Clustering on Grassmann Manifold | Xinglin Piao (Peng Cheng Laboratory; Peking University; Dalian University of Technology)*; Yongli Hu (Beijing University of Technology); Junbin Gao (University of Sydney, Australia); Yanfeng Sun (Beijing University of Technology); Xin Yang (Dalian University of Technology); Baocai Yin (Beijing University of Technology) |
Knowledge Transfer Graph for Deep Collaborative Learning | Soma Minami (Chubu university)*; Tsubasa Hirakawa (Chubu University); Takayoshi Yamashita (Chubu University); Hironobu Fujiyoshi (Chubu University) |
Spatial Temporal Attention Graph Convolutional Networks with Mechanics-Stream for Skeleton-based Action Recognition | Katsutoshi Shiraki (Chubu University)*; Tsubasa Hirakawa (Chubu University); Takayoshi Yamashita (Chubu University); Hironobu Fujiyoshi (Chubu University) |
Multi-scale Attentive Residual Dense Network for Single Image Rain Removal | Xiang Chen (Shenyang Aerospace University ); Yufeng Huang (Shenyang Aerospace University)*; Lei Xu (Shenyang Fire Science and Technology Research Institute of MEM) |
End-to-end Model-based Gait Recognition | Xiang Li (Nanjing University of Science and Technology)*; Yasushi Makihara (“””Osaka University, Japan”””); Chi Xu (Nanjing University of Science and Technology); Yasushi Yagi (Osaka University); Shiqi Yu (Southern University of Science and Technology, China); Mingwu Ren (Nanjing University of Science and Technology) |
Quantum Robust Fitting | Tat-Jun Chin (University of Adelaide); David Suter (Edith Cowan University); Shin-Fang Ch’ng (The University of Adelaide)*; James Quach (The University of Adelaide) |
Hyperparameter-Free Out-of-Distribution Detection Using Cosine Similarity | Engkarat Techapanurak (Tohoku University)*; Masanori Suganuma (RIKEN AIP / Tohoku University); Takayuki Okatani (Tohoku University/RIKEN AIP) |
Second Order enhanced Multi-glimpse Attention in Visual Question Answering | Qiang Sun (Fudan University)*; Binghui Xie (Fudan University); Yanwei Fu (Fudan University) |
Restoring Spatially-Heterogeneous Distortions using Mixture of Experts Network | Sijin Kim (Ajou University); Namhyuk Ahn (Ajou University); Kyung-Ah Sohn (Ajou University)* |
Unified Density-Aware Image Dehazing and Object Detection in Real-World Hazy Scenes | Zhengxi Zhang (Nanjing University of Science & Technology); Liang Zhao (Nanjing University of Science & Technology); Yunan Liu (Nanjing University of Science & Technology); Shanshan Zhang (Max Planck Institute for Informatics)*; Jian Yang (Nanjing University of Science and Technology) |
Augmentation Network for Generalised Zero-Shot Learning | RAFAEL FELIX (The University of Adelaide)*; Michele Sasdelli (The University of Adelaide); Ian Reid (“University of Adelaide, Australia”); Gustavo Carneiro (University of Adelaide) |
Channel Recurrent Attention Networks for Video Pedestrian Retrieval | Pengfei Fang (The Australian National University)*; Pan Ji (OPPO US Research Center); Jieming Zhou (The Australian National University); Lars Petersson (Data61/CSIRO); Mehrtash Harandi (Monash University) |
Scale-Aware Polar Representation for Arbitrarily-Shaped Text Detection | Yanguang Bi (SenseTime Research); Zhiqiang Hu (SenseTime Research)* |
Background Learnable Cascade for Zero-Shot Object Detection | Ye Zheng (Institute of Computing Technology, Chinese Academy of Sciences)*; Ruoran Huang (Institute of Computing Technology, Chinese Academy of Sciences); Chuanqi Han (Institute of Computing Technology, Chinese Academy of Sciences); Xi Huang (Institute of computing technology of the Chinese Academy of Sciences); Li Cui ( Institute of computing technology of the Chinese Academy of Sciences) |
RAF-AU Database: In-the-Wild Facial Expressions with Subjective Emotion Judgement and Objective AU Annotations | Wen-Jing Yan (JD Digits)*; Shan Li (Beijing University of Posts and Telecommunications); Chengtao Que (JD Digits); Jiquan Pei (JD Digits); Weihong Deng (Beijing University of Posts and Telecommunications) |
EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention Fusion | Chia-Yuan Chang (National Taiwan University)*; Shuo-En Chang (National Taiwan University); Pei-Yung Hsiao (National University of Kaohsiung); Li-Chen Fu (National Taiwan University) |
Faster Self-adaptive Deep Stereo | Haiyang Wang (Zhejiang University)*; Xinchao Wang (Stevens Institute of Technology); Jie Song (Zhejiang University); Jie Lei (Zhejiang University); Mingli Song (Zhejiang University) |
Multi-label X-ray Imagery Classification via Bottom-up Attention and Meta Fusion | Benyi Hu (Xi’an Jiaotong University)*; Chi Zhang (Xi’an Jiaotong Univiersity); Le Wang (Xi’an Jiaotong University); Qilin Zhang (HERE Technologies); Yuehu Liu (Xi’an Jiaotong University) |
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network | Yuxuan Wu (The University of Tokyo)*; Hideki Nakayama (The University of Tokyo) |
Visualizing Color-wise Saliency of Black-Box Image Classification Models | Yuhki Hatakeyama (SenseTime Japan)*; Hiroki Sakuma (SenseTime Japan); Yoshinori Konishi (SenseTime Japan); Kohei Suenaga (Kyoto University) |
Attention-Aware Feature Aggregation for Real-time Stereo Matching on Edge Devices | Jia-Ren Chang (National Chiao Tung University; aetherAI); Pei-Chun Chang (National Chiao Tung University); Yong-Sheng Chen (National Chiao Tung University)* |
Emotional Landscape Image Generation Using Generative Adversarial Networks | Chanjong Park (Yonsei University); In-Kwon Lee (Yonsei University)* |
Mask-Ranking Network for Semi-Supervised Video Object Segmentation | Wenjing Li (University of Electronic Science & Technology of China)*; Xiang Zhang (University of Electronic Science & Technology of China); Yujie Hu (University of Electronic Science & Technology of China); Yingqi Tang (University of Electronic Science & Technology of China) |
Title | Authors |
Condensed Movies: Story Based Retrieval with Contextual Embeddings | Max Bain (University of Oxford)*; Arsha Nagrani (Oxford University ); Andrew Brown (University of Oxford); Andrew Zisserman (University of Oxford) |
Goal-GAN: Multimodal Trajectory Prediction Based on Goal Position Estimation | Patrick Dendorfer (TUM)*; Aljosa Osep (TUM Munich); Laura Leal-Taixé (TUM) |
Learning 3D Face Reconstruction with a Pose Guidance Network | Pengpeng Liu (The Chinese University of Hong Kong)*; Xintong Han (Huya AI); Michael Lyu (The Chinese University of Hong Kong); Irwin King (The Chinese University of Hong Kong); Jia Xu (Huya AI) |
FKAConv: Feature-Kernel Alignment for Point Cloud Convolution | Alexandre Boulch (valeo.ai)*; Gilles Puy (Valeo); Renaud Marlet (Ecole des Ponts ParisTech) |
EvolGAN: Evolutionary Generative Adversarial Networks | Baptiste Roziere (Facebook AI Research); Fabien Teytaud (Univ. Littoral Cote d’Opale); Vlad Hosu (University of Konstanz); Hanhe Lin (University of Konstanz); Jeremy Rapin (Facebook AI Research); Mariia Zameshina (Inria); Olivier Teytaud (Facebook)* |
L2R GAN: LiDAR-to-Radar Translation | LeiChen Wang (Daimler AG)*; Bastian Goldluecke (University of Konstanz); Carsten Anklam (Daimler AG) |
Introspective Learning by Distilling Knowledge from Online Self-explanation | Jindong Gu (University of Munich)*; Zhiliang Wu (Siemens AG and Ludwig Maximilian University of Munich); Volker Tresp (Siemens AG and Ludwig Maximilian University of Munich ) |
Meta-Learning with Context-Agnostic Initialisations | Toby Perrett (University of Bristol)*; Alessandro Masullo (University of Bristol); Tilo Burghardt (University of Bristol); Majid Mirmehdi (University of Bristol); Dima Damen (University of Bristol) |
Visually Guided Sound Source Separation using Cascaded Opponent Filter Network | Lingyu Zhu (Tampere University)*; Esa Rahtu (Tampere University) |
DeepSEE: Deep Disentangled Semantic Explorative Extreme Super-Resolution | Marcel C. Bühler (ETH Zürich)*; Andrés Romero (ETH Zürich); Radu Timofte (ETH Zurich) |
Project to Adapt: Domain Adaptation for Depth Completion from Noisy and Sparse Sensor Data | Adrian Lopez-Rodriguez (Imperial College London)*; Benjamin Busam (Technical University of Munich); Krystian Mikolajczyk (Imperial College London) |
Mapping of Sparse 3D Data using Alternating Projection | Siddhant Ranade (University of Utah); Xin Yu (University of Utah); Shantnu Kakkar (Trimble); Pedro Miraldo (Instituto Superior Técnico, Lisboa); Srikumar Ramalingam (University of Utah)* |
Title | Authors |
Self-Guided Multiple Instance Learning for Weakly Supervised Thoracic Disease Classification and Localization in Chest Radiographs | Constantin Seibold (Karlsruhe Institute of Technology)*; Jens Kleesiek (German Cancer Research Center); Heinz-Peter Schlemmer (German Cancer Research Center); Rainer Stiefelhagen (Karlsruhe Institute of Technology) |
Play Fair: Frame Contributions in Video Models | Will Price (University of Bristol)*; Dima Damen (University of Bristol) |
Recursive Bayesian Filtering for Multiple Human Pose Tracking from Multiple Cameras | Oh-Hun Kwon (University of Bonn); Julian Tanke (University of Bonn)*; Jürgen Gall (University of Bonn) |
Interpreting Video Features: A Comparison of 3D Convolutional Networks and Convolutional LSTM Networks | Joonatan Mänttäri (KTH Royal Institute of Technology); Sofia Broomé (KTH Royal Institute of Technology)*; John Folkesson (KTH Royal Institute of Technology); Hedvig Kjellström (KTH Royal Institute of Technology) |
Spatial Class Distribution Shift in Unsupervised Domain Adaptation: Local Alignment Comes to Rescue | Safa Cicek (UCLA)*; Ning Xu (Adobe Research); Zhaowen Wang (Adobe Research); Hailin Jin (Adobe Research); Stefano Soatto (UCLA) |
Double Targeted Universal Adversarial Perturbations | Philipp Benz (KAIST)*; Chaoning Zhang (KAIST); Tooba Imtiaz (KAIST); In So Kweon (KAIST) |
Robust High Dynamic Range (HDR) Imaging with Complex Motion and Parallax | Zhiyuan Pu (NanJing University); Peiyao Guo (Nanjing University); M. Salman Asif (University of California, Riverside); Zhan Ma (Nanjing University)* |
Faster, Better and More Detailed: 3D Face Reconstruction with Graph Convolutional Networks | Shiyang Cheng (Samsung)*; Georgios Tzimiropoulos (Samsung AI); Jie Shen (Imperial College London); Maja Pantic (Samsung AI Centre Cambridge/ Imperial College London ) |
Understanding Motion in Sign Language: A New Structured Translation Dataset | Jefferson Rodriguez (UIS); Juan Chacon (UIS); Edgar Rangel (UIS); Luis Guayacan (UIS); Claudia Hernandez (UIS); Luisa Hernandez (UIS); Fabio Martinez (UIS )* |
Trainable Structure Tensors for Autonomous Baggage Threat Detection Under Extreme Occlusion | Taimur Hassan (Khalifa University of Science and Technology)*; Naoufel Werghi (Khalifa University of Science and Technology) |
Towards Robust Fine-grained Recognition by Maximal Separation of Discriminative Features | Krishna Kanth Nakka (EPFL)*; Mathieu Salzmann (EPFL) |
Semantic Synthesis of Pedestrian Locomotion | Maria Priisalu (Lund University)*; Ciprian Paduraru (IMAR); Aleksis Pirinen (Lund University); Cristian Sminchisescu (Lund University) |
HDD-Net: Hybrid Detector Descriptor with Mutual Interactive Learning | Axel Barroso-Laguna (Imperial College London)*; Yannick Verdie (Huawei Noah’s Ark Lab); Benjamin Busam (Technical University of Munich); Krystian Mikolajczyk (Imperial College London) |
Betrayed by Motion: Camouflaged Object Discovery via Motion Segmentation | Hala Lamdouar (University of Oxford)*; Charig Yang (University of Oxford); Weidi Xie (University of Oxford); Andrew Zisserman (University of Oxford) |
Sequential View Synthesis with Transformer | Phong Nguyen-Ha (University of Oulu)*; Lam Huynh ( University of Oulu); Esa Rahtu (Tampere University); Janne Heikkila (University of Oulu, Finland) |
Image Captioning through Image Transformer | Sen He (University of Exeter)*; Wentong Liao (Leibniz University Hannover); Hamed R. Tavakoli (Nokia Technologies); Michael Yang (University of Twente); Bodo Rosenhahn (Leibniz University Hannover); Nicolas Pugeault (University of Glasgow) |
Novel-View Human Action Synthesis | Mohamed Ilyes Lakhal (Queen Mary University of London)*; Davide Boscaini (Fondazione Bruno Kessler); Fabio Poiesi (Fondazione Bruno Kessler); Oswald Lanz (Fondazione Bruno Kessler, Italy); Andrea Cavallaro (Queen Mary University of London, UK) |
Learning Local Feature Descriptors for Multiple Object Tracking | Dmytro Borysenko (Samsung R&D Institute Ukraine); Dmytro Mykheievskyi (Samsung R&D Institute Ukraine); Viktor Porokhonskyy (Samsung Research&Development Institute Ukraine (SRK))* |
Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks | Christoph Raab (FHWS)*; Philipp Väth (FHWS); Peter Meier (FHWS); Frank-Michael Schleif (FHWS) |
Active Learning for Video Description With Cluster-Regularized Ensemble Ranking | David M. Chan (University of California, Berkeley)*; Sudheendra Vijayanarasimhan (Google research); David A. Ross (Google); John F. Canny (UC Berkeley) |
Video-Based Crowd Counting Using a Multi-Scale Optical Flow Pyramid Network | Mohammad Asiful Hossain (HUAWEI Technologies Co, LTD.)*; Kevin Cannons (Huawei Technologies Canada Co., Ltd ); Daesik Jang (Personal Research); Fabio Cuzzolin (Oxford Brookes University); Zhan Xu (Huawei Canada) |
Few-Shot Zero-Shot Learning: Knowledge Transfer with Less Supervision | Nanyi Fei (Renmin University of China); Jiechao Guan (Renmin University of China); Zhiwu Lu (Renmin University of China)*; Yizhao Gao (Renmin University of China) |
Synthetic-to-real domain adaptation for lane detection | Noa Garnett (GM); Roy Uziel (Ben-Gurion University); Netalee Efrat (General Motors); Dan Levi (General Motors)* |
Dynamic Depth Fusion and Transformation for Monocular 3D Object Detection | Erli Ouyang (Fudan University)*; Li Zhang (University of Oxford); Mohan Chen (Fudan University); Anurag Arnab (University of Oxford); Yanwei Fu (Fudan University) |
OpenTraj: Assessing Prediction Complexity in Human Trajectories Datasets | Javad Amirian (Inria, Rennes, France)*; Bingqing Zhang (UCL); Francisco Valente Castro (Cimat); Juan Jose Baldelomar (Cimat); Jean-Bernard Hayet (CIMAT); Julien Pettré (INRIA Rennes – Bretagne Atlantique) |
Contrastively Smoothed Class Alignment for Unsupervised Domain Adaptation | Shuyang Dai (Duke University)*; Yu Cheng (Microsoft); Yizhe Zhang (Microsoft Research); Zhe Gan (Microsoft); Jingjing Liu (Microsoft); Lawrence Carin (CS) |
Adversarially Robust Deep Image Super-Resolution using Entropy Regularization | Jun-Ho Choi (Yonsei University); Huan Zhang (UCLA); Jun-Hyuk Kim (Yonsei University); Cho-Jui Hsieh (UCLA); Jong-Seok Lee (“Yonsei University, Korea”)* |
MMD based Discriminative Learning for Face Forgery Detection | Jian Han (University of Amsterdam)*; Theo Gevers (University of Amsterdam) |
V2A – Vision to Action: Learning robotic arm actions based on vision and language | Michal Nazarczuk (Imperial College London)*; Krystian Mikolajczyk (Imperial College London) |
Data-Efficient Ranking Distillation for Image Retrieval | Zakaria Laskar (Aalto University)*; Juho Kannala (Aalto University, Finland) |
3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings | Giorgia Pitteri (Université de Bordeaux, LaBRI)*; Aureélie Bugeau (University of Bordeaux); Slobodan Ilic (Siemens AG); Vincent Lepetit (Ecole des Ponts ParisTech) |
Contextual Semantic Interpretability | Diego Marcos (Wageningen University)*; Ruth Fong (University of Oxford); Sylvain Lobry (Wageningen University and Research); Rémi Flamary (Université Côte d’Azur); Nicolas Courty (UBS); Devis Tuia (Wageningen University and Research) |
MatchGAN: A Self-Supervised Semi-Supervised Conditional Generative Adversarial Network | Jiaze Sun (Imperial College London)*; Binod Bhattarai (Imperial College London); Tae-Kyun Kim (Imperial College London) |
COMET: Context-Aware IoU-Guided Network for Small Object Tracking | Seyed Mojtaba Marvasti-Zadeh (University of Alberta)*; Javad Khaghani (University of Alberta); Hossein Ghanei-Yakhdan (Yazd University); Shohreh Kasaei (Sharif University of Technology); Li Cheng (ECE dept., University of Alberta) |
Title | Authors |
Adversarial Image Composition with Auxiliary Illumination | Fangneng Zhan (Nanyang Technological University); Shijian Lu (Nanyang Technological University)*; Changgong Zhang (Alibaba Group); Feiying Ma (Alibaba); Xuansong Xie (Alibaba) |
Deep Snapshot HDR Imaging Using Multi-Exposure Color Filter Array | Takeru Suda (Tokyo Institute of Technology); Masayuki Tanaka (Tokyo Institute of Technology); Yusuke Monno (Tokyo Institute of Technology)*; Masatoshi Okutomi (Tokyo Institute of Technology) |
Class-incremental Learning with Rectified Feature-Graph Preservation | Cheng-Hsun Lei (National Chiao Tung University); Yi-Hsin Chen (National Chiao Tung University); Wen-Hsiao Peng (National Chiao Tung University); Wei-Chen Chiu (National Chiao Tung University)* |
Accurate and Efficient Single Image Super-Resolution with Matrix Channel Attention Network | Hailong Ma (Xiaomi); Xiangxiang Chu (Xiaomi); Bo Zhang (Xiaomi)* |
Domain Adaptation Gaze Estimation by Embedding with Prediction Consistency | Zidong Guo (Xi’an Jiaotong university)*; Zejian Yuan (Xi‘an Jiaotong University); Chong Zhang (Tencent Robotics X); Wanchao Chi (Tencent Robotics X); Yonggen Ling (Tencent); shenghao zhang (Tencent) |
DoFNet: Depth of Field Difference Learning for Detecting Image Forgery | Yonghyun Jeong (Samsung SDS)*; Jongwon Choi (Chung-Ang University); Doyeon Kim (SamsungSDS); Sehyeon Park (Samsung SDS); Minki Hong (Samsung SDS); Changhyun Park (Samsung SDS); Seungjai Min (Samsung SDS); Youngjune Gwon (Samsung SDS) |
Patch SVDD: Patch-level SVDD for Anomaly Detection and Segmentation | Jihun Yi (Seoul National University); Sungroh Yoon (Seoul National University)* |
Learning More Accurate Features for Semantic Segmentation in CycleNet | Linzi Qu (Xidian University)*; Lihuo He (Xidian University); JunJie Ke (Xidian University); Xinbo Gao (Xidian University); Wen Lu (Xidian University) |
A Benchmark and Baseline for Language-Driven Image Editing | Jing Shi (University of Rochester)*; Ning Xu (Adobe Research); Trung Bui (Adobe Research); Franck Dernoncourt (Adobe Research); Zheng Wen (DeepMind); Chenliang Xu (University of Rochester) |
Sparse Convolutions on Continuous Domains for Point Cloud and Event Stream Networks | Dominic Jack (Queensland University of Technology)*; Frederic Maire (Queensland University of Technology); SIMON DENMAN (Queensland University of Technology, Australia); Anders Eriksson (University of Queensland ) |
AFN: Attentional Feedback Network based 3D Terrain Super-Resolution | Ashish Kubade (International Institute Of Information Technology, Hyderabad)*; Diptiben Patel (IIIT Hyderabad); Avinash Sharma (CVIT, IIIT-Hyderabad); K. S. Rajan (IIIT Hyderabad) |
Title | Authors |
3D Guided Weakly Supervised Semantic Segmentation | Weixuan Sun (Australian National University, Data61 )*; Jing Zhang (Australian National University); Nick Barnes (ANU) |
Attended-Auxiliary Supervision Representation for Face Anti-spoofing | Son Minh Nguyen (Teikyo University)*; Linh Duy Tran (Teikyo University); Masayuki Arai (Teikyo Univ.) |
TSI: Temporal Scale Invariant Network for Action Proposal Generation | Shuming Liu (Shanghai Jiao Tong University); Xu Zhao (Shanghai Jiao Tong University)*; Haisheng Su (Shanghai Jiao Tong University); Zhilan Hu (Huawei) |
Local Facial Makeup Transfer via Disentangled Representation | Zhaoyang Sun (Wuhan University of Technology)*; Feng Liu (Wuhan University of Technology); Wen Liu (Wuhan University of Technology); Shengwu Xiong (Wuhan University of Technology); Wenxuan Liu (Wuhan University of Technology) |
Multi-Task Learning for Simultaneous Video Generation and Remote Photoplethysmography Estimation | Yun-Yun Tsou (National Tsing Hua University)*; Yi-An Lee (National Tsing Hua University); Chiou-Ting Hsu (National Tsing Hua University) |
SDCNet: Size Divide and Conquer Network for Salient Object Detection | Senbo Yan (Zhejiang University); Xiaowen Song (Zhejiang University)*; chuer yu (ZheJiang University) |
Attention-Based Fine-Grained Classification of Bone Marrow Cells | Weining Wang (South China University of Technology); Peirong Guo (South China University of Technology)*; Lemin Li (South China University of Technology); Yan Tan (Peking University People’s Hospital); Hongxia Shi (Peking University People’s Hospital); Yan Wei (Peking University People’s Hospital); Xiangmin Xu (South China University of Technology) |
Adaptive Spotting: Deep Reinforcement Object Search in 3D Point Clouds | Onkar Krishna (NTT Corporation, Japan)*; Go Irie (NTT Corporation); Xiaomeng Wu (NTT Corporation); Takahito Kawanishi (NTT Corporation); Kunio Kashino (NTT Communication Science Laboratories) |
BLT: Balancing Long-Tailed Datasets with Adversarially-Perturbed Images | Jedrzej Kozerawski (UC Santa Barbara); Victor Fragoso (Microsoft)*; Nikolaos Karianakis (Microsoft); Gaurav Mittal (Microsoft); Matthew Turk (TTIC); Mei Chen (Microsoft) |
TinyGAN: Distilling BigGAN for Conditional Image Generation | Ting-Yun Chang (National Taiwan University)*; Chi-Jen Lu (Academia Sinica) |
Deep Priors inside an Unrolled and Adaptive Deconvolution Model | Hung-Chih Ko (National Taiwan University); Je-Yuan Chang (National Taiwan University); Jian-Jiun Ding (National Taiwan University)* |
Imbalance Robust Softmax for Deep Embedding Learning | Hao Zhu (Australian National University)*; Yang Yuan (AnyVision); Guosheng Hu (AnyVision); Xiang Wu (Reconova); Neil Robertson (Queen’s University Belfast) |
Frequency Attention Network: Blind Noise Removal for Real Images | Hongcheng Mo (Shanghai Jiao Tong University); Jianfei Jiang (Shanghai Jiao Tong University); Qin Wang (Shanghai Jiao Tong University)*; Dong Yin (Fullhan); Pengyu Dong (Fullhan); Jingjun Tian (Fullhan) |
Learning End-to-End Action Interaction by Paired-Embedding Data Augmentation | Ziyang Song (Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University.)*; Zejian Yuan (Xi‘an Jiaotong University); Chong Zhang (Tencent Robotics X); Wanchao Chi (Tencent Robotics X); Yonggen Ling (Tencent); Shenghao Zhang (Tencent) |
Horizontal Flipping Assisted Disentangled Feature Learning for Semi-Supervised Person Re-Identification | Gehan Hao ( University of Electronic Science and Technology of China); Yang Yang (Institute of Automation, Chinese Academy of Sciences); Xue Zhou (University of Electronic Science and Technology of China)*; Guanan Wang (CASIA); Zhen Lei (NLPR, CASIA, China) |
Dense Dual-Path Network for Real-time Semantic Segmentation | Xinneng Yang (Tongji University)*; Yan Wu (Tongji University); Junqiao Zhao (Tongji University); Feilin Liu (Tongji University) |
Channel Pruning for Accelerating Convolutional Neural Networks via Wasserstein Metric | Haoran Duan (University of Science and Technology of China (USTC))*; Hui Li (University of Science and Technology of China (USTC)) |
Second-order Camera-aware Color Transformation for Cross-domain Person Re-identification | Wangmeng Xiang (The Hong Kong Polytechnic University); Hongwei Yong (The Hong Kong Polytechnic University); Jianqiang Huang (Damo Academy, Alibaba Group); Xian-Sheng Hua (Alibaba Group); Lei Zhang (“Hong Kong Polytechnic University, Hong Kong, China”)* |
TTPLA: An Aerial-Image Dataset for Detection and Segmentation of Transmission Towers and Power Lines | Rabab Abdelfattah (University of South Carolina)*; XIAOFENG Wang (USC); Song Wang (University of South Carolina) |
Accurate Arbitrary-Shaped Scene Text Detection via Iterative Polynomial Parameter Regression | Jiahao Shi (Nanjing University); Long Chen (Nanjing University); Feng Su (Nanjing University)* |
DiscFace: Minimum Discrepancy Learning for Deep Face Recognition | Insoo Kim (Samsung Advanced Institute of Technology)*; Seungju Han (Samsung Advanced Institute of Technology); Seong-Jin Park (Samsung Advanced Institute of Technology); Ji-won Baek (Samsung Advanced Institute of Technology); Jinwoo Shin (KAIST); Jae-Joon Han (Samsung); Changkyu Choi (Samsung) |
Synergistic Saliency and Depth Prediction for RGB-D Saliency Detection | Yue Wang (Dalian University of Technology); Yuke Li (UC Berkeley); James H. Elder (York University); Runmin Wu (Dalian University of Technology ); Huchuan Lu (Dalian University of Technology)*; Lu Zhang (Dalian University of Technology) |
Query by Strings and Return Ranking Word Regions with Only One Look | Peng Zhao (Beijing Jiaotong University); Wenyuan Xue (Beijing Jiaotong University); Qingyong Li (Beijing Jiaotong University)*; Siqi Cai (Beijing Jiaotong University) |
Reweighted Non-convex Non-smooth Rank Minimization based Spectral Clustering on Grassmann Manifold | Xinglin Piao (Peng Cheng Laboratory; Peking University; Dalian University of Technology)*; Yongli Hu (Beijing University of Technology); Junbin Gao (University of Sydney, Australia); Yanfeng Sun (Beijing University of Technology); Xin Yang (Dalian University of Technology); Baocai Yin (Beijing University of Technology) |
Branch Interaction Network for Person Re-identification | Zengming Tang (Shanghai Advanced Research Institute, Chinese Academy of Sciences, Shanghai, China)*; Jun Huang (Shanghai Advanced Research Institute, Chinese Academy of Sciences) |
Knowledge Transfer Graph for Deep Collaborative Learning | Soma Minami (Chubu university)*; Tsubasa Hirakawa (Chubu University); Takayoshi Yamashita (Chubu University); Hironobu Fujiyoshi (Chubu University) |
Over-exposure Correction via Exposure and Scene Information Disentanglement | Yuhui Cao (SECE, Shenzhen Graduate School, Peking University)*; Yurui Ren (Shenzhen Graduate School, Peking University); Thomas H. Li (Advanced Institute of Information Technology, Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University) |
Spatial Temporal Attention Graph Convolutional Networks with Mechanics-Stream for Skeleton-based Action Recognition | Katsutoshi Shiraki (Chubu University)*; Tsubasa Hirakawa (Chubu University); Takayoshi Yamashita (Chubu University); Hironobu Fujiyoshi (Chubu University) |
Show, Conceive and Tell: Image Captioning with Prospective Linguistic Information | Yiqing Huang (Tsinghua University); Jiansheng Chen (Tsinghua University)* |
Robust High Dynamic Range (HDR) Imaging with Complex Motion and Parallax | Zhiyuan Pu (NanJing University); Peiyao Guo (Nanjing University); M. Salman Asif (University of California, Riverside); Zhan Ma (Nanjing University)* |
Multi-scale Attentive Residual Dense Network for Single Image Rain Removal | Xiang Chen (Shenyang Aerospace University ); Yufeng Huang (Shenyang Aerospace University)*; Lei Xu (Shenyang Fire Science and Technology Research Institute of MEM) |
MBNet: A Multi-Task Deep Neural Network for Semantic Segmentation and Lumbar Vertebra Inspection on X-ray Images | Van Luan Tran (National Chung Cheng University)*; Huei-Yung Lin (National Chung Cheng University); Hsiao-Wei Liu (Industrial Technology Research Institute (ITRI)) |
Understanding Motion in Sign Language: A New Structured Translation Dataset | Jefferson Rodriguez (UIS); Juan Chacon (UIS); Edgar Rangel (UIS); Luis Guayacan (UIS); Claudia Hernandez (UIS); Luisa Hernandez (UIS); Fabio Martinez (UIS )* |
Trainable Structure Tensors for Autonomous Baggage Threat Detection Under Extreme Occlusion | Taimur Hassan (Khalifa University of Science and Technology)*; Naoufel Werghi (Khalifa University of Science and Technology) |
Title | Authors |
Watch, read and lookup: learning to spot signs from multiple supervisors | Liliane Momeni (University of Oxford); Gul Varol (University of Oxford)*; Samuel Albanie (University of Oxford); Triantafyllos Afouras (University of Oxford); Andrew Zisserman (University of Oxford) |
In Defense of LSTMs for Addressing Multiple Instance Learning Problems | Kaili Wang (KU Leuven, UAntwerpen)*; Jose Oramas (UAntwerp, imec-IDLab); Tinne Tuytelaars (KU Leuven) |
Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation | Alina Marcu (University “Politehnica” of Bucharest)*; Vlad Licaret (Autonomous Systems); Dragos Costea (University “Politehnica” of Bucharest); Marius Leordeanu (University “Politehnica” of Bucharest) |
Condensed Movies: Story Based Retrieval with Contextual Embeddings | Max Bain (University of Oxford)*; Arsha Nagrani (Oxford University ); Andrew Brown (University of Oxford); Andrew Zisserman (University of Oxford) |
Pre-training without Natural Images | Hirokatsu Kataoka (National Institute of Advanced Industrial Science and Technology (AIST))*; Kazushige Okayasu (National Institute of Advanced Industrial Science and Technology (AIST)); Asato Matsumoto (National Institute of Advanced Industrial Science and Technology (AIST)); Eisuke Yamagata (Tokyo Institute of Technology); Ryosuke Yamada (Tokyo Denki University); Nakamasa Inoue (Tokyo Institute of Technology); Akio Nakamura (Tokyo Denki University (TDU)); Yutaka Satoh (National Institute of Advanced Industrial Science and Technology (AIST)) |
Long-Term Cloth-Changing Person Re-identification | Xuelin Qian (Fudan University); Wenxuan Wang (Fudan University); Li Zhang (University of Oxford); Fangrui Zhu (Fudan University); Yanwei Fu (Fudan University)*; Tao Xiang (University of Surrey); Yu-Gang Jiang (Fudan University); Xiangyang Xue (Fudan University) |
Goal-GAN: Multimodal Trajectory Prediction Based on Goal Position Estimation | Patrick Dendorfer (TUM)*; Aljosa Osep (TUM Munich); Laura Leal-Taixé (TUM) |
Learning 3D Face Reconstruction with a Pose Guidance Network | Pengpeng Liu (The Chinese University of Hong Kong)*; Xintong Han (Malong Technologies); Michael Lyu (The Chinese University of Hong Kong); Irwin King (The Chinese University of Hong Kong); Jia Xu (Huya AI) |
Bidirectional Pyramid Networks for Semantic Segmentation | Dong Nie (UNC)*; Jia Xue (Rutgers University); Xiaofeng Ren (Alibaba group) |
FKAConv: Feature-Kernel Alignment for Point Cloud Convolution | Alexandre Boulch (valeo.ai)*; Gilles Puy (Valeo); Renaud Marlet (Ecole des Ponts ParisTech) |
EvolGAN: Evolutionary Generative Adversarial Networks | Baptiste Roziere (Facebook AI Research); Fabien Teytaud (Univ. Littoral Cote d’Opale); Vlad Hosu (University of Konstanz); Hanhe Lin (University of Konstanz); Jeremy Rapin (Facebook AI Research); Mariia Zameshina (Inria); Olivier Teytaud (Facebook)* |
L2R GAN: LiDAR-to-Radar Translation | LeiChen Wang (Daimler AG)*; Bastian Goldluecke (University of Konstanz); Carsten Anklam (Daimler AG) |
Title | Authors |
Adversarial Semi-Supervised Multi-Domain Tracking | Kourosh Meshgi (RIKEN AIP)*; Maryam Sadat Mirzaei (Riken AIP / Kyoto University) |
Any-Shot Object Detection | Shafin Rahman (North South University)*; Salman Khan (IIAI); Nick Barnes (ANU); Fahad Shahbaz Khan (Inception Institute of Artificial Intelligence) |
Degradation Model Learning for Real-World Single Image Super-resolution | Jin XIAO (The Hong Kong Polytechnic University)*; Hongwei Yong (The Hong Kong Polytechnic University); Lei Zhang (“Hong Kong Polytechnic University, Hong Kong, China”) |
CloTH-VTON: Clothing Three-dimensional reconstruction for Hybrid image-based Virtual Try-ON | Matiur Rahman Minar (Seoul National University of Science and Technology); Heejune Ahn (Seoul National Univ. of Science and Technology)* |
A Global to Local Double Embedding Method for Multi-person Pose Estimation | Yiming Xu (UESTC)*; Jiaxin Li (Beijing Institute of Technology); Yan Ding (Beijing Institute of Technology); Hua-Liang Wei (University of Sheffield) |
MTNAS: Search Multi-Task Networks for Autonomous Driving | Hao Liu (Beijing Institute of Technology)*; Dong Li (Xilinx); JinZhang Peng (Xilinx); Qingjie Zhao (Beijing Institute of Technology); Lu Tian (Xilinx,Inc.); Yi Shan (Xilinx) |
Pose Correction Algorithm for Relative Frames between Keyframes in SLAM | Youngseok Jang (Seoul National University)*; Hojoon Shin (Seoul National University); H. Jin Kim (Seoul National University) |
Motion Prediction Using Temporal Inception Module | Tim Lebailly (EPFL)*; Sena Kiciroglu (EPFL (École polytechnique fédérale de Lausanne)); Mathieu Salzmann (EPFL); Pascal Fua (EPFL, Switzerland); Wei Wang (EPFL) |
VAN: Versatile Affinity Network for End-to-end Online Multi-Object Tracking | Hyemin Lee (POSTECH)*; Inhan Kim (POSTECH); Daijin Kim (Pohang University of Science and Technology) |
Low-light Color Imaging via Dual Camera Acquisition | Peiyao Guo (Nanjing University); Zhan Ma (Nanjing University)* |
Low-level Sensor Fusion Network for 3D Vehicle Detection using Radar Range-Azimuth Heatmap and Monocular Image | Jinhyeong Kim ( Korea Advanced Institute of Science and Technology); Youngseok Kim (Korea Advanced Institute of Science and Technology (KAIST))*; Dongsuk Kum (Korea Advanced Institute of Science and Technology) |
Self-Guided Multiple Instance Learning for Weakly Supervised Thoracic Disease Classification and Localization in Chest Radiographs | Constantin Seibold (Karlsruhe Institute of Technology)*; Jens Kleesiek (German Cancer Research Center); Heinz-Peter Schlemmer (German Cancer Research Center); Rainer Stiefelhagen (Karlsruhe Institute of Technology) |
Play Fair: Frame Contributions in Video Models | Will Price (University of Bristol)*; Dima Damen (University of Bristol) |
Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses | Miao Liao (Baidu)*; Sibo Zhang (Baidu); Peng Wang (Baidu USA LLC.); Hao Zhu (Nanjing University); Xinxin Zuo (University of Kentucky); Ruigang Yang (University of Kentucky, USA) |
DeepVoxels++: Enhancing the Fidelity of Novel View Synthesis from 3D Voxel Embeddings | Tong He (UCLA)*; John Collomosse (Adobe Research); Hailin Jin (Adobe Research); Stefano Soatto (UCLA) |
Unpaired Multimodal Facial Expression Recognition | Bin Xia (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)* |
Towards Fast and Robust Adversarial Training for Image Classification | Erh-Chung Chen (National Tsing Hua University)*; Che-Rung Lee (National Tsing Hua University ) |
MIX’EM: Unsupervised Image Classification using a Mixture of Embeddings | Ali Varamesh (KU Leuven)*; Tinne Tuytelaars (KU Leuven) |
Road Obstacle Detection Method Based on an Autoencoder with Semantic Segmentation | Toshiaki Ohgushi (TOYOTA); Kenji Horiguchi (TOYOTA); Masao Yamanaka (TOYOTA)* |
Recursive Bayesian Filtering for Multiple Human Pose Tracking from Multiple Cameras | Oh-Hun Kwon (University of Bonn); Julian Tanke (University of Bonn)*; Jürgen Gall (University of Bonn) |
Real-Time Segmentation Networks should be Latency Aware | Evann Courdier (Idiap Research Institute)*; François Fleuret (University of Geneva) |
Interpreting Video Features: A Comparison of 3D Convolutional Networks and Convolutional LSTM Networks | Joonatan Mänttäri (KTH Royal Institute of Technology); Sofia Broomé (KTH Royal Institute of Technology)*; John Folkesson (KTH Royal Institute of Technology); Hedvig Kjellström (KTH Royal Institute of Technology) |
FootNet: An efficient convolutional network for multiview 3D foot reconstruction | Felix Kok (Cambridge University)*; James Charles (Cambridge University); Roberto Cipolla (University of Cambridge) |
FAN: Feature Adaptation Network for Surveillance Face Recognition and Normalization | Xi Yin (Microsoft Cloud & AI)*; Ying Tai (Tencent YouTu); Yuge Huang (Tencent YouTu); Xiaoming Liu (Michigan State University) |
Regularizing Meta-Learning via Gradient Dropout | Hung-Yu Tseng (University of California, Merced)*; Yi-Wen Chen (University of California, Merced); Yi-Hsuan Tsai (NEC Labs America); Sifei Liu (NVIDIA); Yen-Yu Lin (National Chiao Tung University); Ming-Hsuan Yang (University of California at Merced) |
Addressing Class Imbalance in Scene Graph Parsing by Learning to Contrast and Score | He Huang (University of Illinois at Chicago)*; Shunta Saito (Preferred Networks, Inc.); Yuta Kikuchi (Preferred Networks, Inc.); Eiichi Matsumoto (Preferred Networks, Inc.); Wei Tang (University of Illinois at Chicago); Philip S. Yu (UIC) |
Spatial Class Distribution Shift in Unsupervised Domain Adaptation: Local Alignment Comes to Rescue | Safa Cicek (UCLA)*; Ning Xu (Adobe Research); Zhaowen Wang (Adobe Research); Hailin Jin (Adobe Research); Stefano Soatto (UCLA) |
BLT: Balancing Long-Tailed Datasets with Adversarially-Perturbed Images | Jedrzej Kozerawski (UC Santa Barbara); Victor Fragoso (Microsoft)*; Nikolaos Karianakis (Microsoft); Gaurav Mittal (Microsoft); Matthew Turk (TTIC); Mei Chen (Microsoft) |
Vax-a-Net: Training-time Defence Against Adversarial Patch Attacks | Thomas Gittings (University of Surrey); Steve Schneider (University of Surrey); John Collomosse (Adobe Research)* |
Best Buddies Registration for Point Clouds | Amnon Drory (Tel-Aviv University)*; Tal Shomer (Tel-Aviv University); Shai Avidan (Tel Aviv University); Raja Giryes (Tel Aviv University) |
Discrete Spatial Importance-Based Deep Weighted Hashing | Yang Shi (Shandong University); Xiushan Nie (Shandong Jianzhu University)*; Quan Zhou (Shandong University); Xiaoming Xi (Shandong Jianzhu University ); Yilong Yin (Shandong University) |
Double Targeted Universal Adversarial Perturbations | Philipp Benz (KAIST)*; Chaoning Zhang (KAIST); Tooba Imtiaz (KAIST); In So Kweon (KAIST) |
ERIC: Extracting Relations Inferred from Convolutions | Joe Townsend (Fujitsu Laboratories of Europe LTD)*; Theodoros Kasioumis (Fujitsu Laboratories of Europe LTD); Hiroya Inakoshi (Fujitsu Laboratories of Europe) |
Faster, Better and More Detailed: 3D Face Reconstruction with Graph Convolutional Networks | Shiyang Cheng (Samsung)*; Georgios Tzimiropoulos (Samsung AI); Jie Shen (Imperial College London); Maja Pantic (Samsung AI Centre Cambridge/ Imperial College London ) |
Title | Authors |
RealSmileNet: A Deep End-To-End Network for Spontaneous and Posed Smile Recognition | Yan Yang (Australian National University)*; Md Zakir Hossain (The Australian National University ); Tom Gedeon (The Australian National University); Shafin Rahman (North South University) |
IAFA: Instance-Aware Feature Aggregation for 3D Object Detection from a Single Image | Dingfu Zhou (Baidu)*; Xibin Song (Baidu); Yuchao Dai (Northwestern Polytechnical University); Junbo Yin (Beijing Institute of Technology); Feixiang Lu (Baidu); Miao Liao (Baidu); Jin Fang (Baidu ); Liangjun Zhang (Baidu) |
Synthetic-to-Real Unsupervised Domain Adaptation for Scene Text Detection in the Wild | weijia wu (Zhejiang University)*; Ning Lu (Tencent Cloud Product Department); Enze Xie (The University of Hong Kong); Yuxing Wang (Zhejiang University); Wenwen Yu (Xuzhou Medical University); Cheng Yang (Zhejiang University); HONG ZHOU (Zhejiang University) |
3D Guided Weakly Supervised Semantic Segmentation | Weixuan Sun (Australian National University, Data61 )*; Jing Zhang (Australian National University); Nick Barnes (ANU) |
Learning Global Pose Features in Graph Convolutional Networks for 3D Human Pose Estimation | Kenkun Liu ( University of Illinois at Chicago); Zhiming Zou (University of Illinois at Chicago); Wei Tang (University of Illinois at Chicago)* |
Attended-Auxiliary Supervision Representation for Face Anti-spoofing | Son Minh Nguyen (Teikyo University)*; Linh Duy Tran (Teikyo University); Masayuki Arai (Teikyo Univ.) |
TSI: Temporal Scale Invariant Network for Action Proposal Generation | Shuming Liu (Shanghai Jiao Tong University); Xu Zhao (Shanghai Jiao Tong University)*; Haisheng Su (Shanghai Jiao Tong University); Zhilan Hu (Huawei) |
MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network | Yi Wei (University at Albany – SUNY)*; Zhe Gan (Microsoft); Wenbo Li (Samsung Research America); Siwei Lyu (University at Albany); Ming-Ching Chang (University at Albany – SUNY); Lei Zhang (Microsoft); Jianfeng Gao (Microsoft Research); Pengchuan Zhang (Microsoft Research AI) |
Local Facial Makeup Transfer via Disentangled Representation | Zhaoyang Sun (Wuhan University of Technology)*; Feng Liu (Wuhan University of Technology); Wen Liu (Wuhan University of Technology); Shengwu Xiong (Wuhan University of Technology); Wenxuan Liu (Wuhan University of Technology) |
Multi-Task Learning for Simultaneous Video Generation and Remote Photoplethysmography Estimation | Yun-Yun Tsou (National Tsing Hua University)*; Yi-An Lee (National Tsing Hua University); Chiou-Ting Hsu (National Tsing Hua University) |
Feature Variance Ratio-Guided Channel Pruning for Deep Convolutional Network Acceleration | Junjie He (Zhejiang University)*; Bohua Chen (Zhejiang University); Yinzhang Ding (Zhejiang University); Dongxiao Li (Zhejiang University) |
SDCNet: Size Divide and Conquer Network for Salient Object Detection | Senbo Yan (Zhejiang University); Xiaowen Song (Zhejiang University)*; chuer yu (ZheJiang University) |
Attention-Based Fine-Grained Classification of Bone Marrow Cells | Weining Wang (South China University of Technology); Peirong Guo (South China University of Technology)*; Lemin Li (South China University of Technology); Yan Tan (Peking University People’s Hospital); Hongxia Shi (Peking University People’s Hospital); Yan Wei (Peking University People’s Hospital); Xiangmin Xu (South China University of Technology) |
Adaptive Spotting: Deep Reinforcement Object Search in 3D Point Clouds | Onkar Krishna (NTT Corporation, Japan)*; Go Irie (NTT Corporation); Xiaomeng Wu (NTT Corporation); Takahito Kawanishi (NTT Corporation); Kunio Kashino (NTT Communication Science Laboratories) |
Point Proposal based Instance Segmentation with Rectangular Masks for Robot Picking Task | Satoshi Ito (Toshiba Corporation)*; Susumu Kubota (Toshiba Corporation) |
Hierarchical X-Ray Report Generation via Pathology tags and Multi Head Attention | Preethi Srinivasan (IIT Mandi); Daksh Thapar (Indian Institute of Technology, Mandi)*; Arnav Bhavsar (IIT Mandi); Aditya Nigam (IIT mandi) |
TinyGAN: Distilling BigGAN for Conditional Image Generation | Ting-Yun Chang (National Taiwan University)*; Chi-Jen Lu (Academia Sinica) |
CS-MCNet:A Video Compressive Sensing Reconstruction Network with Interpretable Motion Compensation | Bowen Huang (Fudan University)*; Jinjia Zhou (Hosei University); Xiao Yan (Fudan University); Ming’e Jing (Fudan University); Rentao Wan (Fudan University); Yibo Fan (Fudan University) |
Lightweight Single-Image Super-Resolution Network with Attentive Auxiliary Feature Learning | Xuehui Wang (School of Data and Computer Science, Sun Yat-sen University); qing wang (School of Data and Computer Science, Sun Yat-sen University); Yuzhi Zhao (City University of Hong Kong); Junchi Yan (Shanghai Jiao Tong University); Lei Fan (Northwestern University); long chen (School of Data and Computer Science, Sun Yat-sen University)* |
Deep Priors inside an Unrolled and Adaptive Deconvolution Model | Hung-Chih Ko (National Taiwan University); Je-Yuan Chang (National Taiwan University); Jian-Jiun Ding (National Taiwan University)* |
Few-Shot Object Detection by Second-order Pooling | Shan Zhang (ANU, Beijing Union University)*; Dawei Luo (Beijing Key Laboratory of Information Service Engineering, Beijing Union University ); Lei Wang (“University of Wollongong, Australia”); Piotr Koniusz (Data61/CSIRO, ANU) |
Title | Authors |
Progressive Batching for Efficient Non-linear Least Squares | Huu Le (Chalmers University of Technology)*; Christopher Zach (Chalmers University); Edward Rosten (Snap Inc.); Oliver J. Woodford (Snap Inc) |
Watch, read and lookup: learning to spot signs from multiple supervisors | Liliane Momeni (University of Oxford); Gul Varol (University of Oxford)*; Samuel Albanie (University of Oxford); Triantafyllos Afouras (University of Oxford); Andrew Zisserman (University of Oxford) |
D2D: Keypoint Extraction with Describe to Detect Approach | Yurun Tian (Imperial College London)*; Vassileios Balntas (Scape Technologies); Tony Ng (Imperial College London); Axel Barroso-Laguna (Imperial College London); Yiannis Demiris (Imperial College London); Krystian Mikolajczyk (Imperial College London) |
SpotPatch: Parameter-Efficient Transfer Learning for Mobile Object Detection | Keren Ye (University of Pittsburgh)*; Adriana Kovashka (University of Pittsburgh); Mark Sandler (Google); Menglong Zhu (UPenn); Andrew Howard (Google); Marco Fornoni (Google) |
A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking | Manuel Stoiber (German Aerospace Center (DLR))*; Martin Pfanne (German Aerospace Center); Klaus H. Strobl (DLR); Rudolph Triebel (German Aerospace Center (DLR)); Alin Albu-Schaeffer (Robotics and Mechatronics Center (RMC), German Aerospace Center (DLR)) |
Encode the Unseen: Predictive Video Hashing for Scalable Mid-Stream Retrieval | Tong Yu (University of Strasbourg)*; Nicolas Padoy (University of Strasbourg) |
In Defense of LSTMs for Addressing Multiple Instance Learning Problems | Kaili Wang (KU Leuven, UAntwerpen)*; Jose Oramas (UAntwerp, imec-IDLab); Tinne Tuytelaars (KU Leuven) |
FreezeNet: Full Performance by Reduced Storage Costs | Paul Wimmer (Luebeck University / Robert Bosch GmbH)*; Jens Mehnert (Robert Bosch GmbH); Alexandru Condurache (Bosch) |
Lossless Image Compression Using a Multi-Scale Progressive Statistical Model | Honglei Zhang (Nokia Technologies)*; Francesco Cricri (Nokia Technologies); Hamed R. Tavakoli (Nokia Technologies); Nannan Zou (Tampere University); Emre Aksu (Nokia Technologies); Miska M. Hannuksela (Nokia Technologies) |
Semantics through Time: Semi-supervised Segmentation of Aerial Videos with Iterative Label Propagation | Alina Marcu (University “Politehnica” of Bucharest)*; Vlad Licaret (Autonomous Systems); Dragos Costea (University “Politehnica” of Bucharest); Marius Leordeanu (University “Politehnica” of Bucharest) |
Generic Image Segmentation in Fully Convolutional Networks by Superpixel Merging Map | Jin-Yu Huang (National Taiwan University); Jian-Jiun Ding (National Taiwan University)* |
Title | Authors |
Transforming Multi-Concept Attention into Video Summarization | Yen-Ting Liu (National Taiwan University)*; Yu-Jhe Li (Carnegie Mellon University); Yu-Chiang Frank Wang (National Taiwan University) |
Visual Tracking by TridentAlign and Context Embedding | Janghoon Choi (Seoul National University)*; Junseok Kwon (Chung-Ang Univ., Korea); Kyoung Mu Lee (Seoul National University) |
Exploiting Transferable Knowledge for Fairness-aware Image Classification | sunhee hwang (Yonsei university)*; Sungho Park (Yonsei University); Pilhyeon Lee (Yonsei University); seogkyu jeon (Yonsei university); Dohyung Kim (Yonsei University); Hyeran Byun (Yonsei University) |
Adversarial Semi-Supervised Multi-Domain Tracking | Kourosh Meshgi (RIKEN AIP)*; Maryam Sadat Mirzaei (Riken AIP / Kyoto University) |
Multi-View Consistency Loss for Improved Single-Image 3D Reconstruction of Clothed People | Akin Caliskan (Center for Vision Speech and Signal Processing – University of Surrey)*; Armin Mustafa (University of Surrey); Evren Imre (Vicon); Adrian Hilton (University of Surrey) |
Multiple Exemplars-based Hallucination for Face Super-resolution and Editing | Kaili Wang (KU Leuven, UAntwerpen)*; Jose Oramas (UAntwerp, imec-IDLab); Tinne Tuytelaars (KU Leuven) |
Audiovisual Transformer with Instance Attention for Audio-Visual Event Localization | Yan-Bo Lin (National Taiwan Unviersity)*; Yu-Chiang Frank Wang (National Taiwan University) |
Utilizing Transfer Learning and a Customized Loss Function for Optic Disc Segmentation from Retinal Images | Abdullah Sarhan (University of Calgary)*; Ali Al-Khaz’Aly (University of Calgary); Adam Gorner (University of Calgary); Andrew Swift (University of Calgary); Jon Rokne (University of Calgary); Reda Alhajj (University of Calgary); Andrew Crichton (University of Calgary) |
Any-Shot Object Detection | Shafin Rahman (North South University)*; Salman Khan (IIAI); Nick Barnes (ANU); Fahad Shahbaz Khan (Inception Institute of Artificial Intelligence) |
Degradation Model Learning for Real-World Single Image Super-resolution | Jin XIAO (The Hong Kong Polytechnic University)*; Hongwei Yong (The Hong Kong Polytechnic University); Lei Zhang (“Hong Kong Polytechnic University, Hong Kong, China”) |
Leveraging Tacit Information Embedded in CNN Layers for Visual Tracking | Kourosh Meshgi (RIKEN AIP)*; Maryam Sadat Mirzaei (Riken AIP / Kyoto University); Shigeyuki Oba (Kyoto University) |
dpVAEs: Fixing Sample Generation for Regularized VAEs | Riddhish Bhalodia (Scientific Computing and Imaging Institute); Iain Lee (Scientific computing and Imaging Institute, University of Utah); Shireen Elhabian (Scientific Computing and Imaging Institute, University of Utah)* |
CloTH-VTON: Clothing Three-dimensional reconstruction for Hybrid image-based Virtual Try-ON | Matiur Rahman Minar (Seoul National University of Science and Technology); Heejune Ahn (Seoul National Univ. of Science and Technology)* |
Learn more, forget less: Cues from human brain | Arijit Patra (University of Oxford); Tapabrata Chakraborti (University of Oxford)* |
SGNet: Semantics Guided Deep Stereo Matching | Shuya Chen (Zhejiang University); Zhiyu Xiang (Zhejiang University)*; Chengyu Qiao (Zhejiang University); Yiman Chen (Zhejiang University); Tingming Bai (Zhejiang University) |
A Global to Local Double Embedding Method for Multi-person Pose Estimation | Yiming Xu (UESTC)*; Jiaxin Li (Beijing Institute of Technology); Yan Ding (Beijing Institute of Technology); Hua-Liang Wei (University of Sheffield) |
Reconstructing Human Body Mesh from Point Clouds by Adversarial GP Network | Boyao Zhou (Inria)*; Jean-Sebastien Franco (INRIA); Federica Bogo (Microsoft); Bugra Tekin (Microsoft); Edmond Boyer (Inria) |
Semi-supervised Facial Action Unit Intensity Estimation with Contrastive Learning | Enrique Sanchez (Samsung AI Centre)*; Adrian Bulat (Samsung AI Center, Cambridge); Anestis Zaganidis (Samsung); Georgios Tzimiropoulos (Queen Mary University of London) |
MTNAS: Search Multi-Task Networks for Autonomous Driving | Hao Liu (Beijing Institute of Technology)*; Dong Li (Xilinx); JinZhang Peng (Xilinx); Qingjie Zhao (Beijing Institute of Technology); Lu Tian (Xilinx,Inc.); Yi Shan (Xilinx) |
Towards Optimal Filter Pruning with Balanced Performance and Pruning Speed | Dong Li (Nuctech)*; Sitong Chen (Nuctech); Xudong Liu (Nuctech); Yunda Sun (Nuctech); Li Zhang (Nuctech) |
Fully Supervised and Guided Distillation for One-Stage Detectors | Deyu Wang (Canon Information Technology (Beijing) Co., LTD)*; Dongchao Wen (Canon Information Technology (Beijing) Co., LTD); Junjie Liu (Canon Information Technology (Beijing) Co., LTD); Wei Tao (Canon Information Technology (Beijing) Co., LTD); Tse-Wei Chen (Canon Inc.); Kinya Osa (Canon Inc.); Masami Kato (Canon Inc.) |
Localin Reshuffle Net: Toward Naturally and Efficiently Facial Image Blending | Chengyao Zheng (Southeast Univeristy); Siyu Xia (Southeast University, China); Joseph Robinson (Northeastern University)*; Changsheng Lu (Shanghai Jiao Tong University); Wayne Wu (Tsinghua University); Chen Qian (SenseTime); Ming Shao (University of Massachusetts Dartmouth) |
Local Context Attention for Salient Object Segmentation | Jing Tan (Megvii(face++) Research); Pengfei Xiong (Megvii(face++) Research)*; Zhengyi Lv (Megvii(face++) Research); Kuntao Xiao (Megvii(face++) Research); Yuwen He (Megvii(face++) Research) |
Learning Multi-Instance Sub-pixel Point Localization | Julien Schroeter (Cardiff University)*; Tinne Tuytelaars (KU Leuven); Kirill Sidorov (Cardiff University); David Marshall (Cardiff University) |
Explaining image classifiers by removing input features using generative models | Chirag Agarwal (UIC); Anh Nguyen (Auburn University)* |
Compact and Fast Underwater Segmentation Network for Autonomous Underwater Vehicles | Jiangtao Wang (Loughborough University); Baihua Li (Loughborough University)*; Yang Zhou (Loughborough University); Emanuele Rocco (Witted Srl); Qinggang Meng (Computer Science Department Loughborough University) |
Pose Correction Algorithm for Relative Frames between Keyframes in SLAM | Youngseok Jang (Seoul National University)*; Hojoon Shin (Seoul National University); H. Jin Kim (Seoul National University) |
Motion Prediction Using Temporal Inception Module | Tim Lebailly (EPFL)*; Sena Kiciroglu (EPFL (École polytechnique fédérale de Lausanne)); Mathieu Salzmann (EPFL); Pascal Fua (EPFL, Switzerland); Wei Wang (EPFL) |
VAN: Versatile Affinity Network for End-to-end Online Multi-Object Tracking | Hyemin Lee (POSTECH)*; Inhan Kim (POSTECH); Daijin Kim (Pohang University of Science and Technology) |
Rotation Equivariant Orientation Estimation for Omnidirectional Localization | Chao Zhang (Toshiba Europe Limited)*; Ignas Budvytis (Department of Engineering, University of Cambridge); Stephan Liwicki (Toshiba Europe Limited); Roberto Cipolla (University of Cambridge) |
Low-light Color Imaging via Dual Camera Acquisition | Peiyao Guo (Nanjing University); Zhan Ma (Nanjing University)* |
Low-level Sensor Fusion Network for 3D Vehicle Detection using Radar Range-Azimuth Heatmap and Monocular Image | Jinhyeong Kim ( Korea Advanced Institute of Science and Technology); Youngseok Kim (Korea Advanced Institute of Science and Technology (KAIST))*; Dongsuk Kum (Korea Advanced Institute of Science and Technology) |
Dense-Scale Feature Learning in Person Re-Identification | Li Wang (Inspur); Baoyu Fan (Inspur Electronic Information Industry Co.,Ltd.)*; Zhenhua Guo (Inspur Electronic Information Industry Co.,Ltd.); Yaqian Zhao (Inspur); Runze Zhang (Inspur Electronic Information Industry Co.,Ltd.); Rengang Li (Inspur); Weifeng Gong ( Inspur Electronic Information Industry Co.,Ltd.) |
Discovering Multi-Label Actor-Action Association in a Weakly Supervised Setting | Sovan Biswas (University of Bonn)*; Juergen Gall (University of Bonn) |