Best Paper Award
"A Theory of Fermat Paths for Non-Line-of-Sight Shape Reconstruction" by Shumian Xin, Sotiris Nousias, Kyros Kutulakos, Aswin Sankaranarayanan, Srinivasa G. Narasimhan and Ioannis Gkioulekas.
Best Student Paper Award
"Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation" by Xin Wang, Qiuyuan Huang, Asli Celikyilmaz, Jianfeng Gao, Dinghan Shen, Yuan-Fang Wang, William Yang Wang and Lei Zhang.
Best Paper Honorable Mention
"A Style-Based Generator Architecture for Generative Adversarial Networks" by Tero Karras, Samuli Laine and Timo Aila.
"Learning the Depths of Moving People by Watching Frozen People" by Zhengqi Li, Tali Dekel, Forrester Cole, Richard Tucker, Ce Liu, Bill Freeman and Noah Snavely.
PAMI Longuet-Higgins Prize (Retrospective Most Impactful Paper from CVPR 2009)
"ImageNet: A large-scale hierarchical image database" by Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei.
PAMI Young Researcher Award
Karen Simonyan
2019 Computer Pioneer Award
Jitendra Malik
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 09:00 | Finding Task-Relevant Features for Few-Shot Learning by Category Traversal | Hongyang Li; David Eigen; Samuel Dodge; Matthew Zeiler; Xiaogang Wang | 5 |
2 | 09:05 | Edge-Labeling Graph Neural Network for Few-Shot Learning | Jongmin Kim; Taesup Kim; Sungwoong Kim; Chang D. Yoo | 6340 | |
3 | 09:10 | Generating Classification Weights With GNN Denoising Autoencoders for Few-Shot Learning | Spyros Gidaris; Nikos Komodakis | 5728 | |
4 | 09:18 | Kervolutional Neural Networks | Chen Wang; Jianfei Yang; Lihua Xie; Junsong Yuan | 257 | |
5 | 09:23 | Why ReLU Networks Yield High-Confidence Predictions Far Away From the Training Data and How to Mitigate the Problem | Matthias Hein; Maksym Andriushchenko; Julian Bitterwolf | 4863 | |
6 | 09:28 | On the Structural Sensitivity of Deep Convolutional Networks to the Directions of Fourier Basis Functions | Yusuke Tsuzuku; Issei Sato | 6679 | |
7 | 09:36 | Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization | Siyuan Qiao; Zhe Lin; Jianming Zhang; Alan L. Yuille | 948 | |
8 | 09:41 | Hardness-Aware Deep Metric Learning | Wenzhao Zheng; Zhaodong Chen; Jiwen Lu; Jie Zhou | 2284 | |
9 | 09:46 | Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation | Chenxi Liu; Liang-Chieh Chen; Florian Schroff; Hartwig Adam; Wei Hua; Alan L. Yuille; Li Fei-Fei | 1183 | |
10 | 09:54 | Learning Loss for Active Learning | Donggeun Yoo; In So Kweon | 1535 | |
11 | 09:59 | Striking the Right Balance With Uncertainty | Salman Khan; Munawar Hayat; Syed Waqas Zamir; Jianbing Shen; Ling Shao | 2230 | |
12 | 10:04 | AutoAugment: Learning Augmentation Strategies From Data | Ekin D. Cubuk; Barret Zoph; Dandelion Mané; Vijay Vasudevan; Quoc V. Le | 2368 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
3D Multiview | 72 | 09:00 | SDRSAC: Semidefinite-Based Randomized Approach for Robust Point Cloud Registration Without Correspondences | Huu M. Le; Thanh-Toan Do; Tuan Hoang; Ngai-Man Cheung | 494 |
73 | 09:05 | BAD SLAM: Bundle Adjusted Direct RGB-D SLAM | Thomas Schöps; Torsten Sattler; Marc Pollefeys | 2315 | |
74 | 09:10 | Revealing Scenes by Inverting Structure From Motion Reconstructions | Francesco Pittaluga; Sanjeev J. Koppal; Sing Bing Kang; Sudipta N. Sinha | 2286 | |
75 | 09:18 | Strand-Accurate Multi-View Hair Capture | Giljoo Nam; Chenglei Wu; Min H. Kim; Yaser Sheikh | 1185 | |
76 | 09:23 | DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation | Jeong Joon Park; Peter Florence; Julian Straub; Richard Newcombe; Steven Lovegrove | 6756 | |
77 | 09:28 | Pushing the Boundaries of View Extrapolation With Multiplane Images | Pratul P. Srinivasan; Richard Tucker; Jonathan T. Barron; Ravi Ramamoorthi; Ren Ng; Noah Snavely | 2957 | |
78 | 09:36 | GA-Net: Guided Aggregation Net for End-To-End Stereo Matching | Feihu Zhang; Victor Prisacariu; Ruigang Yang; Philip H.S. Torr | 1935 | |
79 | 09:41 | Real-Time Self-Adaptive Deep Stereo | Alessio Tonioni; Fabio Tosi; Matteo Poggi; Stefano Mattoccia; Luigi Di Stefano | 2901 | |
80 | 09:46 | LAF-Net: Locally Adaptive Fusion Networks for Stereo Confidence Estimation | Sunok Kim; Seungryong Kim; Dongbo Min; Kwanghoon Sohn | 6639 | |
81 | 09:54 | NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences | Chen Zhao; Zhiguo Cao; Chi Li; Xin Li; Jiaqi Yang | 3522 | |
82 | 09:59 | Coordinate-Free Carlsson-Weinshall Duality and Relative Multi-View Geometry | Matthew Trager; Martial Hebert; Jean Ponce | 5852 | |
83 | 10:04 | Deep Reinforcement Learning of Volume-Guided Progressive View Inpainting for 3D Point Scene Completion From a Single Depth Image | Xiaoguang Han; Zhaoxuan Zhang; Dong Du; Mingdai Yang; Jingming Yu; Pan Pan; Xin Yang; Ligang Liu; Zixiang Xiong; Shuguang Cui | 1944 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Action & Video | 109 | 09:00 | Video Action Transformer Network | Rohit Girdhar; João Carreira; Carl Doersch; Andrew Zisserman | 292 |
110 | 09:05 | Timeception for Complex Action Recognition | Noureldien Hussein; Efstratios Gavves; Arnold W.M. Smeulders | 302 | |
111 | 09:10 | STEP: Spatio-Temporal Progressive Learning for Video Action Detection | Xitong Yang; Xiaodong Yang; Ming-Yu Liu; Fanyi Xiao; Larry S. Davis; Jan Kautz | 1670 | |
112 | 09:18 | Relational Action Forecasting | Chen Sun; Abhinav Shrivastava; Carl Vondrick; Rahul Sukthankar; Kevin Murphy; Cordelia Schmid | 1745 | |
113 | 09:23 | Long-Term Feature Banks for Detailed Video Understanding | Chao-Yuan Wu; Christoph Feichtenhofer; Haoqi Fan; Kaiming He; Philipp Krähenbühl; Ross Girshick | 2310 | |
114 | 09:28 | Which Way Are You Going? Imitative Decision Learning for Path Forecasting in Dynamic Scenes | Yuke Li | 229 | |
115 | 09:36 | What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment | Paritosh Parmar; Brendan Tran Morris | 3106 | |
116 | 09:41 | MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation | Shuangjie Xu; Daizong Liu; Linchao Bao; Wei Liu; Pan Zhou | 1382 | |
117 | 09:46 | 2.5D Visual Sound | Ruohan Gao; Kristen Grauman | 1517 | |
118 | 09:54 | Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model | Weining Wang; Yan Huang; Liang Wang | 1999 | |
119 | 09:59 | Gaussian Temporal Awareness Networks for Action Localization | Fuchen Long; Ting Yao; Zhaofan Qiu; Xinmei Tian; Jiebo Luo; Tao Mei | 5591 | |
120 | 10:04 | Efficient Video Classification Using Fewer Frames | Shweta Bhardwaj; Mukundhan Srinivasan; Mitesh M. Khapra | 6940 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 10:15 | Finding Task-Relevant Features for Few-Shot Learning by Category Traversal | Hongyang Li; David Eigen; Samuel Dodge; Matthew Zeiler; Xiaogang Wang | 5 |
2 | 10:15 | Edge-Labeling Graph Neural Network for Few-Shot Learning | Jongmin Kim; Taesup Kim; Sungwoong Kim; Chang D. Yoo | 6340 | |
3 | 10:15 | Generating Classification Weights With GNN Denoising Autoencoders for Few-Shot Learning | Spyros Gidaris; Nikos Komodakis | 5728 | |
4 | 10:15 | Kervolutional Neural Networks | Chen Wang; Jianfei Yang; Lihua Xie; Junsong Yuan | 257 | |
5 | 10:15 | Why ReLU Networks Yield High-Confidence Predictions Far Away From the Training Data and How to Mitigate the Problem | Matthias Hein; Maksym Andriushchenko; Julian Bitterwolf | 4863 | |
6 | 10:15 | On the Structural Sensitivity of Deep Convolutional Networks to the Directions of Fourier Basis Functions | Yusuke Tsuzuku; Issei Sato | 6679 | |
7 | 10:15 | Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization | Siyuan Qiao; Zhe Lin; Jianming Zhang; Alan L. Yuille | 948 | |
8 | 10:15 | Hardness-Aware Deep Metric Learning | Wenzhao Zheng; Zhaodong Chen; Jiwen Lu; Jie Zhou | 2284 | |
9 | 10:15 | Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation | Chenxi Liu; Liang-Chieh Chen; Florian Schroff; Hartwig Adam; Wei Hua; Alan L. Yuille; Li Fei-Fei | 1183 | |
10 | 10:15 | Learning Loss for Active Learning | Donggeun Yoo; In So Kweon | 1535 | |
11 | 10:15 | Striking the Right Balance With Uncertainty | Salman Khan; Munawar Hayat; Syed Waqas Zamir; Jianbing Shen; Ling Shao | 2230 | |
12 | 10:15 | AutoAugment: Learning Augmentation Strategies From Data | Ekin D. Cubuk; Barret Zoph; Dandelion Mané; Vijay Vasudevan; Quoc V. Le | 2368 | |
13 | 10:15 | Parsing R-CNN for Instance-Level Human Analysis | Lu Yang; Qing Song; Zhihui Wang; Ming Jiang | 143 | |
14 | 10:15 | Large Scale Incremental Learning | Yue Wu; Yinpeng Chen; Lijuan Wang; Yuancheng Ye; Zicheng Liu; Yandong Guo; Yun Fu | 306 | |
15 | 10:15 | TopNet: Structural Point Cloud Decoder | Lyne P. Tchapmi; Vineet Kosaraju; Hamid Rezatofighi; Ian Reid; Silvio Savarese | 365 | |
16 | 10:15 | Perceive Where to Focus: Learning Visibility-Aware Part-Level Features for Partial Person Re-Identification | Yifan Sun; Qin Xu; Yali Li; Chi Zhang; Yikang Li; Shengjin Wang; Jian Sun | 423 | |
17 | 10:15 | Meta-Transfer Learning for Few-Shot Learning | Qianru Sun; Yaoyao Liu; Tat-Seng Chua; Bernt Schiele | 454 | |
18 | 10:15 | Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation | Bohan Zhuang; Chunhua Shen; Mingkui Tan; Lingqiao Liu; Ian Reid | 635 | |
19 | 10:15 | Deep RNN Framework for Visual Sequential Applications | Bo Pang; Kaiwen Zha; Hanwen Cao; Chen Shi; Cewu Lu | 686 | |
20 | 10:15 | Graph-Based Global Reasoning Networks | Yunpeng Chen; Marcus Rohrbach; Zhicheng Yan; Yan Shuicheng; Jiashi Feng; Yannis Kalantidis | 750 | |
21 | 10:15 | SSN: Learning Sparse Switchable Normalization via SparsestMax | Wenqi Shao; Tianjian Meng; Jingyu Li; Ruimao Zhang; Yudian Li; Xiaogang Wang; Ping Luo | 778 | |
22 | 10:15 | Spherical Fractal Convolutional Neural Networks for Point Cloud Recognition | Yongming Rao; Jiwen Lu; Jie Zhou | 823 | |
23 | 10:15 | Learning to Generate Synthetic Data via Compositing | Shashank Tripathi; Siddhartha Chandra; Amit Agrawal; Ambrish Tyagi; James M. Rehg; Visesh Chari | 830 | |
24 | 10:15 | Divide and Conquer the Embedding Space for Metric Learning | Artsiom Sanakoyeu; Vadim Tschernezki; Uta Büchler; Björn Ommer | 833 | |
25 | 10:15 | Latent Space Autoregression for Novelty Detection | Davide Abati; Angelo Porrello; Simone Calderara; Rita Cucchiara | 858 | |
26 | 10:15 | Attending to Discriminative Certainty for Domain Adaptation | Vinod Kumar Kurmi; Shanu Kumar; Vinay P. Namboodiri | 934 | |
27 | 10:15 | Feature Denoising for Improving Adversarial Robustness | Cihang Xie; Yuxin Wu; Laurens van der Maaten; Alan L. Yuille; Kaiming He | 1061 | |
28 | 10:15 | Selective Kernel Networks | Xiang Li; Wenhai Wang; Xiaolin Hu; Jian Yang | 1065 | |
29 | 10:15 | On Implicit Filter Level Sparsity in Convolutional Neural Networks | Dushyant Mehta; Kwang In Kim; Christian Theobalt | 1146 | |
30 | 10:15 | FlowNet3D: Learning Scene Flow in 3D Point Clouds | Xingyu Liu; Charles R. Qi; Leonidas J. Guibas | 1167 | |
31 | 10:15 | Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks | Kuan Fang; Alexander Toshev; Li Fei-Fei; Silvio Savarese | 1218 | |
32 | 10:15 | Co-Occurrent Features in Semantic Segmentation | Hang Zhang; Han Zhang; Chenguang Wang; Junyuan Xie | 1222 | |
33 | 10:15 | Bag of Tricks for Image Classification with Convolutional Neural Networks | Tong He; Zhi Zhang; Hang Zhang; Zhongyue Zhang; Junyuan Xie; Mu Li | 1246 | |
34 | 10:15 | Learning Channel-Wise Interactions for Binary Convolutional Neural Networks | Ziwei Wang; Jiwen Lu; Chenxin Tao; Jie Zhou; Qi Tian | 1279 | |
35 | 10:15 | Knowledge Adaptation for Efficient Semantic Segmentation | Tong He; Chunhua Shen; Zhi Tian; Dong Gong; Changming Sun; Youliang Yan | 1290 | |
36 | 10:15 | Parametric Noise Injection: Trainable Randomness to Improve Deep Neural Network Robustness Against Adversarial Attack | Zhezhi He; Adnan Siraj Rakin; Deliang Fan | 7000 | |
Recognition | 37 | 10:15 | Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-Identification | Zhun Zhong; Liang Zheng; Zhiming Luo; Shaozi Li; Yi Yang | 17 |
38 | 10:15 | Dissecting Person Re-Identification From the Viewpoint of Viewpoint | Xiaoxiao Sun; Liang Zheng | 19 | |
39 | 10:15 | Learning to Reduce Dual-Level Discrepancy for Infrared-Visible Person Re-Identification | Zhixiang Wang; Zheng Wang; Yinqiang Zheng; Yung-Yu Chuang; Shin'ichi Satoh | 158 | |
40 | 10:15 | Progressive Feature Alignment for Unsupervised Domain Adaptation | Chaoqi Chen; Weiping Xie; Wenbing Huang; Yu Rong; Xinghao Ding; Yue Huang; Tingyang Xu; Junzhou Huang | 208 | |
41 | 10:15 | Feature-Level Frankenstein: Eliminating Variations for Discriminative Recognition | Xiaofeng Liu; Site Li; Lingsheng Kong; Wanqing Xie; Ping Jia; Jane You; B.V.K. Kumar | 278 | |
42 | 10:15 | Learning a Deep ConvNet for Multi-Label Classification With Partial Labels | Thibaut Durand; Nazanin Mehrasa; Greg Mori | 300 | |
43 | 10:15 | Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression | Hamid Rezatofighi; Nathan Tsoi; JunYoung Gwak; Amir Sadeghian; Ian Reid; Silvio Savarese | 366 | |
44 | 10:15 | Densely Semantically Aligned Person Re-Identification | Zhizheng Zhang; Cuiling Lan; Wenjun Zeng; Zhibo Chen | 677 | |
45 | 10:15 | Generalising Fine-Grained Sketch-Based Image Retrieval | Kaiyue Pang; Ke Li; Yongxin Yang; Honggang Zhang; Timothy M. Hospedales; Tao Xiang; Yi-Zhe Song | 698 | |
46 | 10:15 | Adapting Object Detectors via Selective Cross-Domain Alignment | Xinge Zhu; Jiangmiao Pang; Ceyuan Yang; Jianping Shi; Dahua Lin | 764 | |
47 | 10:15 | Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation | Yunhang Shen; Rongrong Ji; Yan Wang; Yongjian Wu; Liujuan Cao | 774 | |
48 | 10:15 | Thinking Outside the Pool: Active Training Image Creation for Relative Attributes | Aron Yu; Kristen Grauman | 802 | |
49 | 10:15 | Generalizable Person Re-Identification by Domain-Invariant Mapping Network | Jifei Song; Yongxin Yang; Yi-Zhe Song; Tao Xiang; Timothy M. Hospedales | 848 | |
50 | 10:15 | Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification | Hao Guo; Kang Zheng; Xiaochuan Fan; Hongkai Yu; Song Wang | 851 | |
51 | 10:15 | Re-Ranking via Metric Fusion for Object Retrieval and Person Re-Identification | Song Bai; Peng Tang; Philip H.S. Torr; Longin Jan Latecki | 866 | |
52 | 10:15 | Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization | Junbao Zhuo; Shuhui Wang; Shuhao Cui; Qingming Huang | 956 | |
53 | 10:15 | Weakly Supervised Person Re-Identification | Jingke Meng; Sheng Wu; Wei-Shi Zheng | 964 | |
54 | 10:15 | PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud | Shaoshuai Shi; Xiaogang Wang; Hongsheng Li | 993 | |
55 | 10:15 | Automatic Adaptation of Object Detectors to New Domains Using Self-Training | Aruni RoyChowdhury; Prithvijit Chakrabarty; Ashish Singh; SouYoung Jin; Huaizu Jiang; Liangliang Cao; Erik Learned-Miller | 1023 | |
56 | 10:15 | Deep Sketch-Shape Hashing With Segmented 3D Stochastic Viewing | Jiaxin Chen; Jie Qin; Li Liu; Fan Zhu; Fumin Shen; Jin Xie; Ling Shao | 1050 | |
57 | 10:15 | Generative Dual Adversarial Network for Generalized Zero-Shot Learning | He Huang; Changhu Wang; Philip S. Yu; Chang-Dong Wang | 1062 | |
58 | 10:15 | Query-Guided End-To-End Person Search | Bharti Munjal; Sikandar Amin; Federico Tombari; Fabio Galasso | 1115 | |
59 | 10:15 | Libra R-CNN: Towards Balanced Learning for Object Detection | Jiangmiao Pang; Kai Chen; Jianping Shi; Huajun Feng; Wanli Ouyang; Dahua Lin | 1128 | |
60 | 10:15 | Learning a Unified Classifier Incrementally via Rebalancing | Saihui Hou; Xinyu Pan; Chen Change Loy; Zilei Wang; Dahua Lin | 1165 | |
61 | 10:15 | Feature Selective Anchor-Free Module for Single-Shot Object Detection | Chenchen Zhu; Yihui He; Marios Savvides | 1260 | |
62 | 10:15 | Bottom-Up Object Detection by Grouping Extreme and Center Points | Xingyi Zhou; Jiacheng Zhuo; Philipp Krähenbühl | 1324 | |
63 | 10:15 | Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples | Zihao Liu; Qi Liu; Tao Liu; Nuo Xu; Xue Lin; Yanzhi Wang; Wujie Wen | 6349 | |
Segmentation, Grouping, & Shape | 64 | 10:15 | SCOPS: Self-Supervised Co-Part Segmentation | Wei-Chih Hung; Varun Jampani; Sifei Liu; Pavlo Molchanov; Ming-Hsuan Yang; Jan Kautz | 68 |
65 | 10:15 | Unsupervised Moving Object Detection via Contextual Information Separation | Yanchao Yang; Antonio Loquercio; Davide Scaramuzza; Stefano Soatto | 838 | |
66 | 10:15 | Pose2Seg: Detection Free Human Instance Segmentation | Song-Hai Zhang; Ruilong Li; Xin Dong; Paul Rosin; Zixi Cai; Xi Han; Dingcheng Yang; Haozhi Huang; Shi-Min Hu | 1291 | |
Statistics, Physics, Theory, & Datasets | 67 | 10:15 | DrivingStereo: A Large-Scale Dataset for Stereo Matching in Autonomous Driving Scenarios | Guorun Yang; Xiao Song; Chaoqin Huang; Zhidong Deng; Jianping Shi; Bolei Zhou | 156 |
68 | 10:15 | PartNet: A Large-Scale Benchmark for Fine-Grained and Hierarchical Part-Level 3D Object Understanding | Kaichun Mo; Shilin Zhu; Angel X. Chang; Li Yi; Subarna Tripathi; Leonidas J. Guibas; Hao Su | 177 | |
69 | 10:15 | A Dataset and Benchmark for Large-Scale Multi-Modal Face Anti-Spoofing | Shifeng Zhang; Xiaobo Wang; Ajian Liu; Chenxu Zhao; Jun Wan; Sergio Escalera; Hailin Shi; Zezheng Wang; Stan Z. Li | 503 | |
70 | 10:15 | Unsupervised Learning of Consensus Maximization for 3D Vision Problems | Thomas Probst; Danda Pani Paudel; Ajad Chhatkuli; Luc Van Gool | 1047 | |
71 | 10:15 | VizWiz-Priv: A Dataset for Recognizing the Presence and Purpose of Private Visual Information in Images Taken by Blind People | Danna Gurari; Qing Li; Chi Lin; Yinan Zhao; Anhong Guo; Abigale Stangl; Jeffrey P. Bigham | 1257 | |
3D Multiview | 72 | 10:15 | SDRSAC: Semidefinite-Based Randomized Approach for Robust Point Cloud Registration Without Correspondences | Huu M. Le; Thanh-Toan Do; Tuan Hoang; Ngai-Man Cheung | 494 |
73 | 10:15 | BAD SLAM: Bundle Adjusted Direct RGB-D SLAM | Thomas Schöps; Torsten Sattler; Marc Pollefeys | 2315 | |
74 | 10:15 | Revealing Scenes by Inverting Structure From Motion Reconstructions | Francesco Pittaluga; Sanjeev J. Koppal; Sing Bing Kang; Sudipta N. Sinha | 2286 | |
75 | 10:15 | Strand-Accurate Multi-View Hair Capture | Giljoo Nam; Chenglei Wu; Min H. Kim; Yaser Sheikh | 1185 | |
76 | 10:15 | DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation | Jeong Joon Park; Peter Florence; Julian Straub; Richard Newcombe; Steven Lovegrove | 6756 | |
77 | 10:15 | Pushing the Boundaries of View Extrapolation With Multiplane Images | Pratul P. Srinivasan; Richard Tucker; Jonathan T. Barron; Ravi Ramamoorthi; Ren Ng; Noah Snavely | 2957 | |
78 | 10:15 | GA-Net: Guided Aggregation Net for End-To-End Stereo Matching | Feihu Zhang; Victor Prisacariu; Ruigang Yang; Philip H.S. Torr | 1935 | |
79 | 10:15 | Real-Time Self-Adaptive Deep Stereo | Alessio Tonioni; Fabio Tosi; Matteo Poggi; Stefano Mattoccia; Luigi Di Stefano | 2901 | |
80 | 10:15 | LAF-Net: Locally Adaptive Fusion Networks for Stereo Confidence Estimation | Sunok Kim; Seungryong Kim; Dongbo Min; Kwanghoon Sohn | 6639 | |
81 | 10:15 | NM-Net: Mining Reliable Neighbors for Robust Feature Correspondences | Chen Zhao; Zhiguo Cao; Chi Li; Xin Li; Jiaqi Yang | 3522 | |
82 | 10:15 | Coordinate-Free Carlsson-Weinshall Duality and Relative Multi-View Geometry | Matthew Trager; Martial Hebert; Jean Ponce | 5852 | |
83 | 10:15 | Deep Reinforcement Learning of Volume-Guided Progressive View Inpainting for 3D Point Scene Completion From a Single Depth Image | Xiaoguang Han; Zhaoxuan Zhang; Dong Du; Mingdai Yang; Jingming Yu; Pan Pan; Xin Yang; Ligang Liu; Zixiang Xiong; Shuguang Cui | 1944 | |
84 | 10:15 | Structural Relational Reasoning of Point Clouds | Yueqi Duan; Yu Zheng; Jiwen Lu; Jie Zhou; Qi Tian | 476 | |
85 | 10:15 | MVF-Net: Multi-View 3D Face Morphable Model Regression | Fanzi Wu; Linchao Bao; Yajing Chen; Yonggen Ling; Yibing Song; Songnan Li; King Ngi Ngan; Wei Liu | 666 | |
86 | 10:15 | Photometric Mesh Optimization for Video-Aligned 3D Object Reconstruction | Chen-Hsuan Lin; Oliver Wang; Bryan C. Russell; Eli Shechtman; Vladimir G. Kim; Matthew Fisher; Simon Lucey | 787 | |
87 | 10:15 | Guided Stereo Matching | Matteo Poggi; Davide Pallotti; Fabio Tosi; Stefano Mattoccia | 1011 | |
88 | 10:15 | Unsupervised Event-Based Learning of Optical Flow, Depth, and Egomotion | Alex Zihao Zhu; Liangzhe Yuan; Kenneth Chaney; Kostas Daniilidis | 1191 | |
89 | 10:15 | Modeling Local Geometric Structure of 3D Point Clouds Using Geo-CNN | Shiyi Lan; Ruichi Yu; Gang Yu; Larry S. Davis | 1221 | |
3D Single View & RGBD | 90 | 10:15 | 3D Point Capsule Networks | Yongheng Zhao; Tolga Birdal; Haowen Deng; Federico Tombari | 155 |
91 | 10:15 | GS3D: An Efficient 3D Object Detection Framework for Autonomous Driving | Buyu Li; Wanli Ouyang; Lu Sheng; Xingyu Zeng; Xiaogang Wang | 226 | |
92 | 10:15 | Single-Image Piece-Wise Planar 3D Reconstruction via Associative Embedding | Zehao Yu; Jia Zheng; Dongze Lian; Zihan Zhou; Shenghua Gao | 348 | |
93 | 10:15 | 3DN: 3D Deformation Network | Weiyue Wang; Duygu Ceylan; Radomir Mech; Ulrich Neumann | 593 | |
94 | 10:15 | HorizonNet: Learning Room Layout With 1D Representation and Pano Stretch Data Augmentation | Cheng Sun; Chi-Wei Hsiao; Min Sun; Hwann-Tzong Chen | 1073 | |
95 | 10:15 | Deep Fitting Degree Scoring Network for Monocular 3D Object Detection | Lijie Liu; Jiwen Lu; Chunjing Xu; Qi Tian; Jie Zhou | 1077 | |
Face & Body | 96 | 10:15 | Pushing the Envelope for RGB-Based Dense 3D Hand Pose Estimation via Neural Rendering | Seungryul Baek; Kwang In Kim; Tae-Kyun Kim | 55 |
97 | 10:15 | Self-Supervised Learning of 3D Human Pose Using Multi-View Geometry | Muhammed Kocabas; Salih Karagoz; Emre Akbas | 117 | |
98 | 10:15 | FSA-Net: Learning Fine-Grained Structure Aggregation for Head Pose Estimation From a Single Image | Tsun-Yi Yang; Yi-Ting Chen; Yen-Yu Lin; Yung-Yu Chuang | 191 | |
99 | 10:15 | Dense 3D Face Decoding Over 2500FPS: Joint Texture & Shape Convolutional Mesh Decoders | Yuxiang Zhou; Jiankang Deng; Irene Kotsia; Stefanos Zafeiriou | 332 | |
100 | 10:15 | Does Learning Specific Features for Related Parts Help Human Pose Estimation? | Wei Tang; Ying Wu | 370 | |
101 | 10:15 | Linkage Based Face Clustering via Graph Convolution Network | Zhongdao Wang; Liang Zheng; Yali Li; Shengjin Wang | 410 | |
102 | 10:15 | Towards High-Fidelity Nonlinear 3D Face Morphable Model | Luan Tran; Feng Liu; Xiaoming Liu | 514 | |
103 | 10:15 | RegularFace: Deep Face Recognition via Exclusive Regularization | Kai Zhao; Jingyi Xu; Ming-Ming Cheng | 552 | |
104 | 10:15 | BridgeNet: A Continuity-Aware Probabilistic Network for Age Estimation | Wanhua Li; Jiwen Lu; Jianjiang Feng; Chunjing Xu; Jie Zhou; Qi Tian | 808 | |
105 | 10:15 | GANFIT: Generative Adversarial Network Fitting for High Fidelity 3D Face Reconstruction | Baris Gecer; Stylianos Ploumpis; Irene Kotsia; Stefanos Zafeiriou | 821 | |
106 | 10:15 | Improving the Performance of Unimodal Dynamic Hand-Gesture Recognition With Multimodal Training | Mahdi Abavisani; Hamid Reza Vaezi Joze; Vishal M. Patel | 984 | |
107 | 10:15 | Learning to Reconstruct People in Clothing From a Single RGB Camera | Thiemo Alldieck; Marcus Magnor; Bharat Lal Bhatnagar; Christian Theobalt; Gerard Pons-Moll | 1043 | |
108 | 10:15 | Distilled Person Re-Identification: Towards a More Scalable System | Ancong Wu; Wei-Shi Zheng; Xiaowei Guo; Jian-Huang Lai | 1157 | |
Action & Video | 109 | 10:15 | Video Action Transformer Network | Rohit Girdhar; João Carreira; Carl Doersch; Andrew Zisserman | 292 |
110 | 10:15 | Timeception for Complex Action Recognition | Noureldien Hussein; Efstratios Gavves; Arnold W.M. Smeulders | 302 | |
111 | 10:15 | STEP: Spatio-Temporal Progressive Learning for Video Action Detection | Xitong Yang; Xiaodong Yang; Ming-Yu Liu; Fanyi Xiao; Larry S. Davis; Jan Kautz | 1670 | |
112 | 10:15 | Relational Action Forecasting | Chen Sun; Abhinav Shrivastava; Carl Vondrick; Rahul Sukthankar; Kevin Murphy; Cordelia Schmid | 1745 | |
113 | 10:15 | Long-Term Feature Banks for Detailed Video Understanding | Chao-Yuan Wu; Christoph Feichtenhofer; Haoqi Fan; Kaiming He; Philipp Krähenbühl; Ross Girshick | 2310 | |
114 | 10:15 | Which Way Are You Going? Imitative Decision Learning for Path Forecasting in Dynamic Scenes | Yuke Li | 229 | |
115 | 10:15 | What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment | Paritosh Parmar; Brendan Tran Morris | 3106 | |
116 | 10:15 | MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation | Shuangjie Xu; Daizong Liu; Linchao Bao; Wei Liu; Pan Zhou | 1382 | |
117 | 10:15 | 2.5D Visual Sound | Ruohan Gao; Kristen Grauman | 1517 | |
118 | 10:15 | Language-Driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model | Weining Wang; Yan Huang; Liang Wang | 1999 | |
119 | 10:15 | Gaussian Temporal Awareness Networks for Action Localization | Fuchen Long; Ting Yao; Zhaofan Qiu; Xinmei Tian; Jiebo Luo; Tao Mei | 5591 | |
120 | 10:15 | Efficient Video Classification Using Fewer Frames | Shweta Bhardwaj; Mukundhan Srinivasan; Mitesh M. Khapra | 6940 | |
121 | 10:15 | A Perceptual Prediction Framework for Self Supervised Event Segmentation | Sathyanarayanan N. Aakur; Sudeep Sarkar | 239 | |
122 | 10:15 | COIN: A Large-Scale Dataset for Comprehensive Instructional Video Analysis | Yansong Tang; Dajun Ding; Yongming Rao; Yu Zheng; Danyang Zhang; Lili Zhao; Jiwen Lu; Jie Zhou | 260 | |
123 | 10:15 | Recurrent Attentive Zooming for Joint Crowd Counting and Precise Localization | Chenchen Liu; Xinyu Weng; Yadong Mu | 484 | |
124 | 10:15 | An Attention Enhanced Graph Convolutional LSTM Network for Skeleton-Based Action Recognition | Chenyang Si; Wentao Chen; Wei Wang; Liang Wang; Tieniu Tan | 662 | |
125 | 10:15 | Graph Convolutional Label Noise Cleaner: Train a Plug-And-Play Action Classifier for Anomaly Detection | Jia-Xing Zhong; Nannan Li; Weijie Kong; Shan Liu; Thomas H. Li; Ge Li | 727 | |
126 | 10:15 | MAN: Moment Alignment Network for Natural Language Moment Retrieval via Iterative Graph Adjustment | Da Zhang; Xiyang Dai; Xin Wang; Yuan-Fang Wang; Larry S. Davis | 747 | |
127 | 10:15 | Less Is More: Learning Highlight Detection From Video Duration | Bo Xiong; Yannis Kalantidis; Deepti Ghadiyaram; Kristen Grauman | 762 | |
128 | 10:15 | DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition | Zheng Shou; Xudong Lin; Yannis Kalantidis; Laura Sevilla-Lara; Marcus Rohrbach; Shih-Fu Chang; Zhicheng Yan | 1054 | |
129 | 10:15 | AdaFrame: Adaptive Frame Selection for Fast Video Recognition | Zuxuan Wu; Caiming Xiong; Chih-Yao Ma; Richard Socher; Larry S. Davis | 1099 | |
130 | 10:15 | Spatio-Temporal Video Re-Localization by Warp LSTM | Yang Feng; Lin Ma; Wei Liu; Jiebo Luo | 1134 | |
131 | 10:15 | Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization | Daochang Liu; Tingting Jiang; Yizhou Wang | 1273 | |
Motion & Biometrics | 132 | 10:15 | Unsupervised Deep Tracking | Ning Wang; Yibing Song; Chao Ma; Wengang Zhou; Wei Liu; Houqiang Li | 629 |
133 | 10:15 | Tracking by Animation: Unsupervised Learning of Multi-Object Attentive Trackers | Zhen He; Jian Li; Daxue Liu; Hangen He; David Barber | 648 | |
134 | 10:15 | Fast Online Object Tracking and Segmentation: A Unifying Approach | Qiang Wang; Li Zhang; Luca Bertinetto; Weiming Hu; Philip H.S. Torr | 699 | |
135 | 10:15 | Object Tracking by Reconstruction With View-Specific Discriminative Correlation Filters | Uğur Kart; Alan Lukežič; Matej Kristan; Joni-Kristian Kämäräinen; Jiří Matas | 743 | |
136 | 10:15 | SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints | Amir Sadeghian; Vineet Kosaraju; Ali Sadeghian; Noriaki Hirose; Hamid Rezatofighi; Silvio Savarese | 992 | |
137 | 10:15 | Leveraging Shape Completion for 3D Siamese Tracking | Silvio Giancola; Jesus Zarzar; Bernard Ghanem | 1103 | |
138 | 10:15 | Target-Aware Deep Tracking | Xin Li; Chao Ma; Baoyuan Wu; Zhenyu He; Ming-Hsuan Yang | 1225 | |
139 | 10:15 | Spatiotemporal CNN for Video Object Segmentation | Kai Xu; Longyin Wen; Guorong Li; Liefeng Bo; Qingming Huang | 1258 | |
140 | 10:15 | Towards Rich Feature Discovery With Class Activation Maps Augmentation for Person Re-Identification | Wenjie Yang; Houjing Huang; Zhang Zhang; Xiaotang Chen; Kaiqi Huang; Shu Zhang | 1335 | |
Synthesis | 141 | 10:15 | Wide-Context Semantic Image Extrapolation | Yi Wang; Xin Tao; Xiaoyong Shen; Jiaya Jia | 577 |
142 | 10:15 | End-To-End Time-Lapse Video Synthesis From a Single Outdoor Image | Seonghyeon Nam; Chongyang Ma; Menglei Chai; William Brendel; Ning Xu; Seon Joo Kim | 673 | |
143 | 10:15 | GIF2Video: Color Dequantization and Temporal Interpolation of GIF Images | Yang Wang; Haibin Huang; Chuan Wang; Tong He; Jue Wang; Minh Hoai | 710 | |
144 | 10:15 | Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis | Qi Mao; Hsin-Ying Lee; Hung-Yu Tseng; Siwei Ma; Ming-Hsuan Yang | 869 | |
145 | 10:15 | Pluralistic Image Completion | Chuanxia Zheng; Tat-Jen Cham; Jianfei Cai | 963 | |
146 | 10:15 | Salient Object Detection With Pyramid Attention and Salient Edges | Wenguan Wang; Shuyang Zhao; Jianbing Shen; Steven C. H. Hoi; Ali Borji | 1005 | |
147 | 10:15 | Latent Filter Scaling for Multimodal Unsupervised Image-To-Image Translation | Yazeed Alharbi; Neil Smith; Peter Wonka | 1038 | |
148 | 10:15 | Attention-Aware Multi-Stroke Style Transfer | Yuan Yao; Jianqiang Ren; Xuansong Xie; Weidong Liu; Yong-Jin Liu; Jun Wang | 1040 | |
149 | 10:15 | Feedback Adversarial Learning: Spatial Feedback for Improving Generative Adversarial Networks | Minyoung Huh; Shao-Hua Sun; Ning Zhang | 1111 | |
150 | 10:15 | Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting | Yanhong Zeng; Jianlong Fu; Hongyang Chao; Baining Guo | 1127 | |
151 | 10:15 | Example-Guided Style-Consistent Image Synthesis From Semantic Labeling | Miao Wang; Guo-Ye Yang; Ruilong Li; Run-Ze Liang; Song-Hai Zhang; Peter M. Hall; Shi-Min Hu | 1135 | |
152 | 10:15 | MirrorGAN: Learning Text-To-Image Generation by Redescription | Tingting Qiao; Jing Zhang; Duanqing Xu; Dacheng Tao | 1166 | |
Computational Photography & Graphics | 153 | 10:15 | Light Field Messaging With Deep Photographic Steganography | Eric Wengrowski; Kristin Dana | 270 |
154 | 10:15 | Im2Pencil: Controllable Pencil Illustration From Photographs | Yijun Li; Chen Fang; Aaron Hertzmann; Eli Shechtman; Ming-Hsuan Yang | 719 | |
155 | 10:15 | When Color Constancy Goes Wrong: Correcting Improperly White-Balanced Images | Mahmoud Afifi; Brian Price; Scott Cohen; Michael S. Brown | 797 | |
156 | 10:15 | Beyond Volumetric Albedo — A Surface Optimization Framework for Non-Line-Of-Sight Imaging | Chia-Yin Tsai; Aswin C. Sankaranarayanan; Ioannis Gkioulekas | 856 | |
157 | 10:15 | Reflection Removal Using a Dual-Pixel Sensor | Abhijith Punnappurath; Michael S. Brown | 859 | |
158 | 10:15 | Practical Coding Function Design for Time-Of-Flight Imaging | Felipe Gutierrez-Barragan; Syed Azer Reza; Andreas Velten; Mohit Gupta | 1060 | |
159 | 10:15 | Meta-SR: A Magnification-Arbitrary Network for Super-Resolution | Xuecai Hu; Haoyuan Mu; Xiangyu Zhang; Zilei Wang; Tieniu Tan; Jian Sun | 1227 | |
Low-Level & Optimization | 160 | 10:15 | Multispectral and Hyperspectral Image Fusion by MS/HS Fusion Net | Qi Xie; Minghao Zhou; Qian Zhao; Deyu Meng; Wangmeng Zuo; Zongben Xu | 60 |
161 | 10:15 | Learning Attraction Field Representation for Robust Line Segment Detection | Nan Xue; Song Bai; Fudong Wang; Gui-Song Xia; Tianfu Wu; Liangpei Zhang | 145 | |
162 | 10:15 | Blind Super-Resolution With Iterative Kernel Correction | Jinjin Gu; Hannan Lu; Wangmeng Zuo; Chao Dong | 219 | |
163 | 10:15 | Video Magnification in the Wild Using Fractional Anisotropy in Temporal Distribution | Shoichiro Takeda; Yasunori Akagi; Kazuki Okami; Megumi Isogai; Hideaki Kimata | 435 | |
164 | 10:15 | Attentive Feedback Network for Boundary-Aware Salient Object Detection | Mengyang Feng; Huchuan Lu; Errui Ding | 438 | |
165 | 10:15 | Heavy Rain Image Restoration: Integrating Physics Model and Conditional Adversarial Learning | Ruoteng Li; Loong-Fah Cheong; Robby T. Tan | 444 | |
166 | 10:15 | Learning to Calibrate Straight Lines for Fisheye Image Rectification | Zhucun Xue; Nan Xue; Gui-Song Xia; Weiming Shen | 515 | |
167 | 10:15 | Camera Lens Super-Resolution | Chang Chen; Zhiwei Xiong; Xinmei Tian; Zheng-Jun Zha; Feng Wu | 584 | |
168 | 10:15 | Frame-Consistent Recurrent Video Deraining With Dual-Level Flow | Wenhan Yang; Jiaying Liu; Jiashi Feng | 586 | |
169 | 10:15 | Deep Plug-And-Play Super-Resolution for Arbitrary Blur Kernels | Kai Zhang; Wangmeng Zuo; Lei Zhang | 675 | |
170 | 10:15 | Sea-Thru: A Method for Removing Water From Underwater Images | Derya Akkaynak; Tali Treibitz | 695 | |
171 | 10:15 | Deep Network Interpolation for Continuous Imagery Effect Transition | Xintao Wang; Ke Yu; Chao Dong; Xiaoou Tang; Chen Change Loy | 740 | |
172 | 10:15 | Spatially Variant Linear Representation Models for Joint Filtering | Jinshan Pan; Jiangxin Dong; Jimmy S. Ren; Liang Lin; Jinhui Tang; Ming-Hsuan Yang | 759 | |
173 | 10:15 | Toward Convolutional Blind Denoising of Real Photographs | Shi Guo; Zifei Yan; Kai Zhang; Wangmeng Zuo; Lei Zhang | 916 | |
174 | 10:15 | Towards Real Scene Super-Resolution With Raw Images | Xiangyu Xu; Yongrui Ma; Wenxiu Sun | 921 | |
175 | 10:15 | ODE-Inspired Network Design for Single Image Super-Resolution | Xiangyu He; Zitao Mo; Peisong Wang; Yang Liu; Mingyuan Yang; Jian Cheng | 1034 | |
176 | 10:15 | Blind Image Deblurring With Local Maximum Gradient Prior | Liang Chen; Faming Fang; Tingting Wang; Guixu Zhang | 1075 | |
177 | 10:15 | Attention-Guided Network for Ghost-Free High Dynamic Range Imaging | Qingsen Yan; Dong Gong; Qinfeng Shi; Anton van den Hengel; Chunhua Shen; Ian Reid; Yanning Zhang | 1318 | |
Scenes & Representation | 178 | 10:15 | Searching for a Robust Neural Architecture in Four GPU Hours | Xuanyi Dong; Yi Yang | 80 |
179 | 10:15 | Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction | Yifei Shi; Angel X. Chang; Zhelun Wu; Manolis Savva; Kai Xu | 214 | |
180 | 10:15 | Adaptively Connected Neural Networks | Guangrun Wang; Keze Wang; Liang Lin | 242 | |
181 | 10:15 | CrDoCo: Pixel-Level Domain Transfer With Cross-Domain Consistency | Yun-Chun Chen; Yen-Yu Lin; Ming-Hsuan Yang; Jia-Bin Huang | 281 | |
182 | 10:15 | Temporal Cycle-Consistency Learning | Debidatta Dwibedi; Yusuf Aytar; Jonathan Tompson; Pierre Sermanet; Andrew Zisserman | 282 | |
183 | 10:15 | Predicting Future Frames Using Retrospective Cycle GAN | Yong-Hoon Kwon; Min-Gyu Park | 369 | |
184 | 10:15 | Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization | Dongze Lian; Jing Li; Jia Zheng; Weixin Luo; Shenghua Gao | 390 | |
185 | 10:15 | TAFE-Net: Task-Aware Feature Embeddings for Low Shot Learning | Xin Wang; Fisher Yu; Ruth Wang; Trevor Darrell; Joseph E. Gonzalez | 440 | |
186 | 10:15 | Learning Semantic Segmentation From Synthetic Data: A Geometrically Guided Input-Output Adaptation Approach | Yuhua Chen; Wen Li; Xiaoran Chen; Luc Van Gool | 642 | |
187 | 10:15 | Attentive Single-Tasking of Multiple Tasks | Kevis-Kokitsi Maninis; Ilija Radosavovic; Iasonas Kokkinos | 651 | |
188 | 10:15 | Deep Metric Learning to Rank | Fatih Cakir; Kun He; Xide Xia; Brian Kulis; Stan Sclaroff | 718 | |
189 | 10:15 | End-To-End Multi-Task Learning With Attention | Shikun Liu; Edward Johns; Andrew J. Davison | 829 | |
190 | 10:15 | Self-Supervised Learning via Conditional Motion Propagation | Xiaohang Zhan; Xingang Pan; Ziwei Liu; Dahua Lin; Chen Change Loy | 1097 | |
191 | 10:15 | Bridging Stereo Matching and Optical Flow via Spatiotemporal Correspondence | Hsueh-Ying Lai; Yi-Hsuan Tsai; Wei-Chen Chiu | 1173 | |
192 | 10:15 | All About Structure: Adapting Structural Information Across Domains for Boosting Semantic Segmentation | Wei-Lun Chang; Hui-Po Wang; Wen-Hsiao Peng; Wei-Chen Chiu | 1176 | |
193 | 10:15 | Iterative Reorganization With Weak Spatial Constraints: Solving Arbitrary Jigsaw Puzzles for Unsupervised Representation Learning | Chen Wei; Lingxi Xie; Xutong Ren; Yingda Xia; Chi Su; Jiaying Liu; Qi Tian; Alan L. Yuille | 1199 | |
194 | 10:15 | Revisiting Self-Supervised Visual Representation Learning | Alexander Kolesnikov; Xiaohua Zhai; Lucas Beyer | 1306 | |
Language & Reasoning | 195 | 10:15 | It's Not About the Journey; It's About the Destination: Following Soft Paths Under Question-Guidance for Visual Reasoning | Monica Haurilet; Alina Roitberg; Rainer Stiefelhagen | 31 |
196 | 10:15 | Actively Seeking and Learning From Live Data | Damien Teney; Anton van den Hengel | 340 | |
197 | 10:15 | Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing | Xihui Liu; Zihao Wang; Jing Shao; Xiaogang Wang; Hongsheng Li | 409 | |
198 | 10:15 | Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks | Peng Wang; Qi Wu; Jiewei Cao; Chunhua Shen; Lianli Gao; Anton van den Hengel | 598 | |
199 | 10:15 | Scene Graph Generation With External Knowledge and Image Reconstruction | Jiuxiang Gu; Handong Zhao; Zhe Lin; Sheng Li; Jianfei Cai; Mingyang Ling | 622 | |
200 | 10:15 | Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval | Yale Song; Mohammad Soleymani | 674 | |
201 | 10:15 | MUREL: Multimodal Relational Reasoning for Visual Question Answering | Remi Cadene; Hedi Ben-younes; Matthieu Cord; Nicolas Thome | 846 | |
202 | 10:15 | Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering | Chenyou Fan; Xiaofan Zhang; Shu Zhang; Wensheng Wang; Chi Zhang; Heng Huang | 1022 | |
203 | 10:15 | Information Maximizing Visual Question Generation | Ranjay Krishna; Michael Bernstein; Li Fei-Fei | 1151 | |
204 | 10:15 | Learning to Detect Human-Object Interactions With Knowledge | Bingjie Xu; Yongkang Wong; Junnan Li; Qi Zhao; Mohan S. Kankanhalli | 1235 | |
205 | 10:15 | Learning Words by Drawing Images | Dídac Surís; Adrià Recasens; David Bau; David Harwath; James Glass; Antonio Torralba | 1298 | |
206 | 10:15 | Factor Graph Attention | Idan Schwartz; Seunghak Yu; Tamir Hazan; Alexander G. Schwing | 5401 | |
Applications, Medical, & Robotics | 207 | 10:15 | Reducing Uncertainty in Undersampled MRI Reconstruction With Active Acquisition | Zizhao Zhang; Adriana Romero; Matthew J. Muckley; Pascal Vincent; Lin Yang; Michal Drozdzal | 329 |
208 | 10:15 | ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification | Fangneng Zhan; Shijian Lu | 389 | |
209 | 10:15 | ROI-10D: Monocular Lifting of 2D Detection to 6D Pose and Metric Shape | Fabian Manhardt; Wadim Kehl; Adrien Gaidon | 545 | |
210 | 10:15 | Collaborative Learning of Semi-Supervised Segmentation and Classification for Medical Images | Yi Zhou; Xiaodong He; Lei Huang; Li Liu; Fan Zhu; Shanshan Cui; Ling Shao | 810 | |
211 | 10:15 | Biologically-Constrained Graphs for Global Connectomics Reconstruction | Brian Matejek; Daniel Haehn; Haidong Zhu; Donglai Wei; Toufiq Parag; Hanspeter Pfister | 949 | |
212 | 10:15 | P3SGD: Patient Privacy Preserving SGD for Regularizing Deep CNNs in Pathological Image Classification | Bingzhe Wu; Shiwan Zhao; Guangyu Sun; Xiaolu Zhang; Zhong Su; Caihong Zeng; Zhihong Liu | 974 | |
213 | 10:15 | Elastic Boundary Projection for 3D Medical Image Segmentation | Tianwei Ni; Lingxi Xie; Huangjie Zheng; Elliot K. Fishman; Alan L. Yuille | 1070 | |
214 | 10:15 | SIXray: A Large-Scale Security Inspection X-Ray Benchmark for Prohibited Item Discovery in Overlapping Images | Caijing Miao; Lingxi Xie; Fang Wan; Chi Su; Hongye Liu; Jianbin Jiao; Qixiang Ye | 1195 | |
215 | 10:15 | Noise2Void - Learning Denoising From Single Noisy Images | Alexander Krull; Tim-Oliver Buchholz; Florian Jug | 5541 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Recognition | 18 | 13:30 | Joint Discriminative and Generative Learning for Person Re-Identification | Zhedong Zheng; Xiaodong Yang; Zhiding Yu; Liang Zheng; Yi Yang; Jan Kautz | 16 |
19 | 13:35 | Unsupervised Person Re-Identification by Soft Multilabel Learning | Hong-Xing Yu; Wei-Shi Zheng; Ancong Wu; Xiaowei Guo; Shaogang Gong; Jian-Huang Lai | 522 | |
20 | 13:40 | Learning Context Graph for Person Search | Yichao Yan; Qiang Zhang; Bingbing Ni; Wendong Zhang; Minghao Xu; Xiaokang Yang | 2262 | |
21 | 13:48 | Gradient Matching Generative Networks for Zero-Shot Learning | Mert Bulent Sariyildiz; Ramazan Gokberk Cinbis | 220 | |
22 | 13:53 | Doodle to Search: Practical Zero-Shot Sketch-Based Image Retrieval | Sounak Dey; Pau Riba; Anjan Dutta; Josep Lladós; Yi-Zhe Song | 4499 | |
23 | 13:58 | Zero-Shot Task Transfer | Arghya Pal; Vineeth N Balasubramanian | 5230 | |
24 | 14:06 | C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection | Fang Wan; Chang Liu; Wei Ke; Xiangyang Ji; Jianbin Jiao; Qixiang Ye | 906 | |
25 | 14:11 | Weakly Supervised Learning of Instance Segmentation With Inter-Pixel Relations | Jiwoon Ahn; Sunghyun Cho; Suha Kwak | 2973 | |
26 | 14:16 | Attention-Based Dropout Layer for Weakly Supervised Object Localization | Junsuk Choe; Hyunjung Shim | 3916 | |
27 | 14:24 | Domain Generalization by Solving Jigsaw Puzzles | Fabio M. Carlucci; Antonio D'Innocente; Silvia Bucci; Barbara Caputo; Tatiana Tommasi | 1019 | |
28 | 14:29 | Transferrable Prototypical Networks for Unsupervised Domain Adaptation | Yingwei Pan; Ting Yao; Yehao Li; Yu Wang; Chong-Wah Ngo; Tao Mei | 3628 | |
29 | 14:34 | Blending-Target Domain Adaptation by Adversarial Meta-Adaptation Networks | Ziliang Chen; Jingyu Zhuang; Xiaodan Liang; Liang Lin | 1182 | |
30 | 14:42 | ELASTIC: Improving CNNs With Dynamic Scaling Policies | Huiyu Wang; Aniruddha Kembhavi; Ali Farhadi; Alan L. Yuille; Mohammad Rastegari | 1113 | |
31 | 14:47 | ScratchDet: Training Single-Shot Object Detectors From Scratch | Rui Zhu; Shifeng Zhang; Xiaobo Wang; Longyin Wen; Hailin Shi; Liefeng Bo; Tao Mei | 1782 | |
32 | 14:52 | SFNet: Learning Object-Aware Semantic Correspondence | Junghyup Lee; Dohyung Kim; Jean Ponce; Bumsub Ham | 3294 | |
33 | 15:00 | Deep Metric Learning Beyond Binary Supervision | Sungyeon Kim; Minkyo Seo; Ivan Laptev; Minsu Cho; Suha Kwak | 1294 | |
34 | 15:05 | Learning to Cluster Faces on an Affinity Graph | Lei Yang; Xiaohang Zhan; Dapeng Chen; Junjie Yan; Chen Change Loy; Dahua Lin | 1510 | |
35 | 15:10 | C2AE: Class Conditioned Auto-Encoder for Open-Set Recognition | Poojan Oza; Vishal M. Patel | 1610 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Synthesis | 118 | 13:30 | Shapes and Context: In-The-Wild Image Synthesis & Manipulation | Aayush Bansal; Yaser Sheikh; Deva Ramanan | 1426 |
119 | 13:35 | Semantics Disentangling for Text-To-Image Generation | Guojun Yin; Bin Liu; Lu Sheng; Nenghai Yu; Xiaogang Wang; Jing Shao | 462 | |
120 | 13:40 | Semantic Image Synthesis With Spatially-Adaptive Normalization | Taesung Park; Ming-Yu Liu; Ting-Chun Wang; Jun-Yan Zhu | 2072 | |
121 | 13:48 | Progressive Pose Attention Transfer for Person Image Generation | Zhen Zhu; Tengteng Huang; Baoguang Shi; Miao Yu; Bofei Wang; Xiang Bai | 609 | |
122 | 13:53 | Unsupervised Person Image Generation With Semantic Parsing Transformation | Sijie Song; Wei Zhang; Jiaying Liu; Tao Mei | 3269 | |
123 | 13:58 | DeepView: View Synthesis With Learned Gradient Descent | John Flynn; Michael Broxton; Paul Debevec; Matthew DuVall; Graham Fyffe; Ryan Overbeck; Noah Snavely; Richard Tucker | 2439 | |
124 | 14:06 | Animating Arbitrary Objects via Deep Motion Transfer | Aliaksandr Siarohin; Stéphane Lathuilière; Sergey Tulyakov; Elisa Ricci; Nicu Sebe | 4908 | |
125 | 14:11 | Textured Neural Avatars | Aliaksandra Shysheya; Egor Zakharov; Kara-Ali Aliev; Renat Bashirov; Egor Burkov; Karim Iskakov; Aleksei Ivakhnenko; Yury Malkov; Igor Pasechnik; Dmitry Ulyanov; Alexander Vakhitov; Victor Lempitsky | 5428 | |
126 | 14:16 | IM-Net for High Resolution Video Frame Interpolation | Tomer Peleg; Pablo Szekely; Doron Sabo; Omry Sendik | 3190 | |
127 | 14:24 | Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation | Ying-Cong Chen; Xiaogang Xu; Zhuotao Tian; Jiaya Jia | 1240 | |
128 | 14:29 | Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation | Hao Tang; Dan Xu; Nicu Sebe; Yanzhi Wang; Jason J. Corso; Yan Yan | 3069 | |
129 | 14:34 | Geometry-Consistent Generative Adversarial Networks for One-Sided Unsupervised Domain Mapping | Huan Fu; Mingming Gong; Chaohui Wang; Kayhan Batmanghelich; Kun Zhang; Dacheng Tao | 4341 | |
130 | 14:42 | DeepVoxels: Learning Persistent 3D Feature Embeddings | Vincent Sitzmann; Justus Thies; Felix Heide; Matthias Nießner; Gordon Wetzstein; Michael Zollhöfer | 3521 | |
131 | 14:47 | Inverse Path Tracing for Joint Material and Lighting Estimation | Dejan Azinović; Tzu-Mao Li; Anton Kaplanyan; Matthias Nießner | 5944 | |
132 | 14:52 | The Visual Centrifuge: Model-Free Layered Video Representations | Jean-Baptiste Alayrac; João Carreira; Andrew Zisserman | 4057 | |
133 | 15:00 | Label-Noise Robust Generative Adversarial Networks | Takuhiro Kaneko; Yoshitaka Ushiku; Tatsuya Harada | 5720 | |
134 | 15:05 | DLOW: Domain Flow for Adaptation and Generalization | Rui Gong; Wen Li; Yuhua Chen; Luc Van Gool | 5766 | |
135 | 15:10 | CollaGAN: Collaborative GAN for Missing Image Data Imputation | Dongwook Lee; Junyoung Kim; Won-Jin Moon; Jong Chul Ye | 6970 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Scenes & Representation | 166 | 13:30 | d-SNE: Domain Adaptation Using Stochastic Neighborhood Embedding | Xiang Xu; Xiong Zhou; Ragav Venkatesan; Gurumurthy Swaminathan; Orchid Majumder | 6592 |
167 | 13:35 | Taking a Closer Look at Domain Shift: Category-Level Adversaries for Semantics Consistent Domain Adaptation | Yawei Luo; Liang Zheng; Tao Guan; Junqing Yu; Yi Yang | 197 | |
168 | 13:40 | ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation | Tuan-Hung Vu; Himalaya Jain; Maxime Bucher; Matthieu Cord; Patrick Pérez | 396 | |
169 | 13:48 | ContextDesc: Local Descriptor Augmentation With Cross-Modality Context | Zixin Luo; Tianwei Shen; Lei Zhou; Jiahui Zhang; Yao Yao; Shiwei Li; Tian Fang; Long Quan | 325 | |
170 | 13:53 | Large-Scale Long-Tailed Recognition in an Open World | Ziwei Liu; Zhongqi Miao; Xiaohang Zhan; Jiayun Wang; Boqing Gong; Stella X. Yu | 556 | |
171 | 13:58 | AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations Rather Than Data | Liheng Zhang; Guo-Jun Qi; Liqiang Wang; Jiebo Luo | 5137 | |
172 | 14:06 | SDC – Stacked Dilated Convolution: A Unified Descriptor Network for Dense Matching Tasks | René Schuster; Oliver Wasenmüller; Christian Unger; Didier Stricker | 576 | |
173 | 14:11 | Learning Correspondence From the Cycle-Consistency of Time | Xiaolong Wang; Allan Jabri; Alexei A. Efros | 2746 | |
174 | 14:16 | AE2-Nets: Autoencoder in Autoencoder Networks | Changqing Zhang; Yeqing Liu; Huazhu Fu | 2131 | |
175 | 14:24 | Mitigating Information Leakage in Image Representations: A Maximum Entropy Approach | Proteek Chandan Roy; Vishnu Naresh Boddeti | 1655 | |
176 | 14:29 | Learning Spatial Common Sense With Geometry-Aware Recurrent Networks | Hsiao-Yu Fish Tung; Ricson Cheng; Katerina Fragkiadaki | 3877 | |
177 | 14:34 | Structured Knowledge Distillation for Semantic Segmentation | Yifan Liu; Ke Chen; Chris Liu; Zengchang Qin; Zhenbo Luo; Jingdong Wang | 3147 | |
178 | 14:42 | Scan2CAD: Learning CAD Model Alignment in RGB-D Scans | Armen Avetisyan; Manuel Dahnert; Angela Dai; Manolis Savva; Angel X. Chang; Matthias Nießner | 977 | |
179 | 14:47 | Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation | Po-Yi Chen; Alexander H. Liu; Yen-Cheng Liu; Yu-Chiang Frank Wang | 2799 | |
180 | 14:52 | Tell Me Where I Am: Object-Level Scene Context Prediction | Xiaotian Qiao; Quanlong Zheng; Ying Cao; Rynson W.H. Lau | 3107 | |
181 | 15:00 | Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation | He Wang; Srinath Sridhar; Jingwei Huang; Julien Valentin; Shuran Song; Leonidas J. Guibas | 1373 | |
182 | 15:05 | Supervised Fitting of Geometric Primitives to 3D Point Clouds | Lingxiao Li; Minhyuk Sung; Anastasia Dubrovina; Li Yi; Leonidas J. Guibas | 2452 | |
183 | 15:10 | Do Better ImageNet Models Transfer Better? | Simon Kornblith; Jonathon Shlens; Quoc V. Le | 4225 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 15:20 | Gotta Adapt 'Em All: Joint Pixel and Feature-Level Domain Adaptation for Recognition in the Wild | Luan Tran; Kihyuk Sohn; Xiang Yu; Xiaoming Liu; Manmohan Chandraker | 513 |
2 | 15:20 | Understanding the Disharmony Between Dropout and Batch Normalization by Variance Shift | Xiang Li; Shuo Chen; Xiaolin Hu; Jian Yang | 1438 | |
3 | 15:20 | Circulant Binary Convolutional Networks: Enhancing the Performance of 1-Bit DCNNs With Circulant Back Propagation | Chunlei Liu; Wenrui Ding; Xin Xia; Baochang Zhang; Jiaxin Gu; Jianzhuang Liu; Rongrong Ji; David Doermann | 1489 | |
4 | 15:20 | DeFusionNET: Defocus Blur Detection via Recurrently Fusing and Refining Multi-Scale Deep Features | Chang Tang; Xinzhong Zhu; Xinwang Liu; Lizhe Wang; Albert Zomaya | 1536 | |
5 | 15:20 | Deep Virtual Networks for Memory Efficient Inference of Multiple Tasks | Eunwoo Kim; Chanho Ahn; Philip H.S. Torr; Songhwai Oh | 1574 | |
6 | 15:20 | Universal Domain Adaptation | Kaichao You; Mingsheng Long; Zhangjie Cao; Jianmin Wang; Michael I. Jordan | 1628 | |
7 | 15:20 | Improving Transferability of Adversarial Examples With Input Diversity | Cihang Xie; Zhishuai Zhang; Yuyin Zhou; Song Bai; Jianyu Wang; Zhou Ren; Alan L. Yuille | 1673 | |
8 | 15:20 | Sequence-To-Sequence Domain Adaptation Network for Robust Text Image Recognition | Yaping Zhang; Shuai Nie; Wenju Liu; Xing Xu; Dongxiang Zhang; Heng Tao Shen | 1786 | |
9 | 15:20 | Hybrid-Attention Based Decoupled Metric Learning for Zero-Shot Image Retrieval | Binghui Chen; Weihong Deng | 1848 | |
10 | 15:20 | Learning to Sample | Oren Dovrat; Itai Lang; Shai Avidan | 1910 | |
11 | 15:20 | Few-Shot Learning via Saliency-Guided Hallucination of Samples | Hongguang Zhang; Jing Zhang; Piotr Koniusz | 1937 | |
12 | 15:20 | Variational Convolutional Neural Network Pruning | Chenglong Zhao; Bingbing Ni; Jian Zhang; Qiwei Zhao; Wenjun Zhang; Qi Tian | 1961 | |
13 | 15:20 | Towards Optimal Structured CNN Pruning via Generative Adversarial Learning | Shaohui Lin; Rongrong Ji; Chenqian Yan; Baochang Zhang; Liujuan Cao; Qixiang Ye; Feiyue Huang; David Doermann | 1983 | |
14 | 15:20 | Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression | Yuchao Li; Shaohui Lin; Baochang Zhang; Jianzhuang Liu; David Doermann; Yongjian Wu; Feiyue Huang; Rongrong Ji | 1989 | |
15 | 15:20 | Fully Quantized Network for Object Detection | Rundong Li; Yan Wang; Feng Liang; Hongwei Qin; Junjie Yan; Rui Fan | 2110 | |
16 | 15:20 | MnasNet: Platform-Aware Neural Architecture Search for Mobile | Mingxing Tan; Bo Chen; Ruoming Pang; Vijay Vasudevan; Mark Sandler; Andrew Howard; Quoc V. Le | 2124 | |
17 | 15:20 | Student Becoming the Master: Knowledge Amalgamation for Joint Scene Parsing, Depth Estimation, and More | Jingwen Ye; Yixin Ji; Xinchao Wang; Kairi Ou; Dapeng Tao; Mingli Song | 2205 | |
Recognition | 18 | 15:20 | Joint Discriminative and Generative Learning for Person Re-Identification | Zhedong Zheng; Xiaodong Yang; Zhiding Yu; Liang Zheng; Yi Yang; Jan Kautz | 16 |
19 | 15:20 | Unsupervised Person Re-Identification by Soft Multilabel Learning | Hong-Xing Yu; Wei-Shi Zheng; Ancong Wu; Xiaowei Guo; Shaogang Gong; Jian-Huang Lai | 522 | |
20 | 15:20 | Learning Context Graph for Person Search | Yichao Yan; Qiang Zhang; Bingbing Ni; Wendong Zhang; Minghao Xu; Xiaokang Yang | 2262 | |
21 | 15:20 | Gradient Matching Generative Networks for Zero-Shot Learning | Mert Bulent Sariyildiz; Ramazan Gokberk Cinbis | 220 | |
22 | 15:20 | Doodle to Search: Practical Zero-Shot Sketch-Based Image Retrieval | Sounak Dey; Pau Riba; Anjan Dutta; Josep Lladós; Yi-Zhe Song | 4499 | |
23 | 15:20 | Zero-Shot Task Transfer | Arghya Pal; Vineeth N Balasubramanian | 5230 | |
24 | 15:20 | C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection | Fang Wan; Chang Liu; Wei Ke; Xiangyang Ji; Jianbin Jiao; Qixiang Ye | 906 | |
25 | 15:20 | Weakly Supervised Learning of Instance Segmentation With Inter-Pixel Relations | Jiwoon Ahn; Sunghyun Cho; Suha Kwak | 2973 | |
26 | 15:20 | Attention-Based Dropout Layer for Weakly Supervised Object Localization | Junsuk Choe; Hyunjung Shim | 3916 | |
27 | 15:20 | Domain Generalization by Solving Jigsaw Puzzles | Fabio M. Carlucci; Antonio D'Innocente; Silvia Bucci; Barbara Caputo; Tatiana Tommasi | 1019 | |
28 | 15:20 | Transferrable Prototypical Networks for Unsupervised Domain Adaptation | Yingwei Pan; Ting Yao; Yehao Li; Yu Wang; Chong-Wah Ngo; Tao Mei | 3628 | |
29 | 15:20 | Blending-Target Domain Adaptation by Adversarial Meta-Adaptation Networks | Ziliang Chen; Jingyu Zhuang; Xiaodan Liang; Liang Lin | 1182 | |
30 | 15:20 | ELASTIC: Improving CNNs With Dynamic Scaling Policies | Huiyu Wang; Aniruddha Kembhavi; Ali Farhadi; Alan L. Yuille; Mohammad Rastegari | 1113 | |
31 | 15:20 | ScratchDet: Training Single-Shot Object Detectors From Scratch | Rui Zhu; Shifeng Zhang; Xiaobo Wang; Longyin Wen; Hailin Shi; Liefeng Bo; Tao Mei | 1782 | |
32 | 15:20 | SFNet: Learning Object-Aware Semantic Correspondence | Junghyup Lee; Dohyung Kim; Jean Ponce; Bumsub Ham | 3294 | |
33 | 15:20 | Deep Metric Learning Beyond Binary Supervision | Sungyeon Kim; Minkyo Seo; Ivan Laptev; Minsu Cho; Suha Kwak | 1294 | |
34 | 15:20 | Learning to Cluster Faces on an Affinity Graph | Lei Yang; Xiaohang Zhan; Dapeng Chen; Junjie Yan; Chen Change Loy; Dahua Lin | 1510 | |
35 | 15:20 | C2AE: Class Conditioned Auto-Encoder for Open-Set Recognition | Poojan Oza; Vishal M. Patel | 1610 | |
36 | 15:20 | K-Nearest Neighbors Hashing | Xiangyu He; Peisong Wang; Jian Cheng | 602 | |
37 | 15:20 | Learning RoI Transformer for Oriented Object Detection in Aerial Images | Jian Ding; Nan Xue; Yang Long; Gui-Song Xia; Qikai Lu | 1393 | |
38 | 15:20 | Snapshot Distillation: Teacher-Student Optimization in One Generation | Chenglin Yang; Lingxi Xie; Chi Su; Alan L. Yuille | 1429 | |
39 | 15:20 | Geometry-Aware Distillation for Indoor Semantic Segmentation | Jianbo Jiao; Yunchao Wei; Zequn Jie; Honghui Shi; Rynson W.H. Lau; Thomas S. Huang | 1440 | |
40 | 15:20 | LiveSketch: Query Perturbations for Guided Sketch-Based Visual Search | John Collomosse; Tu Bui; Hailin Jin | 1455 | |
41 | 15:20 | Bounding Box Regression With Uncertainty for Accurate Object Detection | Yihui He; Chenchen Zhu; Jianren Wang; Marios Savvides; Xiangyu Zhang | 1525 | |
42 | 15:20 | OCGAN: One-Class Novelty Detection Using GANs With Constrained Latent Representations | Pramuditha Perera; Ramesh Nallapati; Bing Xiang | 1528 | |
43 | 15:20 | Learning Metrics From Teachers: Compact Networks for Image Embedding | Lu Yu; Vacit Oguz Yazici; Xialei Liu; Joost van de Weijer; Yongmei Cheng; Arnau Ramisa | 1651 | |
44 | 15:20 | Activity Driven Weakly Supervised Object Detection | Zhenheng Yang; Dhruv Mahajan; Deepti Ghadiyaram; Ram Nevatia; Vignesh Ramanathan | 1681 | |
45 | 15:20 | Separate to Adapt: Open Set Domain Adaptation via Progressive Separation | Hong Liu; Zhangjie Cao; Mingsheng Long; Jianmin Wang; Qiang Yang | 1738 | |
46 | 15:20 | Layout-Graph Reasoning for Fashion Landmark Detection | Weijiang Yu; Xiaodan Liang; Ke Gong; Chenhan Jiang; Nong Xiao; Liang Lin | 1810 | |
47 | 15:20 | DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs | Erkun Yang; Tongliang Liu; Cheng Deng; Wei Liu; Dacheng Tao | 1828 | |
48 | 15:20 | Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks | Junjie Zhang; Qi Wu; Jian Zhang; Chunhua Shen; Jianfeng Lu | 1834 | |
49 | 15:20 | Region Proposal by Guided Anchoring | Jiaqi Wang; Kai Chen; Shuo Yang; Chen Change Loy; Dahua Lin | 1843 | |
50 | 15:20 | Distant Supervised Centroid Shift: A Simple and Efficient Approach to Visual Domain Adaptation | Jian Liang; Ran He; Zhenan Sun; Tieniu Tan | 1847 | |
51 | 15:20 | Learning to Transfer Examples for Partial Domain Adaptation | Zhangjie Cao; Kaichao You; Mingsheng Long; Jianmin Wang; Qiang Yang | 1855 | |
52 | 15:20 | Generalized Zero-Shot Recognition Based on Visually Semantic Embedding | Pengkai Zhu; Hanxiao Wang; Venkatesh Saligrama | 1915 | |
53 | 15:20 | Towards Visual Feature Translation | Jie Hu; Rongrong Ji; Hong Liu; Shengchuan Zhang; Cheng Deng; Qi Tian | 1962 | |
54 | 15:20 | Amodal Instance Segmentation With KINS Dataset | Lu Qi; Li Jiang; Shu Liu; Xiaoyong Shen; Jiaya Jia | 1997 | |
55 | 15:20 | Global Second-Order Pooling Convolutional Networks | Zilin Gao; Jiangtao Xie; Qilong Wang; Peihua Li | 2016 | |
56 | 15:20 | Weakly Supervised Complementary Parts Models for Fine-Grained Image Classification From the Bottom Up | Weifeng Ge; Xiangru Lin; Yizhou Yu | 2125 | |
57 | 15:20 | NetTailor: Tuning the Architecture, Not Just the Weights | Pedro Morgado; Nuno Vasconcelos | 2139 | |
Segmentation, Grouping, & Shape | 58 | 15:20 | Learning-Based Sampling for Natural Image Matting | Jingwei Tang; Yağiz Aksoy; Cengiz Öztireli; Markus Gross; Tunç Ozan Aydin | 1358 |
59 | 15:20 | Learning Unsupervised Video Object Segmentation Through Visual Attention | Wenguan Wang; Hongmei Song; Shuyang Zhao; Jianbing Shen; Sanyuan Zhao; Steven C. H. Hoi; Haibin Ling | 1437 | |
60 | 15:20 | 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks | Christopher Choy; JunYoung Gwak; Silvio Savarese | 1523 | |
61 | 15:20 | Pyramid Feature Attention Network for Saliency Detection | Ting Zhao; Xiangqian Wu | 1642 | |
62 | 15:20 | Co-Saliency Detection via Mask-Guided Fully Convolutional Networks With Multi-Scale Label Smoothing | Kaihua Zhang; Tengpeng Li; Bo Liu; Qingshan Liu | 1682 | |
63 | 15:20 | SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation – A Synthetic Dataset and Baselines | Yuan-Ting Hu; Hong-Shuo Chen; Kexin Hui; Jia-Bin Huang; Alexander G. Schwing | 1750 | |
64 | 15:20 | Learning Instance Activation Maps for Weakly Supervised Instance Segmentation | Yi Zhu; Yanzhao Zhou; Huijuan Xu; Qixiang Ye; David Doermann; Jianbin Jiao | 1798 | |
65 | 15:20 | Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation | Zhi Tian; Tong He; Chunhua Shen; Youliang Yan | 1970 | |
66 | 15:20 | Box-Driven Class-Wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation | Chunfeng Song; Yan Huang; Wanli Ouyang; Liang Wang | 1998 | |
67 | 15:20 | Dual Attention Network for Scene Segmentation | Jun Fu; Jing Liu; Haijie Tian; Yong Li; Yongjun Bao; Zhiwei Fang; Hanqing Lu | 2137 | |
Statistics, Physics, Theory, & Datasets | 68 | 15:20 | InverseRenderNet: Learning Single Image Inverse Rendering | Ye Yu; William A. P. Smith | 1444 |
69 | 15:20 | A Variational Auto-Encoder Model for Stochastic Point Processes | Nazanin Mehrasa; Akash Abdu Jyothi; Thibaut Durand; Jiawei He; Leonid Sigal; Greg Mori | 1470 | |
70 | 15:20 | Unifying Heterogeneous Classifiers With Distillation | Jayakorn Vongkulbhisal; Phongtharin Vinayavekhin; Marco Visentini-Scarzanella | 1558 | |
71 | 15:20 | Assessment of Faster R-CNN in Man-Machine Collaborative Search | Arturo Deza; Amit Surana; Miguel P. Eckstein | 1606 | |
72 | 15:20 | OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge | Kenneth Marino; Mohammad Rastegari; Ali Farhadi; Roozbeh Mottaghi | 1758 | |
73 | 15:20 | NDDR-CNN: Layerwise Feature Fusing in Multi-Task CNNs by Neural Discriminative Dimensionality Reduction | Yuan Gao; Jiayi Ma; Mingbo Zhao; Wei Liu; Alan L. Yuille | 1835 | |
74 | 15:20 | Spectral Metric for Dataset Complexity Assessment | Frédéric Branchaud-Charron; Andrew Achkar; Pierre-Marc Jodoin | 1932 | |
75 | 15:20 | ADCrowdNet: An Attention-Injective Deformable Convolutional Network for Crowd Understanding | Ning Liu; Yongchao Long; Changqing Zou; Qun Niu; Li Pan; Hefeng Wu | 1994 | |
76 | 15:20 | VERI-Wild: A Large Dataset and a New Method for Vehicle Re-Identification in the Wild | Yihang Lou; Yan Bai; Jun Liu; Shiqi Wang; Lingyu Duan | 2161 | |
3D Multiview | 77 | 15:20 | 3D Local Features for Direct Pairwise Registration | Haowen Deng; Tolga Birdal; Slobodan Ilic | 109 |
78 | 15:20 | HPLFlowNet: Hierarchical Permutohedral Lattice FlowNet for Scene Flow Estimation on Large-Scale Point Clouds | Xiuye Gu; Yijie Wang; Chongruo Wu; Yong Jae Lee; Panqu Wang | 1396 | |
79 | 15:20 | GPSfM: Global Projective SFM Using Algebraic Constraints on Multi-View Fundamental Matrices | Yoni Kasten; Amnon Geifman; Meirav Galun; Ronen Basri | 1409 | |
80 | 15:20 | Group-Wise Correlation Stereo Network | Xiaoyang Guo; Kai Yang; Wukui Yang; Xiaogang Wang; Hongsheng Li | 1410 | |
81 | 15:20 | Multi-Level Context Ultra-Aggregation for Stereo Matching | Guang-Yu Nie; Ming-Ming Cheng; Yun Liu; Zhengfa Liang; Deng-Ping Fan; Yue Liu; Yongtian Wang | 1562 | |
82 | 15:20 | Large-Scale, Metric Structure From Motion for Unordered Light Fields | Sotiris Nousias; Manolis Lourakis; Christos Bergeles | 1781 | |
83 | 15:20 | Understanding the Limitations of CNN-Based Absolute Camera Pose Regression | Torsten Sattler; Qunjie Zhou; Marc Pollefeys; Laura Leal-Taixé | 1887 | |
84 | 15:20 | DeepLiDAR: Deep Surface Normal Guided Depth Prediction for Outdoor Scene From Sparse LiDAR Data and Single Color Image | Jiaxiong Qiu; Zhaopeng Cui; Yinda Zhang; Xingdi Zhang; Shuaicheng Liu; Bing Zeng; Marc Pollefeys | 1899 | |
85 | 15:20 | Modeling Point Clouds With Self-Attention and Gumbel Subset Sampling | Jiancheng Yang; Qiang Zhang; Bingbing Ni; Linguo Li; Jinxian Liu; Mengdie Zhou; Qi Tian | 2003 | |
86 | 15:20 | Learning With Batch-Wise Optimal Transport Loss for 3D Shape Recognition | Lin Xu; Han Sun; Yuai Liu | 2043 | |
87 | 15:20 | DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion | Chen Wang; Danfei Xu; Yuke Zhu; Roberto Martín-Martín; Cewu Lu; Li Fei-Fei; Silvio Savarese | 2066 | |
3D Single View & RGBD | 88 | 15:20 | Dense Depth Posterior (DDP) From Single Image and Sparse Range | Yanchao Yang; Alex Wong; Stefano Soatto | 1365 |
89 | 15:20 | DuLa-Net: A Dual-Projection Network for Estimating Room Layouts From a Single RGB Panorama | Shang-Ta Yang; Fu-En Wang; Chi-Han Peng; Peter Wonka; Min Sun; Hung-Kuo Chu | 1441 | |
90 | 15:20 | Veritatem Dies Aperit - Temporally Consistent Depth Prediction Enabled by a Multi-Task Geometric and Semantic Scene Understanding Approach | Amir Atapour-Abarghouei; Toby P. Breckon | 1638 | |
91 | 15:20 | Segmentation-Driven 6D Object Pose Estimation | Yinlin Hu; Joachim Hugonot; Pascal Fua; Mathieu Salzmann | 1717 | |
92 | 15:20 | Exploiting Temporal Context for 3D Human Pose Estimation in the Wild | Anurag Arnab; Carl Doersch; Andrew Zisserman | 1884 | |
93 | 15:20 | What Do Single-View 3D Reconstruction Networks Learn? | Maxim Tatarchenko; Stephan R. Richter; René Ranftl; Zhuwen Li; Vladlen Koltun; Thomas Brox | 2029 | |
Face & Body | 94 | 15:20 | UniformFace: Learning Deep Equidistributed Representation for Face Recognition | Yueqi Duan; Jiwen Lu; Jie Zhou | 391 |
95 | 15:20 | Semantic Graph Convolutional Networks for 3D Human Pose Regression | Long Zhao; Xi Peng; Yu Tian; Mubbasir Kapadia; Dimitris N. Metaxas | 1418 | |
96 | 15:20 | Mask-Guided Portrait Editing With Conditional GANs | Shuyang Gu; Jianmin Bao; Hao Yang; Dong Chen; Fang Wen; Lu Yuan | 1427 | |
97 | 15:20 | Group Sampling for Scale Invariant Face Detection | Xiang Ming; Fangyun Wei; Ting Zhang; Dong Chen; Fang Wen | 1477 | |
98 | 15:20 | Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation | Yong Zhang; Baoyuan Wu; Weiming Dong; Zhifeng Li; Wei Liu; Bao-Gang Hu; Qiang Ji | 1632 | |
99 | 15:20 | Semantic Alignment: Finding Semantically Consistent Ground-Truth for Facial Landmark Detection | Zhiwei Liu; Xiangyu Zhu; Guosheng Hu; Haiyun Guo; Ming Tang; Zhen Lei; Neil M. Robertson; Jinqiao Wang | 1692 | |
100 | 15:20 | LAEO-Net: Revisiting People Looking at Each Other in Videos | Manuel J. Marín-Jiménez; Vicky Kalogeiton; Pablo Medina-Suárez; Andrew Zisserman | 1725 | |
101 | 15:20 | Robust Facial Landmark Detection via Occlusion-Adaptive Deep Networks | Meilu Zhu; Daming Shi; Mingjie Zheng; Muhammad Sadiq | 1730 | |
102 | 15:20 | Learning Individual Styles of Conversational Gesture | Shiry Ginosar; Amir Bar; Gefen Kohavi; Caroline Chan; Andrew Owens; Jitendra Malik | 1766 | |
103 | 15:20 | Face Anti-Spoofing: Model Matters, so Does Data | Xiao Yang; Wenhan Luo; Linchao Bao; Yuan Gao; Dihong Gong; Shibao Zheng; Zhifeng Li; Wei Liu | 1863 | |
104 | 15:20 | Fast Human Pose Estimation | Feng Zhang; Xiatian Zhu; Mao Ye | 1870 | |
105 | 15:20 | Decorrelated Adversarial Learning for Age-Invariant Face Recognition | Hao Wang; Dihong Gong; Zhifeng Li; Wei Liu | 2019 | |
Action & Video | 106 | 15:20 | Cross-Task Weakly Supervised Learning From Instructional Videos | Dimitri Zhukov; Jean-Baptiste Alayrac; Ramazan Gokberk Cinbis; David Fouhey; Ivan Laptev; Josef Sivic | 1360 |
107 | 15:20 | D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation | Chien-Yi Chang; De-An Huang; Yanan Sui; Li Fei-Fei; Juan Carlos Niebles | 1466 | |
108 | 15:20 | Progressive Teacher-Student Learning for Early Action Prediction | Xionghui Wang; Jian-Fang Hu; Jian-Huang Lai; Jianguo Zhang; Wei-Shi Zheng | 1584 | |
109 | 15:20 | Social Relation Recognition From Videos via Multi-Scale Spatial-Temporal Reasoning | Xinchen Liu; Wu Liu; Meng Zhang; Jingwen Chen; Lianli Gao; Chenggang Yan; Tao Mei | 1711 | |
110 | 15:20 | MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation | Yazan Abu Farha; Jürgen Gall | 1726 | |
111 | 15:20 | Transferable Interactiveness Knowledge for Human-Object Interaction Detection | Yong-Lu Li; Siyuan Zhou; Xijie Huang; Liang Xu; Ze Ma; Hao-Shu Fang; Yanfeng Wang; Cewu Lu | 1759 | |
112 | 15:20 | Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition | Maosen Li; Siheng Chen; Xu Chen; Ya Zhang; Yanfeng Wang; Qi Tian | 1911 | |
113 | 15:20 | Multi-Granularity Generator for Temporal Action Proposal | Yuan Liu; Lin Ma; Yifeng Zhang; Wei Liu; Shih-Fu Chang | 1995 | |
Motion & Biometrics | 114 | 15:20 | Deep Rigid Instance Scene Flow | Wei-Chiu Ma; Shenlong Wang; Rui Hu; Yuwen Xiong; Raquel Urtasun | 1421 |
115 | 15:20 | See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks | Xiankai Lu; Wenguan Wang; Chao Ma; Jianbing Shen; Ling Shao; Fatih Porikli | 1570 | |
116 | 15:20 | Patch-Based Discriminative Feature Learning for Unsupervised Person Re-Identification | Qize Yang; Hong-Xing Yu; Ancong Wu; Wei-Shi Zheng | 1894 | |
117 | 15:20 | SPM-Tracker: Series-Parallel Matching for Real-Time Visual Object Tracking | Guangting Wang; Chong Luo; Zhiwei Xiong; Wenjun Zeng | 2104 | |
Synthesis | 118 | 15:20 | Shapes and Context: In-The-Wild Image Synthesis & Manipulation | Aayush Bansal; Yaser Sheikh; Deva Ramanan | 1426 |
119 | 15:20 | Semantics Disentangling for Text-To-Image Generation | Guojun Yin; Bin Liu; Lu Sheng; Nenghai Yu; Xiaogang Wang; Jing Shao | 462 | |
120 | 15:20 | Semantic Image Synthesis With Spatially-Adaptive Normalization | Taesung Park; Ming-Yu Liu; Ting-Chun Wang; Jun-Yan Zhu | 2072 | |
121 | 15:20 | Progressive Pose Attention Transfer for Person Image Generation | Zhen Zhu; Tengteng Huang; Baoguang Shi; Miao Yu; Bofei Wang; Xiang Bai | 609 | |
122 | 15:20 | Unsupervised Person Image Generation With Semantic Parsing Transformation | Sijie Song; Wei Zhang; Jiaying Liu; Tao Mei | 3269 | |
123 | 15:20 | DeepView: View Synthesis With Learned Gradient Descent | John Flynn; Michael Broxton; Paul Debevec; Matthew DuVall; Graham Fyffe; Ryan Overbeck; Noah Snavely; Richard Tucker | 2439 | |
124 | 15:20 | Animating Arbitrary Objects via Deep Motion Transfer | Aliaksandr Siarohin; Stéphane Lathuilière; Sergey Tulyakov; Elisa Ricci; Nicu Sebe | 4908 | |
125 | 15:20 | Textured Neural Avatars | Aliaksandra Shysheya; Egor Zakharov; Kara-Ali Aliev; Renat Bashirov; Egor Burkov; Karim Iskakov; Aleksei Ivakhnenko; Yury Malkov; Igor Pasechnik; Dmitry Ulyanov; Alexander Vakhitov; Victor Lempitsky | 5428 | |
126 | 15:20 | IM-Net for High Resolution Video Frame Interpolation | Tomer Peleg; Pablo Szekely; Doron Sabo; Omry Sendik | 3190 | |
127 | 15:20 | Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation | Ying-Cong Chen; Xiaogang Xu; Zhuotao Tian; Jiaya Jia | 1240 | |
128 | 15:20 | Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation | Hao Tang; Dan Xu; Nicu Sebe; Yanzhi Wang; Jason J. Corso; Yan Yan | 3069 | |
129 | 15:20 | Geometry-Consistent Generative Adversarial Networks for One-Sided Unsupervised Domain Mapping | Huan Fu; Mingming Gong; Chaohui Wang; Kayhan Batmanghelich; Kun Zhang; Dacheng Tao | 4341 | |
130 | 15:20 | DeepVoxels: Learning Persistent 3D Feature Embeddings | Vincent Sitzmann; Justus Thies; Felix Heide; Matthias Nießner; Gordon Wetzstein; Michael Zollhöfer | 3521 | |
131 | 15:20 | Inverse Path Tracing for Joint Material and Lighting Estimation | Dejan Azinović; Tzu-Mao Li; Anton Kaplanyan; Matthias Nießner | 5944 | |
132 | 15:20 | The Visual Centrifuge: Model-Free Layered Video Representations | Jean-Baptiste Alayrac; João Carreira; Andrew Zisserman | 4057 | |
133 | 15:20 | Label-Noise Robust Generative Adversarial Networks | Takuhiro Kaneko; Yoshitaka Ushiku; Tatsuya Harada | 5720 | |
134 | 15:20 | DLOW: Domain Flow for Adaptation and Generalization | Rui Gong; Wen Li; Yuhua Chen; Luc Van Gool | 5766 | |
135 | 15:20 | CollaGAN: Collaborative GAN for Missing Image Data Imputation | Dongwook Lee; Junyoung Kim; Won-Jin Moon; Jong Chul Ye | 6970 | |
136 | 15:20 | Spatial Fusion GAN for Image Synthesis | Fangneng Zhan; Hongyuan Zhu; Shijian Lu | 194 | |
137 | 15:20 | Text Guided Person Image Synthesis | Xingran Zhou; Siyu Huang; Bin Li; Yingming Li; Jiachen Li; Zhongfei Zhang | 1347 | |
138 | 15:20 | STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing | Ming Liu; Yukang Ding; Min Xia; Xiao Liu; Errui Ding; Wangmeng Zuo; Shilei Wen | 1439 | |
139 | 15:20 | Towards Instance-Level Image-To-Image Translation | Zhiqiang Shen; Mingyang Huang; Jianping Shi; Xiangyang Xue; Thomas S. Huang | 1453 | |
140 | 15:20 | Dense Intrinsic Appearance Flow for Human Pose Transfer | Yining Li; Chen Huang; Chen Change Loy | 1637 | |
141 | 15:20 | Depth-Aware Video Frame Interpolation | Wenbo Bao; Wei-Sheng Lai; Chao Ma; Xiaoyun Zhang; Zhiyong Gao; Ming-Hsuan Yang | 1769 | |
142 | 15:20 | Sliced Wasserstein Generative Models | Jiqing Wu; Zhiwu Huang; Dinesh Acharya; Wen Li; Janine Thoma; Danda Pani Paudel; Luc Van Gool | 1891 | |
143 | 15:20 | Deep Flow-Guided Video Inpainting | Rui Xu; Xiaoxiao Li; Bolei Zhou; Chen Change Loy | 1892 | |
144 | 15:20 | Video Generation From Single Semantic Label Map | Junting Pan; Chengyu Wang; Xu Jia; Jing Shao; Lu Sheng; Junjie Yan; Xiaogang Wang | 1959 | |
Computational Photography & Graphics | 145 | 15:20 | Polarimetric Camera Calibration Using an LCD Monitor | Zhixiang Wang; Yinqiang Zheng; Yung-Yu Chuang | 50 |
146 | 15:20 | Fully Automatic Video Colorization With Self-Regularization and Diversity | Chenyang Lei; Qifeng Chen | 1399 | |
147 | 15:20 | Zoom to Learn, Learn to Zoom | Xuaner Zhang; Qifeng Chen; Ren Ng; Vladlen Koltun | 1597 | |
148 | 15:20 | Single Image Reflection Removal Beyond Linearity | Qiang Wen; Yinjie Tan; Jing Qin; Wenxi Liu; Guoqiang Han; Shengfeng He | 1787 | |
149 | 15:20 | Learning to Separate Multiple Illuminants in a Single Image | Zhuo Hui; Ayan Chakrabarti; Kalyan Sunkavalli; Aswin C. Sankaranarayanan | 1931 | |
150 | 15:20 | Shape Unicode: A Unified Shape Representation | Sanjeev Muralikrishnan; Vladimir G. Kim; Matthew Fisher; Siddhartha Chaudhuri | 2050 | |
151 | 15:20 | Robust Video Stabilization by Optimization in CNN Weight Space | Jiyang Yu; Ravi Ramamoorthi | 2119 | |
Low-Level & Optimization | 152 | 15:20 | Learning Linear Transformations for Fast Image and Video Style Transfer | Xueting Li; Sifei Liu; Jan Kautz; Ming-Hsuan Yang | 99 |
153 | 15:20 | Local Detection of Stereo Occlusion Boundaries | Jialiang Wang; Todd Zickler | 1370 | |
154 | 15:20 | Bi-Directional Cascade Network for Perceptual Edge Detection | Jianzhong He; Shiliang Zhang; Ming Yang; Yanhu Shan; Tiejun Huang | 1532 | |
155 | 15:20 | Single Image Deraining: A Comprehensive Benchmark Analysis | Siyuan Li; Iago Breno Araujo; Wenqi Ren; Zhangyang Wang; Eric K. Tokuda; Roberto Hirata Junior; Roberto Cesar-Junior; Jiawan Zhang; Xiaojie Guo; Xiaochun Cao | 1554 | |
156 | 15:20 | Dynamic Scene Deblurring With Parameter Selective Sharing and Nested Skip Connections | Hongyun Gao; Xin Tao; Xiaoyong Shen; Jiaya Jia | 1581 | |
157 | 15:20 | Events-To-Video: Bringing Modern Computer Vision to Event Cameras | Henri Rebecq; René Ranftl; Vladlen Koltun; Davide Scaramuzza | 1595 | |
158 | 15:20 | Feedback Network for Image Super-Resolution | Zhen Li; Jinglei Yang; Zheng Liu; Xiaomin Yang; Gwanggil Jeon; Wei Wu | 1648 | |
159 | 15:20 | Semi-Supervised Transfer Learning for Image Rain Removal | Wei Wei; Deyu Meng; Qian Zhao; Zongben Xu; Ying Wu | 1671 | |
160 | 15:20 | EventNet: Asynchronous Recursive Event Processing | Yusuke Sekikawa; Kosuke Hara; Hideo Saito | 1710 | |
161 | 15:20 | Recurrent Back-Projection Network for Video Super-Resolution | Muhammad Haris; Gregory Shakhnarovich; Norimichi Ukita | 1927 | |
162 | 15:20 | Cascaded Partial Decoder for Fast and Accurate Salient Object Detection | Zhe Wu; Li Su; Qingming Huang | 1968 | |
163 | 15:20 | A Simple Pooling-Based Design for Real-Time Salient Object Detection | Jiang-Jiang Liu; Qibin Hou; Ming-Ming Cheng; Jiashi Feng; Jianmin Jiang | 2031 | |
164 | 15:20 | Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection | Jia-Xing Zhao; Yang Cao; Deng-Ping Fan; Ming-Ming Cheng; Xuan-Yi Li; Le Zhang | 2128 | |
165 | 15:20 | Progressive Image Deraining Networks: A Better and Simpler Baseline | Dongwei Ren; Wangmeng Zuo; Qinghua Hu; Pengfei Zhu; Deyu Meng | 2140 | |
Scenes & Representation | 166 | 15:20 | d-SNE: Domain Adaptation Using Stochastic Neighborhood Embedding | Xiang Xu; Xiong Zhou; Ragav Venkatesan; Gurumurthy Swaminathan; Orchid Majumder | 6592 |
167 | 15:20 | Taking a Closer Look at Domain Shift: Category-Level Adversaries for Semantics Consistent Domain Adaptation | Yawei Luo; Liang Zheng; Tao Guan; Junqing Yu; Yi Yang | 197 | |
168 | 15:20 | ADVENT: Adversarial Entropy Minimization for Domain Adaptation in Semantic Segmentation | Tuan-Hung Vu; Himalaya Jain; Maxime Bucher; Matthieu Cord; Patrick Pérez | 396 | |
169 | 15:20 | ContextDesc: Local Descriptor Augmentation With Cross-Modality Context | Zixin Luo; Tianwei Shen; Lei Zhou; Jiahui Zhang; Yao Yao; Shiwei Li; Tian Fang; Long Quan | 325 | |
170 | 15:20 | Large-Scale Long-Tailed Recognition in an Open World | Ziwei Liu; Zhongqi Miao; Xiaohang Zhan; Jiayun Wang; Boqing Gong; Stella X. Yu | 556 | |
171 | 15:20 | AET vs. AED: Unsupervised Representation Learning by Auto-Encoding Transformations Rather Than Data | Liheng Zhang; Guo-Jun Qi; Liqiang Wang; Jiebo Luo | 5137 | |
172 | 15:20 | SDC – Stacked Dilated Convolution: A Unified Descriptor Network for Dense Matching Tasks | René Schuster; Oliver Wasenmüller; Christian Unger; Didier Stricker | 576 | |
173 | 15:20 | Learning Correspondence From the Cycle-Consistency of Time | Xiaolong Wang; Allan Jabri; Alexei A. Efros | 2746 | |
174 | 15:20 | AE2-Nets: Autoencoder in Autoencoder Networks | Changqing Zhang; Yeqing Liu; Huazhu Fu | 2131 | |
175 | 15:20 | Mitigating Information Leakage in Image Representations: A Maximum Entropy Approach | Proteek Chandan Roy; Vishnu Naresh Boddeti | 1655 | |
176 | 15:20 | Learning Spatial Common Sense With Geometry-Aware Recurrent Networks | Hsiao-Yu Fish Tung; Ricson Cheng; Katerina Fragkiadaki | 3877 | |
177 | 15:20 | Structured Knowledge Distillation for Semantic Segmentation | Yifan Liu; Ke Chen; Chris Liu; Zengchang Qin; Zhenbo Luo; Jingdong Wang | 3147 | |
178 | 15:20 | Scan2CAD: Learning CAD Model Alignment in RGB-D Scans | Armen Avetisyan; Manuel Dahnert; Angela Dai; Manolis Savva; Angel X. Chang; Matthias Nießner | 977 | |
179 | 15:20 | Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation | Po-Yi Chen; Alexander H. Liu; Yen-Cheng Liu; Yu-Chiang Frank Wang | 2799 | |
180 | 15:20 | Tell Me Where I Am: Object-Level Scene Context Prediction | Xiaotian Qiao; Quanlong Zheng; Ying Cao; Rynson W.H. Lau | 3107 | |
181 | 15:20 | Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation | He Wang; Srinath Sridhar; Jingwei Huang; Julien Valentin; Shuran Song; Leonidas J. Guibas | 1373 | |
182 | 15:20 | Supervised Fitting of Geometric Primitives to 3D Point Clouds | Lingxiao Li; Minhyuk Sung; Anastasia Dubrovina; Li Yi; Leonidas J. Guibas | 2452 | |
183 | 15:20 | Do Better ImageNet Models Transfer Better? | Simon Kornblith; Jonathon Shlens; Quoc V. Le | 4225 | |
184 | 15:20 | GSPN: Generative Shape Proposal Network for 3D Instance Segmentation in Point Cloud | Li Yi; Wang Zhao; He Wang; Minhyuk Sung; Leonidas J. Guibas | 1371 | |
185 | 15:20 | Attentive Relational Networks for Mapping Images to Scene Graphs | Mengshi Qi; Weijian Li; Zhengyuan Yang; Yunhong Wang; Jiebo Luo | 1423 | |
186 | 15:20 | Relational Knowledge Distillation | Wonpyo Park; Dongju Kim; Yan Lu; Minsu Cho | 1500 | |
187 | 15:20 | Compressing Convolutional Neural Networks via Factorized Convolutional Filters | Tuanhui Li; Baoyuan Wu; Yujiu Yang; Yanbo Fan; Yong Zhang; Wei Liu | 1557 | |
188 | 15:20 | On the Intrinsic Dimensionality of Image Representations | Sixue Gong; Vishnu Naresh Boddeti; Anil K. Jain | 1586 | |
189 | 15:20 | Part-Regularized Near-Duplicate Vehicle Re-Identification | Bing He; Jia Li; Yifan Zhao; Yonghong Tian | 1593 | |
190 | 15:20 | Self-Supervised Spatio-Temporal Representation Learning for Videos by Predicting Motion and Appearance Statistics | Jiangliu Wang; Jianbo Jiao; Linchao Bao; Shengfeng He; Yunhui Liu; Wei Liu | 1785 | |
191 | 15:20 | Classification-Reconstruction Learning for Open-Set Recognition | Ryota Yoshihashi; Wen Shao; Rei Kawakami; Shaodi You; Makoto Iida; Takeshi Naemura | 1789 | |
192 | 15:20 | Emotion-Aware Human Attention Prediction | Macario O. Cordel II; Shaojing Fan; Zhiqi Shen; Mohan S. Kankanhalli | 1867 | |
193 | 15:20 | Residual Regression With Semantic Prior for Crowd Counting | Jia Wan; Wenhan Luo; Baoyuan Wu; Antoni B. Chan; Wei Liu | 1875 | |
194 | 15:20 | Context-Reinforced Semantic Segmentation | Yizhou Zhou; Xiaoyan Sun; Zheng-Jun Zha; Wenjun Zeng | 1881 | |
195 | 15:20 | Adversarial Structure Matching for Structured Prediction Tasks | Jyh-Jing Hwang; Tsung-Wei Ke; Jianbo Shi; Stella X. Yu | 1929 | |
196 | 15:20 | Deep Spectral Clustering Using Dual Autoencoder Network | Xu Yang; Cheng Deng; Feng Zheng; Junchi Yan; Wei Liu | 1981 | |
197 | 15:20 | Deep Asymmetric Metric Learning via Rich Relationship Mining | Xinyi Xu; Yanhua Yang; Cheng Deng; Feng Zheng | 1990 | |
198 | 15:20 | Did It Change? Learning to Detect Point-Of-Interest Changes for Proactive Map Updates | Jérôme Revaud; Minhyeok Heo; Rafael S. Rezende; Chanmi You; Seong-Gyun Jeong | 2011 | |
199 | 15:20 | Associatively Segmenting Instances and Semantics in Point Clouds | Xinlong Wang; Shu Liu; Xiaoyong Shen; Chunhua Shen; Jiaya Jia | 2017 | |
200 | 15:20 | Pattern-Affinitive Propagation Across Depth, Surface Normal and Semantic Segmentation | Zhenyu Zhang; Zhen Cui; Chunyan Xu; Yan Yan; Nicu Sebe; Jian Yang | 2027 | |
201 | 15:20 | Scene Categorization From Contours: Medial Axis Based Salience Measures | Morteza Rezanejad; Gabriel Downs; John Wilder; Dirk B. Walther; Allan Jepson; Sven Dickinson; Kaleem Siddiqi | 2048 | |
Language & Reasoning | 202 | 15:20 | Unsupervised Image Captioning | Yang Feng; Lin Ma; Wei Liu; Jiebo Luo | 473 |
203 | 15:20 | Exact Adversarial Attack to Image Captioning via Structured Output Learning With Latent Variables | Yan Xu; Baoyuan Wu; Fumin Shen; Yanbo Fan; Yong Zhang; Heng Tao Shen; Wei Liu | 1560 | |
204 | 15:20 | Cross-Modal Relationship Inference for Grounding Referring Expressions | Sibei Yang; Guanbin Li; Yizhou Yu | 1735 | |
205 | 15:20 | What's to Know? Uncertainty as a Guide to Asking Goal-Oriented Questions | Ehsan Abbasnejad; Qi Wu; Qinfeng Shi; Anton van den Hengel | 1771 | |
206 | 15:20 | Iterative Alignment Network for Continuous Sign Language Recognition | Junfu Pu; Wengang Zhou; Houqiang Li | 1777 | |
207 | 15:20 | Neural Sequential Phrase Grounding (SeqGROUND) | Pelin Dogan; Leonid Sigal; Markus Gross | 1903 | |
208 | 15:20 | CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions | Runtao Liu; Chenxi Liu; Yutong Bai; Alan L. Yuille | 1938 | |
209 | 15:20 | Describing Like Humans: On Diversity in Image Captioning | Qingzhong Wang; Antoni B. Chan | 1964 | |
210 | 15:20 | MSCap: Multi-Style Image Captioning With Unpaired Stylized Text | Longteng Guo; Jing Liu; Peng Yao; Jiangwei Li; Hanqing Lu | 2078 | |
Applications, Medical, & Robotics | 211 | 15:20 | CRAVES: Controlling Robotic Arm With a Vision-Based Economic System | Yiming Zuo; Weichao Qiu; Lingxi Xie; Fangwei Zhong; Yizhou Wang; Alan L. Yuille | 1381 |
212 | 15:20 | Networks for Joint Affine and Non-Parametric Image Registration | Zhengyang Shen; Xu Han; Zhenlin Xu; Marc Niethammer | 1461 | |
213 | 15:20 | Learning Shape-Aware Embedding for Scene Text Detection | Zhuotao Tian; Michelle Shu; Pengyuan Lyu; Ruiyu Li; Chao Zhou; Xiaoyong Shen; Jiaya Jia | 1502 | |
214 | 15:20 | Learning to Film From Professional Human Motion Videos | Chong Huang; Chuan-En Lin; Zhenyu Yang; Yan Kong; Peng Chen; Xin Yang; Kwang-Ting Cheng | 1687 | |
215 | 15:20 | Pay Attention! - Robustifying a Deep Visuomotor Policy Through Task-Focused Visual Attention | Pooya Abolghasemi; Amir Mazaheri; Mubarak Shah; Ladislau Bölöni | 1920 | |
216 | 15:20 | Deep Blind Video Decaptioning by Temporal Aggregation and Recurrence | Dahun Kim; Sanghyun Woo; Joon-Young Lee; In So Kweon | 2089 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 08:30 | Learning Video Representations From Correspondence Proposals | Xingyu Liu; Joon-Young Lee; Hailin Jin | 1213 |
2 | 08:35 | SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks | Bo Li; Wei Wu; Qiang Wang; Fangyi Zhang; Junliang Xing; Junjie Yan | 1503 | |
3 | 08:40 | Sphere Generative Adversarial Network Based on Geometric Moment Matching | Sung Woo Park; Junseok Kwon | 2556 | |
4 | 08:48 | Adversarial Attacks Beyond the Image Space | Xiaohui Zeng; Chenxi Liu; Yu-Siang Wang; Weichao Qiu; Lingxi Xie; Yu-Wing Tai; Chi-Keung Tang; Alan L. Yuille | 1431 | |
5 | 08:53 | Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks | Yinpeng Dong; Tianyu Pang; Hang Su; Jun Zhu | 5297 | |
6 | 08:58 | Decoupling Direction and Norm for Efficient Gradient-Based L2 Adversarial Attacks and Defenses | Jérôme Rony; Luiz G. Hafemann; Luiz S. Oliveira; Ismail Ben Ayed; Robert Sabourin; Eric Granger | 6129 | |
7 | 09:06 | A General and Adaptive Robust Loss Function | Jonathan T. Barron | 1472 | |
8 | 09:11 | Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration | Yang He; Ping Liu; Ziwei Wang; Zhilan Hu; Yi Yang | 2677 | |
9 | 09:16 | Learning to Quantize Deep Networks by Optimizing Quantization Intervals With Task Loss | Sangil Jung; Changyong Son; Seohyung Lee; Jinwoo Son; Jae-Joon Han; Youngjun Kwak; Sung Ju Hwang; Changkyu Choi | 4595 | |
10 | 09:24 | Not All Areas Are Equal: Transfer Learning for Semantic Segmentation via Hierarchical Region Selection | Ruoqi Sun; Xinge Zhu; Chongruo Wu; Chen Huang; Jianping Shi; Lizhuang Ma | 1773 | |
11 | 09:29 | Unsupervised Learning of Dense Shape Correspondence | Oshri Halimi; Or Litany; Emanuele Rodolà; Alex M. Bronstein; Ron Kimmel | 3740 | |
12 | 09:34 | Unsupervised Visual Domain Adaptation: A Deep Max-Margin Gaussian Process Approach | Minyoung Kim; Pritish Sahu; Behnam Gholami; Vladimir Pavlovic | 4645 | |
13 | 09:42 | Balanced Self-Paced Learning for Generative Adversarial Clustering Network | Kamran Ghasedi; Xiaoqian Wang; Cheng Deng; Heng Huang | 2726 | |
14 | 09:47 | A Style-Based Generator Architecture for Generative Adversarial Networks | Tero Karras; Samuli Laine; Timo Aila | 2860 | |
15 | 09:52 | Parallel Optimal Transport GAN | Gil Avraham; Yan Zuo; Tom Drummond | 5426 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
3D Single View & RGBD | 106 | 08:30 | 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans | Ji Hou; Angela Dai; Matthias Nießner | 1170 |
107 | 08:35 | Causes and Corrections for Bimodal Multi-Path Scanning With Structured Light | Yu Zhang; Daniel L. Lau; Ying Yu | 4854 | |
108 | 08:40 | TextureNet: Consistent Local Parametrizations for Learning From High-Resolution Signals on Meshes | Jingwei Huang; Haotian Zhang; Li Yi; Thomas Funkhouser; Matthias Nießner; Leonidas J. Guibas | 7048 | |
109 | 08:48 | PlaneRCNN: 3D Plane Detection and Reconstruction From a Single Image | Chen Liu; Kihwan Kim; Jinwei Gu; Yasutaka Furukawa; Jan Kautz | 704 | |
110 | 08:53 | Occupancy Networks: Learning 3D Reconstruction in Function Space | Lars Mescheder; Michael Oechsle; Michael Niemeyer; Sebastian Nowozin; Andreas Geiger | 3976 | |
111 | 08:58 | 3D Shape Reconstruction From Images in the Frequency Domain | Weichao Shen; Yunde Jia; Yuwei Wu | 2575 | |
112 | 09:06 | SiCloPe: Silhouette-Based Clothed People | Ryota Natsume; Shunsuke Saito; Zeng Huang; Weikai Chen; Chongyang Ma; Hao Li; Shigeo Morishima | 1456 | |
113 | 09:11 | Detailed Human Shape Estimation From a Single Image by Hierarchical Mesh Deformation | Hao Zhu; Xinxin Zuo; Sen Wang; Xun Cao; Ruigang Yang | 3102 | |
114 | 09:16 | Convolutional Mesh Regression for Single-Image Human Shape Reconstruction | Nikos Kolotouros; Georgios Pavlakos; Kostas Daniilidis | 4841 | |
115 | 09:24 | H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions | Bugra Tekin; Federica Bogo; Marc Pollefeys | 2754 | |
116 | 09:29 | Learning the Depths of Moving People by Watching Frozen People | Zhengqi Li; Tali Dekel; Forrester Cole; Richard Tucker; Noah Snavely; Ce Liu; William T. Freeman | 3419 | |
117 | 09:34 | Extreme Relative Pose Estimation for RGB-D Scans via Scene Completion | Zhenpei Yang; Jeffrey Z. Pan; Linjie Luo; Xiaowei Zhou; Kristen Grauman; Qixing Huang | 3439 | |
118 | 09:42 | A Skeleton-Bridged Deep Learning Approach for Generating Meshes of Complex Topologies From Single RGB Images | Jiapeng Tang; Xiaoguang Han; Junyi Pan; Kui Jia; Xin Tong | 1943 | |
119 | 09:47 | Learning Structure-And-Motion-Aware Rolling Shutter Correction | Bingbing Zhuang; Quoc-Huy Tran; Pan Ji; Loong-Fah Cheong; Manmohan Chandraker | 3451 | |
120 | 09:52 | PVNet: Pixel-Wise Voting Network for 6DoF Pose Estimation | Sida Peng; Yuan Liu; Qixing Huang; Xiaowei Zhou; Hujun Bao | 3871 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Motion & Biometrics | 135 | 08:30 | SelFlow: Self-Supervised Learning of Optical Flow | Pengpeng Liu; Michael Lyu; Irwin King; Jia Xu | 236 |
136 | 08:35 | Taking a Deeper Look at the Inverse Compositional Algorithm | Zhaoyang Lv; Frank Dellaert; James M. Rehg; Andreas Geiger | 3963 | |
137 | 08:40 | Deeper and Wider Siamese Networks for Real-Time Visual Tracking | Zhipeng Zhang; Houwen Peng | 1197 | |
138 | 08:48 | Self-Supervised Adaptation of High-Fidelity Face Models for Monocular Performance Tracking | Jae Shin Yoon; Takaaki Shiratori; Shoou-I Yu; Hyun Soo Park | 952 | |
139 | 08:53 | Diverse Generation for Multi-Agent Sports Games | Raymond A. Yeh; Alexander G. Schwing; Jonathan Huang; Kevin Murphy | 2738 | |
140 | 08:58 | Efficient Online Multi-Person 2D Pose Tracking With Recurrent Spatio-Temporal Affinity Fields | Yaadhav Raaj; Haroon Idrees; Gines Hidalgo; Yaser Sheikh | 3444 | |
141 | 09:06 | GFrames: Gradient-Based Local Reference Frame for 3D Shape Matching | Simone Melzi; Riccardo Spezialetti; Federico Tombari; Michael M. Bronstein; Luigi Di Stefano; Emanuele Rodolà | 1391 | |
142 | 09:11 | Eliminating Exposure Bias and Metric Mismatch in Multiple Object Tracking | Andrii Maksai; Pascal Fua | 6191 | |
143 | 09:16 | Graph Convolutional Tracking | Junyu Gao; Tianzhu Zhang; Changsheng Xu | 3119 | |
144 | 09:24 | ATOM: Accurate Tracking by Overlap Maximization | Martin Danelljan; Goutam Bhat; Fahad Shahbaz Khan; Michael Felsberg | 4984 | |
145 | 09:29 | Visual Tracking via Adaptive Spatially-Regularized Correlation Filters | Kenan Dai; Dong Wang; Huchuan Lu; Chong Sun; Jianhua Li | 1202 | |
146 | 09:34 | Deep Tree Learning for Zero-Shot Face Anti-Spoofing | Yaojie Liu; Joel Stehouwer; Amin Jourabloo; Xiaoming Liu | 496 | |
147 | 09:42 | ArcFace: Additive Angular Margin Loss for Deep Face Recognition | Jiankang Deng; Jia Guo; Niannan Xue; Stefanos Zafeiriou | 1140 | |
148 | 09:47 | Learning Joint Gait Representation via Quintuplet Loss Minimization | Kaihao Zhang; Wenhan Luo; Lin Ma; Wei Liu; Hongdong Li | 1617 | |
149 | 09:52 | Gait Recognition via Disentangled Representation Learning | Ziyuan Zhang; Luan Tran; Xi Yin; Yousef Atoum; Xiaoming Liu; Jian Wan; Nanxin Wang | 4898 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 10:00 | Learning Video Representations From Correspondence Proposals | Xingyu Liu; Joon-Young Lee; Hailin Jin | 1213 |
2 | 10:00 | SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks | Bo Li; Wei Wu; Qiang Wang; Fangyi Zhang; Junliang Xing; Junjie Yan | 1503 | |
3 | 10:00 | Sphere Generative Adversarial Network Based on Geometric Moment Matching | Sung Woo Park; Junseok Kwon | 2556 | |
4 | 10:00 | Adversarial Attacks Beyond the Image Space | Xiaohui Zeng; Chenxi Liu; Yu-Siang Wang; Weichao Qiu; Lingxi Xie; Yu-Wing Tai; Chi-Keung Tang; Alan L. Yuille | 1431 | |
5 | 10:00 | Evading Defenses to Transferable Adversarial Examples by Translation-Invariant Attacks | Yinpeng Dong; Tianyu Pang; Hang Su; Jun Zhu | 5297 | |
6 | 10:00 | Decoupling Direction and Norm for Efficient Gradient-Based L2 Adversarial Attacks and Defenses | Jérôme Rony; Luiz G. Hafemann; Luiz S. Oliveira; Ismail Ben Ayed; Robert Sabourin; Eric Granger | 6129 | |
7 | 10:00 | A General and Adaptive Robust Loss Function | Jonathan T. Barron | 1472 | |
8 | 10:00 | Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration | Yang He; Ping Liu; Ziwei Wang; Zhilan Hu; Yi Yang | 2677 | |
9 | 10:00 | Learning to Quantize Deep Networks by Optimizing Quantization Intervals With Task Loss | Sangil Jung; Changyong Son; Seohyung Lee; Jinwoo Son; Jae-Joon Han; Youngjun Kwak; Sung Ju Hwang; Changkyu Choi | 4595 | |
10 | 10:00 | Not All Areas Are Equal: Transfer Learning for Semantic Segmentation via Hierarchical Region Selection | Ruoqi Sun; Xinge Zhu; Chongruo Wu; Chen Huang; Jianping Shi; Lizhuang Ma | 1773 | |
11 | 10:00 | Unsupervised Learning of Dense Shape Correspondence | Oshri Halimi; Or Litany; Emanuele Rodolà; Alex M. Bronstein; Ron Kimmel | 3740 | |
12 | 10:00 | Unsupervised Visual Domain Adaptation: A Deep Max-Margin Gaussian Process Approach | Minyoung Kim; Pritish Sahu; Behnam Gholami; Vladimir Pavlovic | 4645 | |
13 | 10:00 | Balanced Self-Paced Learning for Generative Adversarial Clustering Network | Kamran Ghasedi; Xiaoqian Wang; Cheng Deng; Heng Huang | 2726 | |
14 | 10:00 | A Style-Based Generator Architecture for Generative Adversarial Networks | Tero Karras; Samuli Laine; Timo Aila | 2860 | |
15 | 10:00 | Parallel Optimal Transport GAN | Gil Avraham; Yan Zuo; Tom Drummond | 5426 | |
16 | 10:00 | Reversible GANs for Memory-Efficient Image-To-Image Translation | Tycho F.A. van der Ouderaa; Daniel E. Worrall | 2292 | |
17 | 10:00 | Sensitive-Sample Fingerprinting of Deep Neural Networks | Zecheng He; Tianwei Zhang; Ruby Lee | 2306 | |
18 | 10:00 | Soft Labels for Ordinal Regression | Raúl Díaz; Amit Marathe | 2320 | |
19 | 10:00 | Local to Global Learning: Gradually Adding Classes for Training Deep Neural Networks | Hao Cheng; Dongze Lian; Bowen Deng; Shenghua Gao; Tao Tan; Yanlin Geng | 2377 | |
20 | 10:00 | What Does It Mean to Learn in Deep Networks? And, How Does One Detect Adversarial Attacks? | Ciprian A. Corneanu; Meysam Madadi; Sergio Escalera; Aleix M. Martinez | 2447 | |
21 | 10:00 | Handwriting Recognition in Low-Resource Scripts Using Adversarial Learning | Ayan Kumar Bhunia; Abhirup Das; Ankan Kumar Bhunia; Perla Sai Raj Kishore; Partha Pratim Roy | 2459 | |
22 | 10:00 | Adversarial Defense Through Network Profiling Based Path Extraction | Yuxian Qiu; Jingwen Leng; Cong Guo; Quan Chen; Chao Li; Minyi Guo; Yuhao Zhu | 2466 | |
23 | 10:00 | RENAS: Reinforced Evolutionary Neural Architecture Search | Yukang Chen; Gaofeng Meng; Qian Zhang; Shiming Xiang; Chang Huang; Lisen Mu; Xinggang Wang | 2494 | |
24 | 10:00 | Co-Occurrence Neural Network | Irina Shevlev; Shai Avidan | 2537 | |
25 | 10:00 | SpotTune: Transfer Learning Through Adaptive Fine-Tuning | Yunhui Guo; Honghui Shi; Abhishek Kumar; Kristen Grauman; Tajana Rosing; Rogerio Feris | 2557 | |
26 | 10:00 | Signal-To-Noise Ratio: A Robust Distance Metric for Deep Metric Learning | Tongtong Yuan; Weihong Deng; Jian Tang; Yinan Tang; Binghui Chen | 2562 | |
27 | 10:00 | Detection Based Defense Against Adversarial Examples From the Steganalysis Point of View | Jiayang Liu; Weiming Zhang; Yiwei Zhang; Dongdong Hou; Yujia Liu; Hongyue Zha; Nenghai Yu | 2888 | |
28 | 10:00 | HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs | Pravendra Singh; Vinay Kumar Verma; Piyush Rai; Vinay P. Namboodiri | 2927 | |
29 | 10:00 | Strike (With) a Pose: Neural Networks Are Easily Fooled by Strange Poses of Familiar Objects | Michael A. Alcorn; Qi Li; Zhitao Gong; Chengfei Wang; Long Mai; Wei-Shinn Ku; Anh Nguyen | 2951 | |
30 | 10:00 | Blind Geometric Distortion Correction on Images Through Deep Learning | Xiaoyu Li; Bo Zhang; Pedro V. Sander; Jing Liao | 2996 | |
31 | 10:00 | Instance-Level Meta Normalization | Songhao Jia; Ding-Jie Chen; Hwann-Tzong Chen | 3013 | |
32 | 10:00 | Iterative Normalization: Beyond Standardization Towards Efficient Whitening | Lei Huang; Yi Zhou; Fan Zhu; Li Liu; Ling Shao | 3025 | |
33 | 10:00 | On Learning Density Aware Embeddings | Soumyadeep Ghosh; Richa Singh; Mayank Vatsa | 3042 | |
34 | 10:00 | Contrastive Adaptation Network for Unsupervised Domain Adaptation | Guoliang Kang; Lu Jiang; Yi Yang; Alexander G. Hauptmann | 3083 | |
35 | 10:00 | LP-3DCNN: Unveiling Local Phase in 3D Convolutional Neural Networks | Sudhakar Kumawat; Shanmuganathan Raman | 3110 | |
36 | 10:00 | Attribute-Driven Feature Disentangling and Temporal Aggregation for Video Person Re-Identification | Yiru Zhao; Xu Shen; Zhongming Jin; Hongtao Lu; Xian-sheng Hua | 3230 | |
37 | 10:00 | Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit? | Shilin Zhu; Xin Dong; Hao Su | 3255 | |
38 | 10:00 | Distilling Object Detectors With Fine-Grained Feature Imitation | Tao Wang; Li Yuan; Xiaopeng Zhang; Jiashi Feng | 3287 | |
39 | 10:00 | Centripetal SGD for Pruning Very Deep Convolutional Networks With Complicated Structure | Xiaohan Ding; Guiguang Ding; Yuchen Guo; Jungong Han | 3323 | |
40 | 10:00 | Knockoff Nets: Stealing Functionality of Black-Box Models | Tribhuvanesh Orekondy; Bernt Schiele; Mario Fritz | 3324 | |
Recognition | 41 | 10:00 | Deep Embedding Learning With Discriminative Sampling Policy | Yueqi Duan; Lei Chen; Jiwen Lu; Jie Zhou | 392 |
42 | 10:00 | Hybrid Task Cascade for Instance Segmentation | Kai Chen; Jiangmiao Pang; Jiaqi Wang; Yu Xiong; Xiaoxiao Li; Shuyang Sun; Wansen Feng; Ziwei Liu; Jianping Shi; Wanli Ouyang; Chen Change Loy; Dahua Lin | 1714 | |
43 | 10:00 | Multi-Task Self-Supervised Object Detection via Recycling of Bounding Box Annotations | Wonhee Lee; Joonil Na; Gunhee Kim | 2219 | |
44 | 10:00 | ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis | Chao Chen; Guanbin Li; Ruijia Xu; Tianshui Chen; Meng Wang; Liang Lin | 2293 | |
45 | 10:00 | Learning to Learn Relation for Important People Detection in Still Images | Wei-Hong Li; Fa-Ting Hong; Wei-Shi Zheng | 2305 | |
46 | 10:00 | Looking for the Devil in the Details: Learning Trilinear Attention Sampling Network for Fine-Grained Image Recognition | Heliang Zheng; Jianlong Fu; Zheng-Jun Zha; Jiebo Luo | 2330 | |
47 | 10:00 | Multi-Similarity Loss With General Pair Weighting for Deep Metric Learning | Xun Wang; Xintong Han; Weilin Huang; Dengke Dong; Matthew R. Scott | 2333 | |
48 | 10:00 | Domain-Symmetric Networks for Adversarial Domain Adaptation | Yabin Zhang; Hui Tang; Kui Jia; Mingkui Tan | 2454 | |
49 | 10:00 | End-To-End Supervised Product Quantization for Image Search and Retrieval | Benjamin Klein; Lior Wolf | 2506 | |
50 | 10:00 | Learning to Learn From Noisy Labeled Data | Junnan Li; Yongkang Wong; Qi Zhao; Mohan S. Kankanhalli | 2512 | |
51 | 10:00 | DSFD: Dual Shot Face Detector | Jian Li; Yabiao Wang; Changan Wang; Ying Tai; Jianjun Qian; Jian Yang; Chengjie Wang; Jilin Li; Feiyue Huang | 2527 | |
52 | 10:00 | Label Propagation for Deep Semi-Supervised Learning | Ahmet Iscen; Giorgos Tolias; Yannis Avrithis; Ondřej Chum | 2572 | |
53 | 10:00 | Deep Global Generalized Gaussian Networks | Qilong Wang; Peihua Li; Qinghua Hu; Pengfei Zhu; Wangmeng Zuo | 2649 | |
54 | 10:00 | Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-Based Image Retrieval | Anjan Dutta; Zeynep Akata | 2659 | |
55 | 10:00 | Context-Aware Crowd Counting | Weizhe Liu; Mathieu Salzmann; Pascal Fua | 2674 | |
56 | 10:00 | Detect-To-Retrieve: Efficient Regional Aggregation for Image Search | Marvin Teichmann; André Araujo; Menglong Zhu; Jack Sim | 2758 | |
57 | 10:00 | Towards Accurate One-Stage Object Detection With AP-Loss | Kean Chen; Jianguo Li; Weiyao Lin; John See; Ji Wang; Lingyu Duan; Zhibo Chen; Changwei He; Junni Zou | 2818 | |
58 | 10:00 | On Exploring Undetermined Relationships for Visual Relationship Detection | Yibing Zhan; Jun Yu; Ting Yu; Dacheng Tao | 2856 | |
59 | 10:00 | Learning Without Memorizing | Prithviraj Dhar; Rajat Vikram Singh; Kuan-Chuan Peng; Ziyan Wu; Rama Chellappa | 2905 | |
60 | 10:00 | Dynamic Recursive Neural Network | Qiushan Guo; Zhipeng Yu; Yichao Wu; Ding Liang; Haoyu Qin; Junjie Yan | 2980 | |
61 | 10:00 | Destruction and Construction Learning for Fine-Grained Image Recognition | Yue Chen; Yalong Bai; Wei Zhang; Tao Mei | 2992 | |
62 | 10:00 | Distraction-Aware Shadow Detection | Quanlong Zheng; Xiaotian Qiao; Ying Cao; Rynson W.H. Lau | 3109 | |
63 | 10:00 | Multi-Label Image Recognition With Graph Convolutional Networks | Zhao-Min Chen; Xiu-Shen Wei; Peng Wang; Yanwen Guo | 3140 | |
64 | 10:00 | High-Level Semantic Feature Detection: A New Perspective for Pedestrian Detection | Wei Liu; Shengcai Liao; Weiqiang Ren; Weidong Hu; Yinan Yu | 3171 | |
65 | 10:00 | RepMet: Representative-Based Metric Learning for Classification and Few-Shot Object Detection | Leonid Karlinsky; Joseph Shtok; Sivan Harary; Eli Schwartz; Amit Aides; Rogerio Feris; Raja Giryes; Alex M. Bronstein | 3199 | |
66 | 10:00 | Ranked List Loss for Deep Metric Learning | Xinshao Wang; Yang Hua; Elyor Kodirov; Guosheng Hu; Romain Garnier; Neil M. Robertson | 3211 | |
67 | 10:00 | CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning | Chi Zhang; Guosheng Lin; Fayao Liu; Rui Yao; Chunhua Shen | 3315 | |
68 | 10:00 | Precise Detection in Densely Packed Scenes | Eran Goldman; Roei Herzig; Aviv Eisenschtat; Jacob Goldberger; Tal Hassner | 5953 | |
Segmentation, Grouping, & Shape | 69 | 10:00 | KE-GAN: Knowledge Embedded Generative Adversarial Networks for Semi-Supervised Scene Parsing | Mengshi Qi; Yunhong Wang; Jie Qin; Annan Li | 2188 |
70 | 10:00 | Fast User-Guided Video Object Segmentation by Interaction-And-Propagation Networks | Seoung Wug Oh; Joon-Young Lee; Ning Xu; Seon Joo Kim | 2338 | |
71 | 10:00 | Fast Interactive Object Annotation With Curve-GCN | Huan Ling; Jun Gao; Amlan Kar; Wenzheng Chen; Sanja Fidler | 2589 | |
72 | 10:00 | FickleNet: Weakly and Semi-Supervised Semantic Image Segmentation Using Stochastic Inference | Jungbeom Lee; Eunji Kim; Sungmin Lee; Jangho Lee; Sungroh Yoon | 2658 | |
73 | 10:00 | RVOS: End-To-End Recurrent Network for Video Object Segmentation | Carles Ventura; Miriam Bellver; Andreu Girbau; Amaia Salvador; Ferran Marques; Xavier Giro-i-Nieto | 2722 | |
74 | 10:00 | DeepFlux for Skeletons in the Wild | Yukang Wang; Yongchao Xu; Stavros Tsogkas; Xiang Bai; Sven Dickinson; Kaleem Siddiqi | 2878 | |
75 | 10:00 | Interactive Image Segmentation via Backpropagating Refinement Scheme | Won-Dong Jang; Chang-Su Kim | 3244 | |
76 | 10:00 | Scene Parsing via Integrated Classification Model and Variance-Based Regularization | Hengcan Shi; Hongliang Li; Qingbo Wu; Zichen Song | 3253 | |
Statistics, Physics, Theory, & Datasets | 77 | 10:00 | RAVEN: A Dataset for Relational and Analogical Visual REasoNing | Chi Zhang; Feng Gao; Baoxiong Jia; Yixin Zhu; Song-Chun Zhu | 2208 |
78 | 10:00 | Surface Reconstruction From Normals: A Robust DGP-Based Discontinuity Preservation Approach | Wuyuan Xie; Miaohui Wang; Mingqiang Wei; Jianmin Jiang; Jing Qin | 2212 | |
79 | 10:00 | DeepFashion2: A Versatile Benchmark for Detection, Pose Estimation, Segmentation and Re-Identification of Clothing Images | Yuying Ge; Ruimao Zhang; Xiaogang Wang; Xiaoou Tang; Ping Luo | 2273 | |
80 | 10:00 | Jumping Manifolds: Geometry Aware Dense Non-Rigid Structure From Motion | Suryansh Kumar | 2278 | |
81 | 10:00 | LVIS: A Dataset for Large Vocabulary Instance Segmentation | Agrim Gupta; Piotr Dollár; Ross Girshick | 2328 | |
82 | 10:00 | Fast Object Class Labelling via Speech | Michael Gygli; Vittorio Ferrari | 2392 | |
83 | 10:00 | LaSOT: A High-Quality Benchmark for Large-Scale Single Object Tracking | Heng Fan; Liting Lin; Fan Yang; Peng Chu; Ge Deng; Sijia Yu; Hexin Bai; Yong Xu; Chunyuan Liao; Haibin Ling | 2421 | |
84 | 10:00 | Creative Flow+ Dataset | Maria Shugrina; Ziheng Liang; Amlan Kar; Jiaman Li; Angad Singh; Karan Singh; Sanja Fidler | 2596 | |
85 | 10:00 | Weakly Supervised Open-Set Domain Adaptation by Dual-Domain Collaboration | Shuhan Tan; Jiening Jiao; Wei-Shi Zheng | 2693 | |
86 | 10:00 | A Neurobiological Evaluation Metric for Neural Network Model Search | Nathaniel Blanchard; Jeffery Kinnison; Brandon RichardWebster; Pouya Bashivan; Walter J. Scheirer | 2729 | |
87 | 10:00 | Iterative Projection and Matching: Finding Structure-Preserving Representatives and Its Application to Computer Vision | Alireza Zaeemzadeh; Mohsen Joneidi; Nazanin Rahnavard; Mubarak Shah | 2944 | |
88 | 10:00 | Efficient Multi-Domain Learning by Covariance Normalization | Yunsheng Li; Nuno Vasconcelos | 2971 | |
89 | 10:00 | Predicting Visible Image Differences Under Varying Display Brightness and Viewing Distance | Nanyang Ye; Krzysztof Wolski; Rafał K. Mantiuk | 3056 | |
90 | 10:00 | A Bayesian Perspective on the Deep Image Prior | Zezhou Cheng; Matheus Gadelha; Subhransu Maji; Daniel Sheldon | 3060 | |
91 | 10:00 | ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving | Xibin Song; Peng Wang; Dingfu Zhou; Rui Zhu; Chenye Guan; Yuchao Dai; Hao Su; Hongdong Li; Ruigang Yang | 3193 | |
92 | 10:00 | Compressing Unknown Images With Product Quantizer for Efficient Zero-Shot Classification | Jin Li; Xuguang Lan; Yang Liu; Le Wang; Nanning Zheng | 3223 | |
93 | 10:00 | Self-Supervised Convolutional Subspace Clustering Network | Junjian Zhang; Chun-Guang Li; Chong You; Xianbiao Qi; Honggang Zhang; Jun Guo; Zhouchen Lin | 3260 | |
3D Multiview | 94 | 10:00 | Multi-Scale Geometric Consistency Guided Multi-View Stereo | Qingshan Xu; Wenbing Tao | 2374 |
95 | 10:00 | Privacy Preserving Image-Based Localization | Pablo Speciale; Johannes L. Schönberger; Sing Bing Kang; Sudipta N. Sinha; Marc Pollefeys | 2426 | |
96 | 10:00 | SimulCap : Single-View Human Performance Capture With Cloth Simulation | Tao Yu; Zerong Zheng; Yuan Zhong; Jianhui Zhao; Qionghai Dai; Gerard Pons-Moll; Yebin Liu | 2455 | |
97 | 10:00 | Hierarchical Deep Stereo Matching on High-Resolution Images | Gengshan Yang; Joshua Manela; Michael Happold; Deva Ramanan | 2471 | |
98 | 10:00 | Recurrent MVSNet for High-Resolution Multi-View Stereo Depth Inference | Yao Yao; Zixin Luo; Shiwei Li; Tianwei Shen; Tian Fang; Long Quan | 2491 | |
99 | 10:00 | Synthesizing 3D Shapes From Silhouette Image Collections Using Multi-Projection Generative Adversarial Networks | Xiao Li; Yue Dong; Pieter Peers; Xin Tong | 2503 | |
100 | 10:00 | The Perfect Match: 3D Point Cloud Matching With Smoothed Densities | Zan Gojcic; Caifa Zhou; Jan D. Wegner; Andreas Wieser | 2555 | |
101 | 10:00 | Recurrent Neural Network for (Un-)Supervised Learning of Monocular Video Visual Odometry and Depth | Rui Wang; Stephen M. Pizer; Jan-Michael Frahm | 2635 | |
102 | 10:00 | PointWeb: Enhancing Local Neighborhood Features for Point Cloud Processing | Hengshuang Zhao; Li Jiang; Chi-Wing Fu; Jiaya Jia | 2931 | |
103 | 10:00 | Scan2Mesh: From Unstructured Range Scans to 3D Meshes | Angela Dai; Matthias Nießner | 3061 | |
104 | 10:00 | Unsupervised Domain Adaptation for ToF Data Denoising With Adversarial Learning | Gianluca Agresti; Henrik Schaefer; Piergiorgio Sartor; Pietro Zanuttigh | 3179 | |
105 | 10:00 | Learning Independent Object Motion From Unlabelled Stereoscopic Videos | Zhe Cao; Abhishek Kar; Christian Häne; Jitendra Malik | 3257 | |
3D Single View & RGBD | 106 | 10:00 | 3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans | Ji Hou; Angela Dai; Matthias Nießner | 1170 |
107 | 10:00 | Causes and Corrections for Bimodal Multi-Path Scanning With Structured Light | Yu Zhang; Daniel L. Lau; Ying Yu | 4854 | |
108 | 10:00 | TextureNet: Consistent Local Parametrizations for Learning From High-Resolution Signals on Meshes | Jingwei Huang; Haotian Zhang; Li Yi; Thomas Funkhouser; Matthias Nießner; Leonidas J. Guibas | 7048 | |
109 | 10:00 | PlaneRCNN: 3D Plane Detection and Reconstruction From a Single Image | Chen Liu; Kihwan Kim; Jinwei Gu; Yasutaka Furukawa; Jan Kautz | 704 | |
110 | 10:00 | Occupancy Networks: Learning 3D Reconstruction in Function Space | Lars Mescheder; Michael Oechsle; Michael Niemeyer; Sebastian Nowozin; Andreas Geiger | 3976 | |
111 | 10:00 | 3D Shape Reconstruction From Images in the Frequency Domain | Weichao Shen; Yunde Jia; Yuwei Wu | 2575 | |
112 | 10:00 | SiCloPe: Silhouette-Based Clothed People | Ryota Natsume; Shunsuke Saito; Zeng Huang; Weikai Chen; Chongyang Ma; Hao Li; Shigeo Morishima | 1456 | |
113 | 10:00 | Detailed Human Shape Estimation From a Single Image by Hierarchical Mesh Deformation | Hao Zhu; Xinxin Zuo; Sen Wang; Xun Cao; Ruigang Yang | 3102 | |
114 | 10:00 | Convolutional Mesh Regression for Single-Image Human Shape Reconstruction | Nikos Kolotouros; Georgios Pavlakos; Kostas Daniilidis | 4841 | |
115 | 10:00 | H+O: Unified Egocentric Recognition of 3D Hand-Object Poses and Interactions | Bugra Tekin; Federica Bogo; Marc Pollefeys | 2754 | |
116 | 10:00 | Learning the Depths of Moving People by Watching Frozen People | Zhengqi Li; Tali Dekel; Forrester Cole; Richard Tucker; Noah Snavely; Ce Liu; William T. Freeman | 3419 | |
117 | 10:00 | Extreme Relative Pose Estimation for RGB-D Scans via Scene Completion | Zhenpei Yang; Jeffrey Z. Pan; Linjie Luo; Xiaowei Zhou; Kristen Grauman; Qixing Huang | 3439 | |
118 | 10:00 | A Skeleton-Bridged Deep Learning Approach for Generating Meshes of Complex Topologies From Single RGB Images | Jiapeng Tang; Xiaoguang Han; Junyi Pan; Kui Jia; Xin Tong | 1943 | |
119 | 10:00 | Learning Structure-And-Motion-Aware Rolling Shutter Correction | Bingbing Zhuang; Quoc-Huy Tran; Pan Ji; Loong-Fah Cheong; Manmohan Chandraker | 3451 | |
120 | 10:00 | PVNet: Pixel-Wise Voting Network for 6DoF Pose Estimation | Sida Peng; Yuan Liu; Qixing Huang; Xiaowei Zhou; Hujun Bao | 3871 | |
121 | 10:00 | Learning Single-Image Depth From Videos Using Quality Assessment Networks | Weifeng Chen; Shengyi Qian; Jia Deng | 2423 | |
122 | 10:00 | Learning 3D Human Dynamics From Video | Angjoo Kanazawa; Jason Y. Zhang; Panna Felsen; Jitendra Malik | 2460 | |
123 | 10:00 | Lending Orientation to Neural Networks for Cross-View Geo-Localization | Liu Liu; Hongdong Li | 2993 | |
124 | 10:00 | Visual Localization by Learning Objects-Of-Interest Dense Match Regression | Philippe Weinzaepfel; Gabriela Csurka; Yohann Cabon; Martin Humenberger | 3742 | |
125 | 10:00 | Bilateral Cyclic Constraint and Adaptive Regularization for Unsupervised Monocular Depth Prediction | Alex Wong; Stefano Soatto | 3091 | |
Face & Body | 126 | 10:00 | Face Parsing With RoI Tanh-Warping | Jinpeng Lin; Hao Yang; Dong Chen; Ming Zeng; Fang Wen; Lu Yuan | 2207 |
127 | 10:00 | Multi-Person Articulated Tracking With Spatial and Temporal Embeddings | Sheng Jin; Wentao Liu; Wanli Ouyang; Chen Qian | 2248 | |
128 | 10:00 | Multi-Person Pose Estimation With Enhanced Channel-Wise and Spatial Information | Kai Su; Dongdong Yu; Zhenqi Xu; Xin Geng; Changhu Wang | 2345 | |
129 | 10:00 | A Compact Embedding for Facial Expression Similarity | Raviteja Vemulapalli; Aseem Agarwala | 2958 | |
130 | 10:00 | Deep High-Resolution Representation Learning for Human Pose Estimation | Ke Sun; Bin Xiao; Dong Liu; Jingdong Wang | 3142 | |
131 | 10:00 | Feature Transfer Learning for Face Recognition With Under-Represented Data | Xi Yin; Xiang Yu; Kihyuk Sohn; Xiaoming Liu; Manmohan Chandraker | 3198 | |
132 | 10:00 | Unsupervised 3D Pose Estimation With Geometric Self-Supervision | Ching-Hang Chen; Ambrish Tyagi; Amit Agrawal; Dylan Drover; Rohith MV; Stefan Stojanov; James M. Rehg | 3250 | |
Action & Video | 133 | 10:00 | Peeking Into the Future: Predicting Future Person Activities and Locations in Videos | Junwei Liang; Lu Jiang; Juan Carlos Niebles; Alexander G. Hauptmann; Li Fei-Fei | 2601 |
134 | 10:00 | Re-Identification With Consistent Attentive Siamese Networks | Meng Zheng; Srikrishna Karanam; Ziyan Wu; Richard J. Radke | 2953 | |
Motion & Biometrics | 135 | 10:00 | SelFlow: Self-Supervised Learning of Optical Flow | Pengpeng Liu; Michael Lyu; Irwin King; Jia Xu | 236 |
136 | 10:00 | Taking a Deeper Look at the Inverse Compositional Algorithm | Zhaoyang Lv; Frank Dellaert; James M. Rehg; Andreas Geiger | 3963 | |
137 | 10:00 | Deeper and Wider Siamese Networks for Real-Time Visual Tracking | Zhipeng Zhang; Houwen Peng | 1197 | |
138 | 10:00 | Self-Supervised Adaptation of High-Fidelity Face Models for Monocular Performance Tracking | Jae Shin Yoon; Takaaki Shiratori; Shoou-I Yu; Hyun Soo Park | 952 | |
139 | 10:00 | Diverse Generation for Multi-Agent Sports Games | Raymond A. Yeh; Alexander G. Schwing; Jonathan Huang; Kevin Murphy | 2738 | |
140 | 10:00 | Efficient Online Multi-Person 2D Pose Tracking With Recurrent Spatio-Temporal Affinity Fields | Yaadhav Raaj; Haroon Idrees; Gines Hidalgo; Yaser Sheikh | 3444 | |
141 | 10:00 | GFrames: Gradient-Based Local Reference Frame for 3D Shape Matching | Simone Melzi; Riccardo Spezialetti; Federico Tombari; Michael M. Bronstein; Luigi Di Stefano; Emanuele Rodolà | 1391 | |
142 | 10:00 | Eliminating Exposure Bias and Metric Mismatch in Multiple Object Tracking | Andrii Maksai; Pascal Fua | 6191 | |
143 | 10:00 | Graph Convolutional Tracking | Junyu Gao; Tianzhu Zhang; Changsheng Xu | 3119 | |
144 | 10:00 | ATOM: Accurate Tracking by Overlap Maximization | Martin Danelljan; Goutam Bhat; Fahad Shahbaz Khan; Michael Felsberg | 4984 | |
145 | 10:00 | Visual Tracking via Adaptive Spatially-Regularized Correlation Filters | Kenan Dai; Dong Wang; Huchuan Lu; Chong Sun; Jianhua Li | 1202 | |
146 | 10:00 | Deep Tree Learning for Zero-Shot Face Anti-Spoofing | Yaojie Liu; Joel Stehouwer; Amin Jourabloo; Xiaoming Liu | 496 | |
147 | 10:00 | ArcFace: Additive Angular Margin Loss for Deep Face Recognition | Jiankang Deng; Jia Guo; Niannan Xue; Stefanos Zafeiriou | 1140 | |
148 | 10:00 | Learning Joint Gait Representation via Quintuplet Loss Minimization | Kaihao Zhang; Wenhan Luo; Lin Ma; Wei Liu; Hongdong Li | 1617 | |
149 | 10:00 | Gait Recognition via Disentangled Representation Learning | Ziyuan Zhang; Luan Tran; Xi Yin; Yousef Atoum; Xiaoming Liu; Jian Wan; Nanxin Wang | 4898 | |
150 | 10:00 | On the Continuity of Rotation Representations in Neural Networks | Yi Zhou; Connelly Barnes; Jingwan Lu; Jimei Yang; Hao Li | 2448 | |
151 | 10:00 | Iterative Residual Refinement for Joint Optical Flow and Occlusion Estimation | Junhwa Hur; Stefan Roth | 2597 | |
152 | 10:00 | Inverse Discriminative Networks for Handwritten Signature Verification | Ping Wei; Huan Li; Ping Hu | 2619 | |
153 | 10:00 | Led3D: A Lightweight and Efficient Deep Approach to Recognizing Low-Quality 3D Faces | Guodong Mu; Di Huang; Guosheng Hu; Jia Sun; Yunhong Wang | 2656 | |
154 | 10:00 | ROI Pooled Correlation Filters for Visual Tracking | Yuxuan Sun; Chong Sun; Dong Wang; You He; Huchuan Lu | 2985 | |
Synthesis | 155 | 10:00 | Deep Video Inpainting | Dahun Kim; Sanghyun Woo; Joon-Young Lee; In So Kweon | 1345 |
156 | 10:00 | DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-To-Image Synthesis | Minfeng Zhu; Pingbo Pan; Wei Chen; Yi Yang | 2446 | |
157 | 10:00 | Non-Adversarial Image Synthesis With Generative Latent Nearest Neighbors | Yedid Hoshen; Ke Li; Jitendra Malik | 2525 | |
158 | 10:00 | Mixture Density Generative Adversarial Networks | Hamid Eghbal-zadeh; Werner Zellinger; Gerhard Widmer | 2651 | |
159 | 10:00 | SketchGAN: Joint Sketch Completion and Recognition With Generative Adversarial Network | Fang Liu; Xiaoming Deng; Yu-Kun Lai; Yong-Jin Liu; Cuixia Ma; Hongan Wang | 2669 | |
160 | 10:00 | Foreground-Aware Image Inpainting | Wei Xiong; Jiahui Yu; Zhe Lin; Jimei Yang; Xin Lu; Connelly Barnes; Jiebo Luo | 2672 | |
161 | 10:00 | Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-To-Image Translation | Matteo Tomei; Marcella Cornia; Lorenzo Baraldi; Rita Cucchiara | 2711 | |
162 | 10:00 | Structure-Preserving Stereoscopic View Synthesis With Multi-Scale Adversarial Correlation Matching | Yu Zhang; Dongqing Zou; Jimmy S. Ren; Zhe Jiang; Xiaohao Chen | 2786 | |
163 | 10:00 | DynTypo: Example-Based Dynamic Text Effects Transfer | Yifang Men; Zhouhui Lian; Yingmin Tang; Jianguo Xiao | 2998 | |
164 | 10:00 | Arbitrary Style Transfer With Style-Attentional Networks | Dae Young Park; Kwang Hee Lee | 3151 | |
165 | 10:00 | Typography With Decor: Intelligent Text Style Transfer | Wenjing Wang; Jiaying Liu; Shuai Yang; Zongming Guo | 3159 | |
Computational Photography & Graphics | 166 | 10:00 | RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion | Muhammad Sarmad; Hyunjoo Jenny Lee; Young Min Kim | 2331 |
167 | 10:00 | Photo Wake-Up: 3D Character Animation From a Single Photo | Chung-Yi Weng; Brian Curless; Ira Kemelmacher-Shlizerman | 2432 | |
168 | 10:00 | DeepLight: Learning Illumination for Unconstrained Mobile Mixed Reality | Chloe LeGendre; Wan-Chun Ma; Graham Fyffe; John Flynn; Laurent Charbonnel; Jay Busch; Paul Debevec | 2444 | |
169 | 10:00 | Iterative Residual CNNs for Burst Photography Applications | Filippos Kokkinos; Stamatis Lefkimmiatis | 2823 | |
170 | 10:00 | Learning Implicit Fields for Generative Shape Modeling | Zhiqin Chen; Hao Zhang | 2932 | |
171 | 10:00 | Reliable and Efficient Image Cropping: A Grid Anchor Based Approach | Hui Zeng; Lida Li; Zisheng Cao; Lei Zhang | 3227 | |
172 | 10:00 | Patch-Based Progressive 3D Point Set Upsampling | Wang Yifan; Shihao Wu; Hui Huang; Daniel Cohen-Or; Olga Sorkine-Hornung | 3312 | |
Low-Level & Optimization | 173 | 10:00 | An Iterative and Cooperative Top-Down and Bottom-Up Inference Network for Salient Object Detection | Wenguan Wang; Jianbing Shen; Ming-Ming Cheng; Ling Shao | 1405 |
174 | 10:00 | Deep Stacked Hierarchical Multi-Patch Network for Image Deblurring | Hongguang Zhang; Yuchao Dai; Hongdong Li; Piotr Koniusz | 1936 | |
175 | 10:00 | Turn a Silicon Camera Into an InGaAs Camera | Feifan Lv; Yinqiang Zheng; Bohan Zhang; Feng Lu | 2258 | |
176 | 10:00 | Low-Rank Tensor Completion With a New Tensor Nuclear Norm Induced by Invertible Linear Transforms | Canyi Lu; Xi Peng; Yunchao Wei | 2318 | |
177 | 10:00 | Joint Representative Selection and Feature Learning: A Semi-Supervised Approach | Suchen Wang; Jingjing Meng; Junsong Yuan; Yap-Peng Tan | 2348 | |
178 | 10:00 | The Domain Transform Solver | Akash Bapat; Jan-Michael Frahm | 2468 | |
179 | 10:00 | CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection | Lu Zhang; Jianming Zhang; Zhe Lin; Huchuan Lu; You He | 2483 | |
180 | 10:00 | Phase-Only Image Based Kernel Estimation for Single Image Blind Deblurring | Liyuan Pan; Richard Hartley; Miaomiao Liu; Yuchao Dai | 2485 | |
181 | 10:00 | Hierarchical Discrete Distribution Decomposition for Match Density Estimation | Zhichao Yin; Trevor Darrell; Fisher Yu | 2752 | |
182 | 10:00 | FOCNet: A Fractional Optimal Control Network for Image Denoising | Xixi Jia; Sanyang Liu; Xiangchu Feng; Lei Zhang | 2791 | |
183 | 10:00 | Orthogonal Decomposition Network for Pixel-Wise Binary Classification | Chang Liu; Fang Wan; Wei Ke; Zhuowei Xiao; Yuan Yao; Xiaosong Zhang; Qixiang Ye | 2840 | |
184 | 10:00 | Multi-Source Weak Supervision for Saliency Detection | Yu Zeng; Yunzhi Zhuge; Huchuan Lu; Lihe Zhang; Mingyang Qian; Yizhou Yu | 2984 | |
185 | 10:00 | ComDefend: An Efficient Image Compression Model to Defend Adversarial Examples | Xiaojun Jia; Xingxing Wei; Xiaochun Cao; Hassan Foroosh | 2999 | |
186 | 10:00 | Combinatorial Persistency Criteria for Multicut and Max-Cut | Jan-Hendrik Lange; Bjoern Andres; Paul Swoboda | 3062 | |
187 | 10:00 | S4Net: Single Stage Salient-Instance Segmentation | Ruochen Fan; Ming-Ming Cheng; Qibin Hou; Tai-Jiang Mu; Jingdong Wang; Shi-Min Hu | 3132 | |
188 | 10:00 | A Decomposition Algorithm for the Sparse Generalized Eigenvalue Problem | Ganzhao Yuan; Li Shen; Wei-Shi Zheng | 3258 | |
Scenes & Representation | 189 | 10:00 | Polynomial Representation for Persistence Diagram | Zhichao Wang; Qian Li; Gang Li; Guandong Xu | 2334 |
190 | 10:00 | Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks | Xiaolong Jiang; Zehao Xiao; Baochang Zhang; Xiantong Zhen; Xianbin Cao; David Doermann; Ling Shao | 2336 | |
191 | 10:00 | Cross-Atlas Convolution for Parameterization Invariant Learning on Textured Mesh Surface | Shiwei Li; Zixin Luo; Mingmin Zhen; Yao Yao; Tianwei Shen; Tian Fang; Long Quan | 2492 | |
192 | 10:00 | Deep Surface Normal Estimation With Hierarchical RGB-D Fusion | Jin Zeng; Yanfeng Tong; Yunmu Huang; Qiong Yan; Wenxiu Sun; Jing Chen; Yongtian Wang | 2699 | |
193 | 10:00 | Knowledge-Embedded Routing Network for Scene Graph Generation | Tianshui Chen; Weihao Yu; Riquan Chen; Liang Lin | 2807 | |
194 | 10:00 | An End-To-End Network for Panoptic Segmentation | Huanyu Liu; Chao Peng; Changqian Yu; Jingbo Wang; Xu Liu; Gang Yu; Wei Jiang | 2873 | |
195 | 10:00 | Fast and Flexible Indoor Scene Synthesis via Deep Convolutional Generative Models | Daniel Ritchie; Kai Wang; Yu-An Lin | 2950 | |
196 | 10:00 | Marginalized Latent Semantic Encoder for Zero-Shot Learning | Zhengming Ding; Hongfu Liu | 3081 | |
197 | 10:00 | Scale-Adaptive Neural Dense Features: Learning via Hierarchical Context Aggregation | Jaime Spencer; Richard Bowden; Simon Hadfield | 3177 | |
198 | 10:00 | Unsupervised Embedding Learning via Invariant and Spreading Instance Feature | Mang Ye; Xu Zhang; Pong C. Yuen; Shih-Fu Chang | 3213 | |
199 | 10:00 | AOGNets: Compositional Grammatical Architectures for Deep Learning | Xilai Li; Xi Song; Tianfu Wu | 3256 | |
200 | 10:00 | A Robust Local Spectral Descriptor for Matching Non-Rigid Shapes With Incompatible Shape Structures | Yiqun Wang; Jianwei Guo; Dong-Ming Yan; Kai Wang; Xiaopeng Zhang | 3270 | |
Language & Reasoning | 201 | 10:00 | Context and Attribute Grounded Dense Captioning | Guojun Yin; Lu Sheng; Bin Liu; Nenghai Yu; Xiaogang Wang; Jing Shao | 1800 |
202 | 10:00 | Spot and Learn: A Maximum-Entropy Patch Sampler for Few-Shot Image Classification | Wen-Hsuan Chu; Yu-Jhe Li; Jing-Cheng Chang; Yu-Chiang Frank Wang | 2206 | |
203 | 10:00 | Interpreting CNNs via Decision Trees | Quanshi Zhang; Yu Yang; Haotian Ma; Ying Nian Wu | 2349 | |
204 | 10:00 | Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning | Dong-Jin Kim; Jinsoo Choi; Tae-Hyun Oh; In So Kweon | 2380 | |
205 | 10:00 | Deep Modular Co-Attention Networks for Visual Question Answering | Zhou Yu; Jun Yu; Yuhao Cui; Dacheng Tao; Qi Tian | 2401 | |
206 | 10:00 | Synthesizing Environment-Aware Activities via Activity Sketches | Yuan-Hong Liao; Xavier Puig; Marko Boben; Antonio Torralba; Sanja Fidler | 2591 | |
207 | 10:00 | Self-Critical n-Step Training for Image Captioning | Junlong Gao; Shiqi Wang; Shanshe Wang; Siwei Ma; Wen Gao | 3422 | |
208 | 10:00 | Multi-Target Embodied Question Answering | Licheng Yu; Xinlei Chen; Georgia Gkioxari; Mohit Bansal; Tamara L. Berg; Dhruv Batra | 2730 | |
209 | 10:00 | Visual Question Answering as Reading Comprehension | Hui Li; Peng Wang; Chunhua Shen; Anton van den Hengel | 2795 | |
210 | 10:00 | StoryGAN: A Sequential Conditional GAN for Story Visualization | Yitong Li; Zhe Gan; Yelong Shen; Jingjing Liu; Yu Cheng; Yuexin Wu; Lawrence Carin; David Carlson; Jianfeng Gao | 3040 | |
Applications, Medical, & Robotics | 211 | 10:00 | Noise-Aware Unsupervised Deep Lidar-Stereo Fusion | Xuelian Cheng; Yiran Zhong; Yuchao Dai; Pan Ji; Hongdong Li | 2298 |
212 | 10:00 | Versatile Multiple Choice Learning and Its Application to Vision Computing | Kai Tian; Yi Xu; Shuigeng Zhou; Jihong Guan | 2513 | |
213 | 10:00 | EV-Gait: Event-Based Robust Gait Recognition Using Dynamic Vision Sensors | Yanxiang Wang; Bowen Du; Yiran Shen; Kai Wu; Guangrong Zhao; Jianguo Sun; Hongkai Wen | 2867 | |
214 | 10:00 | ToothNet: Automatic Tooth Instance Segmentation and Identification From Cone Beam CT Images | Zhiming Cui; Changjian Li; Wenping Wang | 2907 | |
215 | 10:00 | Modularized Textual Grounding for Counterfactual Resilience | Zhiyuan Fang; Shu Kong; Charless Fowlkes; Yezhou Yang | 3007 | |
216 | 10:00 | L3-Net: Towards Learning Based LiDAR Localization for Autonomous Driving | Weixin Lu; Yao Zhou; Guowei Wan; Shenhua Hou; Shiyu Song | 3124 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Recognition | 25 | 13:30 | Panoptic Feature Pyramid Networks | Alexander Kirillov; Ross Girshick; Kaiming He; Piotr Dollár | 1462 |
26 | 13:35 | Mask Scoring R-CNN | Zhaojin Huang; Lichao Huang; Yongchao Gong; Chang Huang; Xinggang Wang | 2705 | |
27 | 13:40 | Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection | Hang Xu; Chenhan Jiang; Xiaodan Liang; Liang Lin; Zhenguo Li | 3864 | |
28 | 13:48 | Cross-Modality Personalization for Retrieval | Nils Murrugarra-Llerena; Adriana Kovashka | 1476 | |
29 | 13:53 | Composing Text and Image for Image Retrieval - an Empirical Odyssey | Nam Vo; Lu Jiang; Chen Sun; Kevin Murphy; Li-Jia Li; Li Fei-Fei; James Hays | 2623 | |
30 | 13:58 | Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation | Xiaobing Wang; Yingying Jiang; Zhenbo Luo; Cheng-Lin Liu; Hyunsoo Choi; Sungjin Kim | 3524 | |
31 | 14:06 | Adaptive NMS: Refining Pedestrian Detection in a Crowd | Songtao Liu; Di Huang; Yunhong Wang | 2657 | |
32 | 14:11 | Point in, Box Out: Beyond Counting Persons in Crowds | Yuting Liu; Miaojing Shi; Qijun Zhao; Xiaofang Wang | 3517 | |
33 | 14:16 | Locating Objects Without Bounding Boxes | Javier Ribera; David Güera; Yuhao Chen; Edward J. Delp | 6264 | |
34 | 14:24 | FineGAN: Unsupervised Hierarchical Disentanglement for Fine-Grained Object Generation and Discovery | Krishna Kumar Singh; Utkarsh Ojha; Yong Jae Lee | 3333 | |
35 | 14:29 | Mutual Learning of Complementary Networks via Residual Correction for Improving Semi-Supervised Classification | Si Wu; Jichang Li; Cheng Liu; Zhiwen Yu; Hau-San Wong | 3505 | |
36 | 14:34 | Sampling Techniques for Large-Scale Object Detection From Sparsely Annotated Objects | Yusuke Niitani; Takuya Akiba; Tommi Kerola; Toru Ogawa; Shotaro Sano; Shuji Suzuki | 4012 | |
37 | 14:42 | Curls & Whey: Boosting Black-Box Adversarial Attacks | Yucheng Shi; Siyu Wang; Yahong Han | 4099 | |
38 | 14:47 | Barrage of Random Transforms for Adversarially Robust Defense | Edward Raff; Jared Sylvester; Steven Forsyth; Mark McLean | 5988 | |
39 | 14:52 | Aggregation Cross-Entropy for Sequence Recognition | Zecheng Xie; Yaoxiong Huang; Yuanzhi Zhu; Lianwen Jin; Yuliang Liu; Lele Xie | 4648 | |
40 | 15:00 | LaSO: Label-Set Operations Networks for Multi-Label Few-Shot Learning | Amit Alfassy; Leonid Karlinsky; Amit Aides; Joseph Shtok; Sivan Harary; Rogerio Feris; Raja Giryes; Alex M. Bronstein | 4674 | |
41 | 15:05 | Few-Shot Learning With Localization in Realistic Settings | Davis Wertheimer; Bharath Hariharan | 5352 | |
42 | 15:10 | AdaGraph: Unifying Predictive and Continuous Domain Adaptation Through Graphs | Massimiliano Mancini; Samuel Rota Bulò; Barbara Caputo; Elisa Ricci | 5575 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Language & Reasoning | 177 | 13:30 | Grounded Video Description | Luowei Zhou; Yannis Kalantidis; Xinlei Chen; Jason J. Corso; Marcus Rohrbach | 12 |
178 | 13:35 | Streamlined Dense Video Captioning | Jonghwan Mun; Linjie Yang; Zhou Ren; Ning Xu; Bohyung Han | 3566 | |
179 | 13:40 | Adversarial Inference for Multi-Sentence Video Description | Jae Sung Park; Marcus Rohrbach; Trevor Darrell; Anna Rohrbach | 5612 | |
180 | 13:48 | Unified Visual-Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations | Hao Wu; Jiayuan Mao; Yufeng Zhang; Yuning Jiang; Lei Li; Weiwei Sun; Wei-Ying Ma | 4705 | |
181 | 13:53 | Learning to Compose Dynamic Tree Structures for Visual Contexts | Kaihua Tang; Hanwang Zhang; Baoyuan Wu; Wenhan Luo; Wei Liu | 3640 | |
182 | 13:58 | Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation | Xin Wang; Qiuyuan Huang; Asli Celikyilmaz; Jianfeng Gao; Dinghan Shen; Yuan-Fang Wang; William Yang Wang; Lei Zhang | 5104 | |
183 | 14:06 | Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering | Peng Gao; Zhengkai Jiang; Haoxuan You; Pan Lu; Steven C. H. Hoi; Xiaogang Wang; Hongsheng Li | 1824 | |
184 | 14:11 | Cycle-Consistency for Robust Visual Question Answering | Meet Shah; Xinlei Chen; Marcus Rohrbach; Devi Parikh | 3454 | |
185 | 14:16 | Embodied Question Answering in Photorealistic Environments With Point Cloud Perception | Erik Wijmans; Samyak Datta; Oleksandr Maksymets; Abhishek Das; Georgia Gkioxari; Stefan Lee; Irfan Essa; Devi Parikh; Dhruv Batra | 135 | |
186 | 14:24 | Reasoning Visual Dialogs With Structural and Partial Observations | Zilong Zheng; Wenguan Wang; Siyuan Qi; Song-Chun Zhu | 3909 | |
187 | 14:29 | Recursive Visual Attention in Visual Dialog | Yulei Niu; Hanwang Zhang; Manli Zhang; Jianhong Zhang; Zhiwu Lu; Ji-Rong Wen | 3129 | |
188 | 14:34 | Two Body Problem: Collaborative Visual Task Completion | Unnat Jain; Luca Weihs; Eric Kolve; Mohammad Rastegari; Svetlana Lazebnik; Ali Farhadi; Alexander G. Schwing; Aniruddha Kembhavi | 3820 | |
189 | 14:42 | GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering | Drew A. Hudson; Christopher D. Manning | 7021 | |
190 | 14:47 | Text2Scene: Generating Compositional Scenes From Textual Descriptions | Fuwen Tan; Song Feng; Vicente Ordonez | 1530 | |
191 | 14:52 | From Recognition to Cognition: Visual Commonsense Reasoning | Rowan Zellers; Yonatan Bisk; Ali Farhadi; Yejin Choi | 5126 | |
192 | 15:00 | The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation | Chih-Yao Ma; Zuxuan Wu; Ghassan AlRegib; Caiming Xiong; Zsolt Kira | 3587 | |
193 | 15:05 | Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation | Liyiming Ke; Xiujun Li; Yonatan Bisk; Ari Holtzman; Zhe Gan; Jingjing Liu; Jianfeng Gao; Yejin Choi; Siddhartha Srinivasa | 6287 | |
194 | 15:10 | Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning | Mitchell Wortsman; Kiana Ehsani; Mohammad Rastegari; Ali Farhadi; Roozbeh Mottaghi | 1770 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Computational Photography & Graphics | 130 | 13:30 | Photon-Flooded Single-Photon 3D Cameras | Anant Gupta; Atul Ingle; Andreas Velten; Mohit Gupta | 1624 |
131 | 13:35 | High Flux Passive Imaging With Single-Photon Sensors | Atul Ingle; Andreas Velten; Mohit Gupta | 1064 | |
132 | 13:40 | Acoustic Non-Line-Of-Sight Imaging | David B. Lindell; Gordon Wetzstein; Vladlen Koltun | 2059 | |
133 | 13:48 | Steady-State Non-Line-Of-Sight Imaging | Wenzheng Chen; Simon Daneau; Fahim Mannan; Felix Heide | 3310 | |
134 | 13:53 | A Theory of Fermat Paths for Non-Line-Of-Sight Shape Reconstruction | Shumian Xin; Sotiris Nousias; Kiriakos N. Kutulakos; Aswin C. Sankaranarayanan; Srinivasa G. Narasimhan; Ioannis Gkioulekas | 2427 | |
135 | 13:58 | End-To-End Projector Photometric Compensation | Bingyao Huang; Haibin Ling | 474 | |
136 | 14:06 | Bringing a Blurry Frame Alive at High Frame-Rate With an Event Camera | Liyuan Pan; Cedric Scheerlinck; Xin Yu; Richard Hartley; Miaomiao Liu; Yuchao Dai | 2605 | |
137 | 14:11 | Bringing Alive Blurred Moments | Kuldeep Purohit; Anshul Shah; A. N. Rajagopalan | 5932 | |
138 | 14:16 | Learning to Synthesize Motion Blur | Tim Brooks; Jonathan T. Barron | 1607 | |
139 | 14:24 | Underexposed Photo Enhancement Using Deep Illumination Estimation | Ruixing Wang; Qing Zhang; Chi-Wing Fu; Xiaoyong Shen; Wei-Shi Zheng; Jiaya Jia | 2861 | |
140 | 14:29 | Blind Visual Motif Removal From a Single Image | Amir Hertz; Sharon Fogel; Rana Hanocka; Raja Giryes; Daniel Cohen-Or | 2843 | |
141 | 14:34 | Non-Local Meets Global: An Integrated Paradigm for Hyperspectral Denoising | Wei He; Quanming Yao; Chao Li; Naoto Yokoya; Qibin Zhao | 6541 | |
142 | 14:42 | Neural Rerendering in the Wild | Moustafa Meshry; Dan B. Goldman; Sameh Khamis; Hugues Hoppe; Rohit Pandey; Noah Snavely; Ricardo Martin-Brualla | 4943 | |
143 | 14:47 | GeoNet: Deep Geodesic Networks for Point Cloud Analysis | Tong He; Haibin Huang; Li Yi; Yuqian Zhou; Chihao Wu; Jue Wang; Stefano Soatto | 430 | |
144 | 14:52 | MeshAdv: Adversarial Meshes for Visual Recognition | Chaowei Xiao; Dawei Yang; Bo Li; Jia Deng; Mingyan Liu | 2440 | |
145 | 15:00 | Fast Spatially-Varying Indoor Lighting Estimation | Mathieu Garon; Kalyan Sunkavalli; Sunil Hadap; Nathan Carr; Jean-François Lalonde | 4701 | |
146 | 15:05 | Neural Illumination: Lighting Prediction for Indoor Environments | Shuran Song; Thomas Funkhouser | 1188 | |
147 | 15:10 | Deep Sky Modeling for Single Image Outdoor Lighting Estimation | Yannick Hold-Geoffroy; Akshaya Athawale; Jean-François Lalonde | 4363 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 15:20 | Bidirectional Learning for Domain Adaptation of Semantic Segmentation | Yunsheng Li; Lu Yuan; Nuno Vasconcelos | 2969 |
2 | 15:20 | Enhanced Bayesian Compression via Deep Reinforcement Learning | Xin Yuan; Liangliang Ren; Jiwen Lu; Jie Zhou | 3330 | |
3 | 15:20 | Strong-Weak Distribution Alignment for Adaptive Object Detection | Kuniaki Saito; Yoshitaka Ushiku; Tatsuya Harada; Kate Saenko | 3387 | |
4 | 15:20 | MFAS: Multimodal Fusion Architecture Search | Juan-Manuel Pérez-Rúa;; Valentin Vielzeuf; Stéphane Pateux; Moez Baccouche; Frederic Jurie | 3429 | |
5 | 15:20 | Disentangling Adversarial Robustness and Generalization | David Stutz; Matthias Hein; Bernt Schiele | 3440 | |
6 | 15:20 | ShieldNets: Defending Against Adversarial Attacks Using Probabilistic Adversarial Robustness | Rajkumar Theagarajan; Ming Chen; Bir Bhanu; Jing Zhang | 3483 | |
7 | 15:20 | Deeply-Supervised Knowledge Synergy | Dawei Sun; Anbang Yao; Aojun Zhou; Hao Zhao | 3487 | |
8 | 15:20 | Dual Residual Networks Leveraging the Potential of Paired Operations for Image Restoration | Xing Liu; Masanori Suganuma; Zhun Sun; Takayuki Okatani | 3503 | |
9 | 15:20 | Probabilistic End-To-End Noise Correction for Learning With Noisy Labels | Kun Yi; Jianxin Wu | 3543 | |
10 | 15:20 | Attention-Guided Unified Network for Panoptic Segmentation | Yanwei Li; Xinze Chen; Zheng Zhu; Lingxi Xie; Guan Huang; Dalong Du; Xingang Wang | 3547 | |
11 | 15:20 | NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection | Golnaz Ghiasi; Tsung-Yi Lin; Quoc V. Le | 3583 | |
12 | 15:20 | OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks | Jiashi Li; Qi Qi; Jingyu Wang; Ce Ge; Yujian Li; Zhangzhang Yue; Haifeng Sun | 3645 | |
13 | 15:20 | Semantically Aligned Bias Reducing Zero Shot Learning | Akanksha Paul; Narayanan C. Krishnan; Prateek Munjal | 3667 | |
14 | 15:20 | Feature Space Perturbations Yield More Transferable Adversarial Examples | Nathan Inkawhich; Wei Wen; Hai (Helen) Li; Yiran Chen | 3737 | |
15 | 15:20 | IGE-Net: Inverse Graphics Energy Networks for Human Pose Estimation and Single-View Reconstruction | Dominic Jack; Frederic Maire; Sareh Shirazi; Anders Eriksson | 3787 | |
16 | 15:20 | Accelerating Convolutional Neural Networks via Activation Map Compression | Georgios Georgiadis | 3815 | |
17 | 15:20 | Knowledge Distillation via Instance Relationship Graph | Yufan Liu; Jiajiong Cao; Bing Li; Chunfeng Yuan; Weiming Hu; Yangxi Li; Yunqiang Duan | 3844 | |
18 | 15:20 | PPGNet: Learning Point-Pair Graph for Line Segment Detection | Ziheng Zhang; Zhengxin Li; Ning Bi; Jia Zheng; Jinlei Wang; Kun Huang; Weixin Luo; Yanyu Xu; Shenghua Gao | 3943 | |
19 | 15:20 | Building Detail-Sensitive Semantic Segmentation Networks With Polynomial Pooling | Zhen Wei; Jingyi Zhang; Li Liu; Fan Zhu; Fumin Shen; Yi Zhou; Si Liu; Yao Sun; Ling Shao | 3951 | |
20 | 15:20 | Variational Bayesian Dropout With a Hierarchical Prior | Yuhang Liu; Wenyong Dong; Lei Zhang; Dong Gong; Qinfeng Shi | 3967 | |
21 | 15:20 | AANet: Attribute Attention Network for Person Re-Identifications | Chiat-Pin Tay; Sharmili Roy; Kim-Hui Yap | 4013 | |
22 | 15:20 | Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction | Osama Makansi; Eddy Ilg; Özgün Çiçek; Thomas Brox | 4106 | |
23 | 15:20 | A Main/Subsidiary Network Framework for Simplifying Binary Neural Networks | Yinghao Xu; Xin Dong; Yudian Li; Hao Su | 4129 | |
24 | 15:20 | PointNetLK: Robust & Efficient Point Cloud Registration Using PointNet | Yasuhiro Aoki; Hunter Goforth; Rangaprasad Arun Srivatsan; Simon Lucey | 4131 | |
Recognition | 25 | 15:20 | Panoptic Feature Pyramid Networks | Alexander Kirillov; Ross Girshick; Kaiming He; Piotr Dollár | 1462 |
26 | 15:20 | Mask Scoring R-CNN | Zhaojin Huang; Lichao Huang; Yongchao Gong; Chang Huang; Xinggang Wang | 2705 | |
27 | 15:20 | Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection | Hang Xu; Chenhan Jiang; Xiaodan Liang; Liang Lin; Zhenguo Li | 3864 | |
28 | 15:20 | Cross-Modality Personalization for Retrieval | Nils Murrugarra-Llerena; Adriana Kovashka | 1476 | |
29 | 15:20 | Composing Text and Image for Image Retrieval - an Empirical Odyssey | Nam Vo; Lu Jiang; Chen Sun; Kevin Murphy; Li-Jia Li; Li Fei-Fei; James Hays | 2623 | |
30 | 15:20 | Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation | Xiaobing Wang; Yingying Jiang; Zhenbo Luo; Cheng-Lin Liu; Hyunsoo Choi; Sungjin Kim | 3524 | |
31 | 15:20 | Adaptive NMS: Refining Pedestrian Detection in a Crowd | Songtao Liu; Di Huang; Yunhong Wang | 2657 | |
32 | 15:20 | Point in, Box Out: Beyond Counting Persons in Crowds | Yuting Liu; Miaojing Shi; Qijun Zhao; Xiaofang Wang | 3517 | |
33 | 15:20 | Locating Objects Without Bounding Boxes | Javier Ribera; David Güera; Yuhao Chen; Edward J. Delp | 6264 | |
34 | 15:20 | FineGAN: Unsupervised Hierarchical Disentanglement for Fine-Grained Object Generation and Discovery | Krishna Kumar Singh; Utkarsh Ojha; Yong Jae Lee | 3333 | |
35 | 15:20 | Mutual Learning of Complementary Networks via Residual Correction for Improving Semi-Supervised Classification | Si Wu; Jichang Li; Cheng Liu; Zhiwen Yu; Hau-San Wong | 3505 | |
36 | 15:20 | Sampling Techniques for Large-Scale Object Detection From Sparsely Annotated Objects | Yusuke Niitani; Takuya Akiba; Tommi Kerola; Toru Ogawa; Shotaro Sano; Shuji Suzuki | 4012 | |
37 | 15:20 | Curls & Whey: Boosting Black-Box Adversarial Attacks | Yucheng Shi; Siyu Wang; Yahong Han | 4099 | |
38 | 15:20 | Barrage of Random Transforms for Adversarially Robust Defense | Edward Raff; Jared Sylvester; Steven Forsyth; Mark McLean | 5988 | |
39 | 15:20 | Aggregation Cross-Entropy for Sequence Recognition | Zecheng Xie; Yaoxiong Huang; Yuanzhi Zhu; Lianwen Jin; Yuliang Liu; Lele Xie | 4648 | |
40 | 15:20 | LaSO: Label-Set Operations Networks for Multi-Label Few-Shot Learning | Amit Alfassy; Leonid Karlinsky; Amit Aides; Joseph Shtok; Sivan Harary; Rogerio Feris; Raja Giryes; Alex M. Bronstein | 4674 | |
41 | 15:20 | Few-Shot Learning With Localization in Realistic Settings | Davis Wertheimer; Bharath Hariharan | 5352 | |
42 | 15:20 | AdaGraph: Unifying Predictive and Continuous Domain Adaptation Through Graphs | Massimiliano Mancini; Samuel Rota Bulò; Barbara Caputo; Elisa Ricci | 5575 | |
43 | 15:20 | Few-Shot Adaptive Faster R-CNN | Tao Wang; Xiaopeng Zhang; Li Yuan; Jiashi Feng | 2532 | |
44 | 15:20 | VRSTC: Occlusion-Free Video Person Re-Identification | Ruibing Hou; Bingpeng Ma; Hong Chang; Xinqian Gu; Shiguang Shan; Xilin Chen | 3351 | |
45 | 15:20 | Compact Feature Learning for Multi-Domain Image Classification | Yajing Liu; Xinmei Tian; Ya Li; Zhiwei Xiong; Feng Wu | 3356 | |
46 | 15:20 | Adaptive Transfer Network for Cross-Domain Person Re-Identification | Jiawei Liu; Zheng-Jun Zha; Di Chen; Richang Hong; Meng Wang | 3360 | |
47 | 15:20 | Large-Scale Few-Shot Learning: Knowledge Transfer With Class Hierarchy | Aoxue Li; Tiange Luo; Zhiwu Lu; Tao Xiang; Liwei Wang | 3371 | |
48 | 15:20 | Moving Object Detection Under Discontinuous Change in Illumination Using Tensor Low-Rank and Invariant Sparse Decomposition | Moein Shakeri; Hong Zhang | 3466 | |
49 | 15:20 | Pedestrian Detection With Autoregressive Network Phases | Garrick Brazil; Xiaoming Liu | 3573 | |
50 | 15:20 | All You Need Is a Few Shifts: Designing Efficient Convolutional Neural Networks for Image Classification | Weijie Chen; Di Xie; Yuan Zhang; Shiliang Pu | 3594 | |
51 | 15:20 | Stochastic Class-Based Hard Example Mining for Deep Metric Learning | Yumin Suh; Bohyung Han; Wonsik Kim; Kyoung Mu Lee | 3633 | |
52 | 15:20 | Revisiting Local Descriptor Based Image-To-Class Measure for Few-Shot Learning | Wenbin Li; Lei Wang; Jinglin Xu; Jing Huo; Yang Gao; Jiebo Luo | 3682 | |
53 | 15:20 | Towards Robust Curve Text Detection With Conditional Spatial Expansion | Zichuan Liu; Guosheng Lin; Sheng Yang; Fayao Liu; Weisi Lin; Wang Ling Goh | 3684 | |
54 | 15:20 | Revisiting Perspective Information for Efficient Crowd Counting | Miaojing Shi; Zhaohui Yang; Chao Xu; Qijun Chen | 3750 | |
55 | 15:20 | Towards Universal Object Detection by Domain Attention | Xudong Wang; Zhaowei Cai; Dashan Gao; Nuno Vasconcelos | 3770 | |
56 | 15:20 | Ensemble Deep Manifold Similarity Learning Using Hard Proxies | Nicolas Aziere; Sinisa Todorovic | 3773 | |
57 | 15:20 | Quantization Networks | Jiwei Yang; Xu Shen; Jun Xing; Xinmei Tian; Houqiang Li; Bing Deng; Jianqiang Huang; Xian-sheng Hua | 3816 | |
58 | 15:20 | RES-PCA: A Scalable Approach to Recovering Low-Rank Matrices | Chong Peng; Chenglizhao Chen; Zhao Kang; Jianbo Li; Qiang Cheng | 3890 | |
59 | 15:20 | Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks | N. Dinesh Reddy; Minh Vo; Srinivasa G. Narasimhan | 3898 | |
60 | 15:20 | Efficient Featurized Image Pyramid Network for Single Shot Detector | Yanwei Pang; Tiancai Wang; Rao Muhammad Anwer; Fahad Shahbaz Khan; Ling Shao | 3924 | |
61 | 15:20 | Multi-Task Multi-Sensor Fusion for 3D Object Detection | Ming Liang; Bin Yang; Yun Chen; Rui Hu; Raquel Urtasun | 3955 | |
62 | 15:20 | Domain-Specific Batch Normalization for Unsupervised Domain Adaptation | Woong-Gi Chang; Tackgeun You; Seonguk Seo; Suha Kwak; Bohyung Han | 3961 | |
63 | 15:20 | Grid R-CNN | Xin Lu; Buyu Li; Yuxin Yue; Quanquan Li; Junjie Yan | 3997 | |
64 | 15:20 | MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled Visual Recognition | Weihe Zhang; Yali Wang; Yu Qiao | 4062 | |
65 | 15:20 | Mapping, Localization and Path Planning for Image-Based Navigation Using Visual Features and Map | Janine Thoma; Danda Pani Paudel; Ajad Chhatkuli; Thomas Probst; Luc Van Gool | 4087 | |
66 | 15:20 | Triply Supervised Decoder Networks for Joint Detection and Segmentation | Jiale Cao; Yanwei Pang; Xuelong Li | 4092 | |
67 | 15:20 | Leveraging the Invariant Side of Generative Zero-Shot Learning | Jingjing Li; Mengmeng Jing; Ke Lu; Zhengming Ding; Lei Zhu; Zi Huang | 4107 | |
68 | 15:20 | Exploring the Bounds of the Utility of Context for Object Detection | Ehud Barnea; Ohad Ben-Shahar | 4171 | |
Segmentation, Grouping, & Shape | 69 | 15:20 | A-CNN: Annularly Convolutional Neural Networks on Point Clouds | Artem Komarichev; Zichun Zhong; Jing Hua | 3443 |
70 | 15:20 | DARNet: Deep Active Ray Network for Building Segmentation | Dominic Cheng; Renjie Liao; Sanja Fidler; Raquel Urtasun | 3475 | |
71 | 15:20 | Point Cloud Oversegmentation With Graph-Structured Deep Metric Learning | Loic Landrieu; Mohamed Boussaha | 3631 | |
72 | 15:20 | Graphonomy: Universal Human Parsing via Graph Transfer Learning | Ke Gong; Yiming Gao; Xiaodan Liang; Xiaohui Shen; Meng Wang; Liang Lin | 3655 | |
73 | 15:20 | Fitting Multiple Heterogeneous Models by Multi-Class Cascaded T-Linkage | Luca Magri; Andrea Fusiello | 3681 | |
74 | 15:20 | A Late Fusion CNN for Digital Matting | Yunke Zhang; Lixue Gong; Lubin Fan; Peiran Ren; Qixing Huang; Hujun Bao; Weiwei Xu | 3710 | |
75 | 15:20 | BASNet: Boundary-Aware Salient Object Detection | Xuebin Qin; Zichen Zhang; Chenyang Huang; Chao Gao; Masood Dehghan; Martin Jagersand | 3784 | |
76 | 15:20 | ZigZagNet: Fusing Top-Down and Bottom-Up Context for Object Segmentation | Di Lin; Dingguo Shen; Siting Shen; Yuanfeng Ji; Dani Lischinski; Daniel Cohen-Or; Hui Huang | 3828 | |
77 | 15:20 | Object Instance Annotation With Deep Extreme Level Set Evolution | Zian Wang; David Acuna; Huan Ling; Amlan Kar; Sanja Fidler | 3977 | |
78 | 15:20 | Leveraging Crowdsourced GPS Data for Road Extraction From Aerial Imagery | Tao Sun; Zonglin Di; Pengyu Che; Chun Liu; Yin Wang | 4018 | |
79 | 15:20 | Adaptive Pyramid Context Network for Semantic Segmentation | Junjun He; Zhongying Deng; Lei Zhou; Yali Wang; Yu Qiao | 4080 | |
Statistics, Physics, Theory, & Datasets | 80 | 15:20 | Isospectralization, or How to Hear Shape, Style, and Correspondence | Luca Cosmo; Mikhail Panine; Arianna Rampini; Maks Ovsjanikov; Michael M. Bronstein; Emanuele Rodolà | 3421 |
81 | 15:20 | Speech2Face: Learning the Face Behind a Voice | Tae-Hyun Oh; Tali Dekel; Changil Kim; Inbar Mosseri; William T. Freeman; Michael Rubinstein; Wojciech Matusik | 3427 | |
82 | 15:20 | Joint Manifold Diffusion for Combining Predictions on Decoupled Observations | Kwang In Kim; Hyung Jin Chang | 3455 | |
83 | 15:20 | Audio Visual Scene-Aware Dialog | Huda Alamri; Vincent Cartillier; Abhishek Das; Jue Wang; Anoop Cherian; Irfan Essa; Dhruv Batra; Tim K. Marks; Chiori Hori; Peter Anderson; Stefan Lee; Devi Parikh | 3469 | |
84 | 15:20 | Learning to Minify Photometric Stereo | Junxuan Li; Antonio Robles-Kelly; Shaodi You; Yasuyuki Matsushita | 3540 | |
85 | 15:20 | Reflective and Fluorescent Separation Under Narrow-Band Illumination | Koji Koyamatsu; Daichi Hidaka; Takahiro Okabe; Hendrik P. A. Lensch | 3702 | |
86 | 15:20 | Depth From a Polarisation + RGB Stereo Pair | Dizhong Zhu; William A. P. Smith | 3760 | |
87 | 15:20 | Rethinking the Evaluation of Video Summaries | Mayu Otani; Yuta Nakashima; Esa Rahtu; Janne Heikkilä | 4016 | |
88 | 15:20 | What Object Should I Use? - Task Driven Object Detection | Johann Sawatzky; Yaser Souri; Christian Grund; Jürgen Gall | 4153 | |
3D Multiview | 89 | 15:20 | Triangulation Learning Network: From Monocular to Stereo 3D Object Detection | Zengyi Qin; Jinglu Wang; Yan Lu | 3878 |
90 | 15:20 | Connecting the Dots: Learning Representations for Active Monocular Depth Estimation | Gernot Riegler; Yiyi Liao; Simon Donné; Vladlen Koltun; Andreas Geiger | 3965 | |
91 | 15:20 | Learning Non-Volumetric Depth Fusion Using Successive Reprojections | Simon Donné; Andreas Geiger | 4006 | |
92 | 15:20 | Stereo R-CNN Based 3D Object Detection for Autonomous Driving | Peiliang Li; Xiaozhi Chen; Shaojie Shen | 4090 | |
93 | 15:20 | Hybrid Scene Compression for Visual Localization | Federico Camposeco; Andrea Cohen; Marc Pollefeys; Torsten Sattler | 4167 | |
3D Single View & RGBD | 94 | 15:20 | MMFace: A Multi-Metric Regression Network for Unconstrained Face Reconstruction | Hongwei Yi; Chen Li; Qiong Cao; Xiaoyong Shen; Sheng Li; Guoping Wang; Yu-Wing Tai | 3339 |
95 | 15:20 | 3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis | Xiaojuan Qi; Zhengzhe Liu; Qifeng Chen; Jiaya Jia | 3727 | |
96 | 15:20 | Single Image Depth Estimation Trained via Depth From Defocus Cues | Shir Gur; Lior Wolf | 3020 | |
97 | 15:20 | RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion | Jie Li; Yu Liu; Dong Gong; Qinfeng Shi; Xia Yuan; Chunxia Zhao; Ian Reid | 3858 | |
98 | 15:20 | Neural Scene Decomposition for Multi-Person Motion Capture | Helge Rhodin; Victor Constantin; Isinsu Katircioglu; Mathieu Salzmann; Pascal Fua | 4159 | |
Face & Body | 99 | 15:20 | Efficient Decision-Based Black-Box Adversarial Attacks on Face Recognition | Yinpeng Dong; Hang Su; Baoyuan Wu; Zhifeng Li; Wei Liu; Tong Zhang; Jun Zhu | 2676 |
100 | 15:20 | FA-RPN: Floating Region Proposals for Face Detection | Mahyar Najibi; Bharat Singh; Larry S. Davis | 3406 | |
101 | 15:20 | Bayesian Hierarchical Dynamic Model for Human Action Recognition | Rui Zhao; Wanru Xu; Hui Su; Qiang Ji | 3416 | |
102 | 15:20 | Mixed Effects Neural Networks (MeNets) With Applications to Gaze Estimation | Yunyang Xiong; Hyunwoo J. Kim; Vikas Singh | 3456 | |
103 | 15:20 | 3D Human Pose Estimation in Video With Temporal Convolutions and Semi-Supervised Training | Dario Pavllo; Christoph Feichtenhofer; David Grangier; Michael Auli | 3495 | |
104 | 15:20 | Learning to Regress 3D Face Shape and Expression From an Image Without 3D Supervision | Soubhik Sanyal; Timo Bolkart; Haiwen Feng; Michael J. Black | 4586 | |
105 | 15:20 | PoseFix: Model-Agnostic General Human Pose Refinement Network | Gyeongsik Moon; Ju Yong Chang; Kyoung Mu Lee | 3668 | |
106 | 15:20 | RepNet: Weakly Supervised Training of an Adversarial Reprojection Network for 3D Human Pose Estimation | Bastian Wandt; Bodo Rosenhahn | 3692 | |
107 | 15:20 | Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views | Junting Dong; Wen Jiang; Qixing Huang; Hujun Bao; Xiaowei Zhou | 3873 | |
108 | 15:20 | Face-Focused Cross-Stream Network for Deception Detection in Videos | Mingyu Ding; An Zhao; Zhiwu Lu; Tao Xiang; Ji-Rong Wen | 3875 | |
109 | 15:20 | Unequal-Training for Deep Face Recognition With Long-Tailed Noisy Data | Yaoyao Zhong; Weihong Deng; Mei Wang; Jiani Hu; Jianteng Peng; Xunqiang Tao; Yaohai Huang | 3883 | |
110 | 15:20 | T-Net: Parametrizing Fully Convolutional Nets With a Single High-Order Tensor | Jean Kossaifi; Adrian Bulat; Georgios Tzimiropoulos; Maja Pantic | 4029 | |
111 | 15:20 | Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss | Lele Chen; Ross K. Maddox; Zhiyao Duan; Chenliang Xu | 4104 | |
Action & Video | 112 | 15:20 | Object-Centric Auto-Encoders and Dummy Anomalies for Abnormal Event Detection in Video | Radu Tudor Ionescu; Fahad Shahbaz Khan; Mariana-Iuliana Georgescu; Ling Shao | 3379 |
113 | 15:20 | DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition | Toby Perrett; Dima Damen | 3391 | |
114 | 15:20 | The Pros and Cons: Rank-Aware Temporal Attention for Skill Determination in Long Videos | Hazel Doughty; Walterio Mayol-Cuevas; Dima Damen | 3398 | |
115 | 15:20 | Collaborative Spatiotemporal Feature Learning for Video Action Recognition | Chao Li; Qiaoyong Zhong; Di Xie; Shiliang Pu | 3544 | |
116 | 15:20 | MARS: Motion-Augmented RGB Stream for Action Recognition | Nieves Crasto; Philippe Weinzaepfel; Karteek Alahari; Cordelia Schmid | 3605 | |
117 | 15:20 | Convolutional Relational Machine for Group Activity Recognition | Sina Mokhtarzadeh Azar; Mina Ghadimi Atigh; Ahmad Nickabadi; Alexandre Alahi | 3798 | |
118 | 15:20 | Video Summarization by Learning From Unpaired Data | Mrigank Rochan; Yang Wang | 3885 | |
119 | 15:20 | Skeleton-Based Action Recognition With Directed Graph Neural Networks | Lei Shi; Yifan Zhang; Jian Cheng; Hanqing Lu | 4047 | |
120 | 15:20 | PA3D: Pose-Action 3D Machine for Video Recognition | An Yan; Yali Wang; Zhifeng Li; Yu Qiao | 4070 | |
121 | 15:20 | Deep Dual Relation Modeling for Egocentric Interaction Recognition | Haoxin Li; Yijun Cai; Wei-Shi Zheng | 4117 | |
Motion & Biometrics | 122 | 15:20 | MOTS: Multi-Object Tracking and Segmentation | Paul Voigtlaender; Michael Krause; Aljosa Osep; Jonathon Luiten; Berin Balachandar Gnana Sekar; Andreas Geiger; Bastian Leibe | 3336 |
123 | 15:20 | Siamese Cascaded Region Proposal Networks for Real-Time Visual Tracking | Heng Fan; Haibin Ling | 3821 | |
124 | 15:20 | PointFlowNet: Learning Representations for Rigid Motion Estimation From Point Clouds | Aseem Behl; Despoina Paschalidou; Simon Donné; Andreas Geiger | 4050 | |
Synthesis | 125 | 15:20 | Listen to the Image | Di Hu; Dong Wang; Xuelong Li; Feiping Nie; Qi Wang | 3386 |
126 | 15:20 | Image Super-Resolution by Neural Texture Transfer | Zhifei Zhang; Zhaowen Wang; Zhe Lin; Hairong Qi | 3437 | |
127 | 15:20 | Conditional Adversarial Generative Flow for Controllable Image Synthesis | Rui Liu; Yu Liu; Xinyu Gong; Xiaogang Wang; Hongsheng Li | 3619 | |
128 | 15:20 | How to Make a Pizza: Learning a Compositional Layer-Based GAN Model | Dim P. Papadopoulos; Youssef Tamaazousti; Ferda Ofli; Ingmar Weber; Antonio Torralba | 3811 | |
129 | 15:20 | TransGaGa: Geometry-Aware Unsupervised Image-To-Image Translation | Wayne Wu; Kaidi Cao; Cheng Li; Chen Qian; Chen Change Loy | 3983 | |
Computational Photography & Graphics | 130 | 15:20 | Photon-Flooded Single-Photon 3D Cameras | Anant Gupta; Atul Ingle; Andreas Velten; Mohit Gupta | 1624 |
131 | 15:20 | High Flux Passive Imaging With Single-Photon Sensors | Atul Ingle; Andreas Velten; Mohit Gupta | 1064 | |
132 | 15:20 | Acoustic Non-Line-Of-Sight Imaging | David B. Lindell; Gordon Wetzstein; Vladlen Koltun | 2059 | |
133 | 15:20 | Steady-State Non-Line-Of-Sight Imaging | Wenzheng Chen; Simon Daneau; Fahim Mannan; Felix Heide | 3310 | |
134 | 15:20 | A Theory of Fermat Paths for Non-Line-Of-Sight Shape Reconstruction | Shumian Xin; Sotiris Nousias; Kiriakos N. Kutulakos; Aswin C. Sankaranarayanan; Srinivasa G. Narasimhan; Ioannis Gkioulekas | 2427 | |
135 | 15:20 | End-To-End Projector Photometric Compensation | Bingyao Huang; Haibin Ling | 474 | |
136 | 15:20 | Bringing a Blurry Frame Alive at High Frame-Rate With an Event Camera | Liyuan Pan; Cedric Scheerlinck; Xin Yu; Richard Hartley; Miaomiao Liu; Yuchao Dai | 2605 | |
137 | 15:20 | Bringing Alive Blurred Moments | Kuldeep Purohit; Anshul Shah; A. N. Rajagopalan | 5932 | |
138 | 15:20 | Learning to Synthesize Motion Blur | Tim Brooks; Jonathan T. Barron | 1607 | |
139 | 15:20 | Underexposed Photo Enhancement Using Deep Illumination Estimation | Ruixing Wang; Qing Zhang; Chi-Wing Fu; Xiaoyong Shen; Wei-Shi Zheng; Jiaya Jia | 2861 | |
140 | 15:20 | Blind Visual Motif Removal From a Single Image | Amir Hertz; Sharon Fogel; Rana Hanocka; Raja Giryes; Daniel Cohen-Or | 2843 | |
141 | 15:20 | Non-Local Meets Global: An Integrated Paradigm for Hyperspectral Denoising | Wei He; Quanming Yao; Chao Li; Naoto Yokoya; Qibin Zhao | 6541 | |
142 | 15:20 | Neural Rerendering in the Wild | Moustafa Meshry; Dan B. Goldman; Sameh Khamis; Hugues Hoppe; Rohit Pandey; Noah Snavely; Ricardo Martin-Brualla | 4943 | |
143 | 15:20 | GeoNet: Deep Geodesic Networks for Point Cloud Analysis | Tong He; Haibin Huang; Li Yi; Yuqian Zhou; Chihao Wu; Jue Wang; Stefano Soatto | 430 | |
144 | 15:20 | MeshAdv: Adversarial Meshes for Visual Recognition | Chaowei Xiao; Dawei Yang; Bo Li; Jia Deng; Mingyan Liu | 2440 | |
145 | 15:20 | Fast Spatially-Varying Indoor Lighting Estimation | Mathieu Garon; Kalyan Sunkavalli; Sunil Hadap; Nathan Carr; Jean-François Lalonde | 4701 | |
146 | 15:20 | Neural Illumination: Lighting Prediction for Indoor Environments | Shuran Song; Thomas Funkhouser | 1188 | |
147 | 15:20 | Deep Sky Modeling for Single Image Outdoor Lighting Estimation | Yannick Hold-Geoffroy; Akshaya Athawale; Jean-François Lalonde | 4363 | |
148 | 15:20 | Depth-Attentional Features for Single-Image Rain Removal | Xiaowei Hu; Chi-Wing Fu; Lei Zhu; Pheng-Ann Heng | 3399 | |
149 | 15:20 | Hyperspectral Image Reconstruction Using a Deep Spatial-Spectral Prior | Lizhi Wang; Chen Sun; Ying Fu; Min H. Kim; Hua Huang | 3925 | |
150 | 15:20 | LiFF: Light Field Features in Scale and Depth | Donald G. Dansereau; Bernd Girod; Gordon Wetzstein | 3926 | |
151 | 15:20 | Deep Exemplar-Based Video Colorization | Bo Zhang; Mingming He; Jing Liao; Pedro V. Sander; Lu Yuan; Amine Bermak; Dong Chen | 4128 | |
152 | 15:20 | On Finding Gray Pixels | Yanlin Qian; Joni-Kristian Kämäräinen; Jarno Nikkanen; Jiří Matas | 4157 | |
Low-Level & Optimization | 153 | 15:20 | UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos | Yang Wang; Peng Wang; Zhenheng Yang; Chenxu Luo; Yi Yang; Wei Xu | 3486 |
154 | 15:20 | Learning Transformation Synchronization | Xiangru Huang; Zhenxiao Liang; Xiaowei Zhou; Yao Xie; Leonidas J. Guibas; Qixing Huang | 3497 | |
155 | 15:20 | D2-Net: A Trainable CNN for Joint Description and Detection of Local Features | Mihai Dusmanu; Ignacio Rocco; Tomas Pajdla; Marc Pollefeys; Josef Sivic; Akihiko Torii; Torsten Sattler | 3623 | |
156 | 15:20 | Recurrent Neural Networks With Intra-Frame Iterations for Video Deblurring | Seungjun Nah; Sanghyun Son; Kyoung Mu Lee | 3670 | |
157 | 15:20 | Learning to Extract Flawless Slow Motion From Blurry Videos | Meiguang Jin; Zhe Hu; Paolo Favaro | 3721 | |
158 | 15:20 | Natural and Realistic Single Image Super-Resolution With Explicit Natural Manifold Discrimination | Jae Woong Soh; Gu Yong Park; Junho Jo; Nam Ik Cho | 3768 | |
159 | 15:20 | RF-Net: An End-To-End Image Matching Network Based on Receptive Field | Xuelun Shen; Cheng Wang; Xin Li; Zenglei Yu; Jonathan Li; Chenglu Wen; Ming Cheng; Zijian He | 3854 | |
160 | 15:20 | Fast Single Image Reflection Suppression via Convex Optimization | Yang Yang; Wenye Ma; Yin Zheng; Jian-Feng Cai; Weiyu Xu | 3913 | |
161 | 15:20 | A Mutual Learning Method for Salient Object Detection With Intertwined Multi-Supervision | Runmin Wu; Mengyang Feng; Wenlong Guan; Dong Wang; Huchuan Lu; Errui Ding | 3929 | |
162 | 15:20 | Enhanced Pix2pix Dehazing Network | Yanyun Qu; Yizi Chen; Jingying Huang; Yuan Xie | 3948 | |
163 | 15:20 | Assessing Personally Perceived Image Quality via Image Features and Collaborative Filtering | Jari Korhonen | 4014 | |
164 | 15:20 | Single Image Reflection Removal Exploiting Misaligned Training Data and Network Enhancements | Kaixuan Wei; Jiaolong Yang; Ying Fu; David Wipf; Hua Huang | 4122 | |
Scenes & Representation | 165 | 15:20 | Exploring Context and Visual Pattern of Relationship for Scene Graph Generation | Wenbin Wang; Ruiping Wang; Shiguang Shan; Xilin Chen | 3375 |
166 | 15:20 | Learning From Synthetic Data for Crowd Counting in the Wild | Qi Wang; Junyu Gao; Wei Lin; Yuan Yuan | 3580 | |
167 | 15:20 | A Local Block Coordinate Descent Algorithm for the CSC Model | Ev Zisselman; Jeremias Sulam; Michael Elad | 3646 | |
168 | 15:20 | Not Using the Car to See the Sidewalk — Quantifying and Controlling the Effects of Context in Classification and Segmentation | Rakshith Shetty; Bernt Schiele; Mario Fritz | 3689 | |
169 | 15:20 | Discovering Fair Representations in the Data Domain | Novi Quadrianto; Viktoriia Sharmanska; Oliver Thomas | 3708 | |
170 | 15:20 | Actor-Critic Instance Segmentation | Nikita Araslanov; Constantin A. Rothkopf; Stefan Roth | 3709 | |
171 | 15:20 | Generalized Zero- and Few-Shot Learning via Aligned Variational Autoencoders | Edgar Schönfeld; Sayna Ebrahimi; Samarth Sinha; Trevor Darrell; Zeynep Akata | 3746 | |
172 | 15:20 | Semantic Projection Network for Zero- and Few-Label Semantic Segmentation | Yongqin Xian; Subhabrata Choudhury; Yang He; Bernt Schiele; Zeynep Akata | 3747 | |
173 | 15:20 | GCAN: Graph Convolutional Adversarial Network for Unsupervised Domain Adaptation | Xinhong Ma; Tianzhu Zhang; Changsheng Xu | 3927 | |
174 | 15:20 | Seamless Scene Segmentation | Lorenzo Porzi; Samuel Rota Bulò; Aleksander Colovic; Peter Kontschieder | 4003 | |
175 | 15:20 | Unsupervised Image Matching and Object Discovery as Optimization | Huy V. Vo; Francis Bach; Minsu Cho; Kai Han; Yann LeCun; Patrick Pérez; Jean Ponce | 4089 | |
176 | 15:20 | Wide-Area Crowd Counting via Ground-Plane Density Maps and Multi-View Fusion CNNs | Qi Zhang; Antoni B. Chan | 4178 | |
Language & Reasoning | 177 | 15:20 | Grounded Video Description | Luowei Zhou; Yannis Kalantidis; Xinlei Chen; Jason J. Corso; Marcus Rohrbach | 12 |
178 | 15:20 | Streamlined Dense Video Captioning | Jonghwan Mun; Linjie Yang; Zhou Ren; Ning Xu; Bohyung Han | 3566 | |
179 | 15:20 | Adversarial Inference for Multi-Sentence Video Description | Jae Sung Park; Marcus Rohrbach; Trevor Darrell; Anna Rohrbach | 5612 | |
180 | 15:20 | Unified Visual-Semantic Embeddings: Bridging Vision and Language With Structured Meaning Representations | Hao Wu; Jiayuan Mao; Yufeng Zhang; Yuning Jiang; Lei Li; Weiwei Sun; Wei-Ying Ma | 4705 | |
181 | 15:20 | Learning to Compose Dynamic Tree Structures for Visual Contexts | Kaihua Tang; Hanwang Zhang; Baoyuan Wu; Wenhan Luo; Wei Liu | 3640 | |
182 | 15:20 | Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation | Xin Wang; Qiuyuan Huang; Asli Celikyilmaz; Jianfeng Gao; Dinghan Shen; Yuan-Fang Wang; William Yang Wang; Lei Zhang | 5104 | |
183 | 15:20 | Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering | Peng Gao; Zhengkai Jiang; Haoxuan You; Pan Lu; Steven C. H. Hoi; Xiaogang Wang; Hongsheng Li | 1824 | |
184 | 15:20 | Cycle-Consistency for Robust Visual Question Answering | Meet Shah; Xinlei Chen; Marcus Rohrbach; Devi Parikh | 3454 | |
185 | 15:20 | Embodied Question Answering in Photorealistic Environments With Point Cloud Perception | Erik Wijmans; Samyak Datta; Oleksandr Maksymets; Abhishek Das; Georgia Gkioxari; Stefan Lee; Irfan Essa; Devi Parikh; Dhruv Batra | 135 | |
186 | 15:20 | Reasoning Visual Dialogs With Structural and Partial Observations | Zilong Zheng; Wenguan Wang; Siyuan Qi; Song-Chun Zhu | 3909 | |
187 | 15:20 | Recursive Visual Attention in Visual Dialog | Yulei Niu; Hanwang Zhang; Manli Zhang; Jianhong Zhang; Zhiwu Lu; Ji-Rong Wen | 3129 | |
188 | 15:20 | Two Body Problem: Collaborative Visual Task Completion | Unnat Jain; Luca Weihs; Eric Kolve; Mohammad Rastegari; Svetlana Lazebnik; Ali Farhadi; Alexander G. Schwing; Aniruddha Kembhavi | 3820 | |
189 | 15:20 | GQA: A New Dataset for Real-World Visual Reasoning and Compositional Question Answering | Drew A. Hudson; Christopher D. Manning | 7021 | |
190 | 15:20 | Text2Scene: Generating Compositional Scenes From Textual Descriptions | Fuwen Tan; Song Feng; Vicente Ordonez | 1530 | |
191 | 15:20 | From Recognition to Cognition: Visual Commonsense Reasoning | Rowan Zellers; Yonatan Bisk; Ali Farhadi; Yejin Choi | 5126 | |
192 | 15:20 | The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation | Chih-Yao Ma; Zuxuan Wu; Ghassan AlRegib; Caiming Xiong; Zsolt Kira | 3587 | |
193 | 15:20 | Tactical Rewind: Self-Correction via Backtracking in Vision-And-Language Navigation | Liyiming Ke; Xiujun Li; Yonatan Bisk; Ari Holtzman; Zhe Gan; Jingjing Liu; Jianfeng Gao; Yejin Choi; Siddhartha Srinivasa | 6287 | |
194 | 15:20 | Learning to Learn How to Learn: Self-Adaptive Visual Navigation Using Meta-Learning | Mitchell Wortsman; Kiana Ehsani; Mohammad Rastegari; Ali Farhadi; Roozbeh Mottaghi | 1770 | |
195 | 15:20 | Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions | Marcella Cornia; Lorenzo Baraldi; Rita Cucchiara | 2710 | |
196 | 15:20 | Towards VQA Models That Can Read | Amanpreet Singh; Vivek Natarajan; Meet Shah; Yu Jiang; Xinlei Chen; Dhruv Batra; Devi Parikh; Marcus Rohrbach | 3450 | |
197 | 15:20 | Object-Aware Aggregation With Bidirectional Temporal Graph for Video Captioning | Junchao Zhang; Yuxin Peng | 3595 | |
198 | 15:20 | Progressive Attention Memory Network for Movie Story Question Answering | Junyeong Kim; Minuk Ma; Kyungsu Kim; Sungjin Kim; Chang D. Yoo | 3652 | |
199 | 15:20 | Memory-Attended Recurrent Network for Video Captioning | Wenjie Pei; Jiyuan Zhang; Xiangrong Wang; Lei Ke; Xiaoyong Shen; Yu-Wing Tai | 3659 | |
200 | 15:20 | Visual Query Answering by Entity-Attribute Graph Matching and Reasoning | Peixi Xiong; Huayi Zhan; Xin Wang; Baivab Sinha; Ying Wu | 3846 | |
201 | 15:20 | Look Back and Predict Forward in Image Captioning | Yu Qin; Jiajun Du; Yonghua Zhang; Hongtao Lu | 3848 | |
202 | 15:20 | Explainable and Explicit Visual Reasoning Over Scene Graphs | Jiaxin Shi; Hanwang Zhang; Juanzi Li | 3908 | |
203 | 15:20 | Transfer Learning via Unsupervised Task Discovery for Visual Question Answering | Hyeonwoo Noh; Taehoon Kim; Jonghwan Mun; Bohyung Han | 4024 | |
204 | 15:20 | Intention Oriented Image Captions With Guiding Objects | Yue Zheng; Yali Li; Shengjin Wang | 4115 | |
Applications, Medical, & Robotics | 205 | 15:20 | Uncertainty Guided Multi-Scale Residual Learning-Using a Cycle Spinning CNN for Single Image De-Raining | Rajeev Yasarla; Vishal M. Patel | 3404 |
206 | 15:20 | Toward Realistic Image Compositing With Adversarial Learning | Bor-Chun Chen; Andrew Kae | 3467 | |
207 | 15:20 | Cross-Classification Clustering: An Efficient Multi-Object Tracking Technique for 3-D Instance Segmentation in Connectomics | Yaron Meirovitch; Lu Mi; Hayk Saribekyan; Alexander Matveev; David Rolnick; Nir Shavit | 3468 | |
208 | 15:20 | Deep ChArUco: Dark ChArUco Marker Pose Estimation | Danying Hu; Daniel DeTone; Tomasz Malisiewicz | 3473 | |
209 | 15:20 | Pseudo-LiDAR From Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving | Yan Wang; Wei-Lun Chao; Divyansh Garg; Bharath Hariharan; Mark Campbell; Kilian Q. Weinberger | 3482 | |
210 | 15:20 | Rules of the Road: Predicting Driving Behavior With a Convolutional Model of Semantic Interactions | Joey Hong; Benjamin Sapp; James Philbin | 3533 | |
211 | 15:20 | Metric Learning for Image Registration | Marc Niethammer; Roland Kwitt; François-Xavier Vialard | 3614 | |
212 | 15:20 | LO-Net: Deep Real-Time Lidar Odometry | Qing Li; Shaoyang Chen; Cheng Wang; Xin Li; Chenglu Wen; Ming Cheng; Jonathan Li | 3654 | |
213 | 15:20 | TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using Weighted Interactions | Rohan Chandra; Uttaran Bhattacharya; Aniket Bera; Dinesh Manocha | 3841 | |
214 | 15:20 | World From Blur | Jiayan Qiu; Xinchao Wang; Stephen J. Maybank; Dacheng Tao | 3912 | |
215 | 15:20 | Topology Reconstruction of Tree-Like Structure in Images via Structural Similarity Measure and Dominant Set Clustering | Jianyang Xie; Yitian Zhao; Yonghuai Liu; Pan Su; Yifan Zhao; Jun Cheng; Yalin Zheng; Jiang Liu | 3941 | |
216 | 15:20 | Pyramidal Person Re-IDentification via Multi-Loss Dynamic Training | Feng Zheng; Cheng Deng; Xing Sun; Xinyang Jiang; Xiaowei Guo; Zongqiao Yu; Feiyue Huang; Rongrong Ji | 4043 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Applications | 190 | 08:30 | Holistic and Comprehensive Annotation of Clinically Significant Findings on Diverse CT Images: Learning From Radiology Reports and Label Ontology | Ke Yan; Yifan Peng; Veit Sandfort; Mohammadhadi Bagheri; Zhiyong Lu; Ronald M. Summers | 2180 |
191 | 08:35 | Robust Histopathology Image Analysis: To Label or to Synthesize? | Le Hou; Ayush Agarwal; Dimitris Samaras; Tahsin M. Kurc; Rajarsi R. Gupta; Joel H. Saltz | 4246 | |
192 | 08:40 | Data Augmentation Using Learned Transformations for One-Shot Medical Image Segmentation | Amy Zhao; Guha Balakrishnan; Frédo Durand; John V. Guttag; Adrian V. Dalca | 6477 | |
193 | 08:48 | Shifting More Attention to Video Salient Object Detection | Deng-Ping Fan; Wenguan Wang; Ming-Ming Cheng; Jianbing Shen | 1853 | |
194 | 08:53 | Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration | De-An Huang; Suraj Nair; Danfei Xu; Yuke Zhu; Animesh Garg; Li Fei-Fei; Silvio Savarese; Juan Carlos Niebles | 864 | |
195 | 08:58 | Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry | Fei Xue; Xin Wang; Shunkai Li; Qiuyuan Wang; Junqiu Wang; Hongbin Zha | 1296 | |
196 | 09:06 | Image Generation From Layout | Bo Zhao; Lili Meng; Weidong Yin; Leonid Sigal | 3139 | |
197 | 09:11 | Multimodal Explanations by Predicting Counterfactuality in Videos | Atsushi Kanehira; Kentaro Takemoto; Sho Inayoshi; Tatsuya Harada | 4603 | |
198 | 09:16 | Learning to Explain With Complemental Examples | Atsushi Kanehira; Tatsuya Harada | 4606 | |
199 | 09:24 | HAQ: Hardware-Aware Automated Quantization With Mixed Precision | Kuan Wang; Zhijian Liu; Yujun Lin; Ji Lin; Song Han | 3441 | |
200 | 09:29 | Content Authentication for Neural Imaging Pipelines: End-To-End Optimization of Photo Provenance in Complex Distribution Channels | Pawel Korus; Nasir Memon | 4965 | |
201 | 09:34 | Inverse Procedural Modeling of Knitwear | Elena Trunz; Sebastian Merzbach; Jonathan Klein; Thomas Schulze; Michael Weinmann; Reinhard Klein | 5712 | |
202 | 09:42 | Estimating 3D Motion and Forces of Person-Object Interactions From Monocular Video | Zongmian Li; Jiri Sedlar; Justin Carpentier; Ivan Laptev; Nicolas Mansard; Josef Sivic | 2857 | |
203 | 09:47 | DeepMapping: Unsupervised Map Estimation From Multiple Point Clouds | Li Ding; Chen Feng | 4235 | |
204 | 09:52 | End-To-End Interpretable Neural Motion Planner | Wenyuan Zeng; Wenjie Luo; Simon Suo; Abbas Sadat; Bin Yang; Sergio Casas; Raquel Urtasun | 4880 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Learning, Physics, Theory, & Datasets | 77 | 08:30 | Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model | Tian Han; Erik Nijkamp; Xiaolin Fang; Mitch Hill; Song-Chun Zhu; Ying Nian Wu | 2609 |
78 | 08:35 | Image Deformation Meta-Networks for One-Shot Learning | Zitian Chen; Yanwei Fu; Yu-Xiong Wang; Lin Ma; Wei Liu; Martial Hebert | 2829 | |
79 | 08:40 | Online High Rank Matrix Completion | Jicong Fan; Madeleine Udell | 4917 | |
80 | 08:48 | Multispectral Imaging for Fine-Grained Recognition of Powders on Complex Backgrounds | Tiancheng Zhi; Bernardo R. Pires; Martial Hebert; Srinivasa G. Narasimhan | 1274 | |
81 | 08:53 | ContactDB: Analyzing and Predicting Grasp Contact via Thermal Imaging | Samarth Brahmbhatt; Cusuh Ham; Charles C. Kemp; James Hays | 2138 | |
82 | 08:58 | Robust Subspace Clustering With Independent and Piecewise Identically Distributed Noise Modeling | Yuanman Li; Jiantao Zhou; Xianwei Zheng; Jinyu Tian; Yuan Yan Tang | 4535 | |
83 | 09:06 | What Correspondences Reveal About Unknown Camera and Motion Models? | Thomas Probst; Ajad Chhatkuli; Danda Pani Paudel; Luc Van Gool | 4185 | |
84 | 09:11 | Self-Calibrating Deep Photometric Stereo Networks | Guanying Chen; Kai Han; Boxin Shi; Yasuyuki Matsushita; Kwan-Yee K. Wong | 1504 | |
85 | 09:16 | Argoverse: 3D Tracking and Forecasting With Rich Maps | Ming-Fang Chang; John Lambert; Patsorn Sangkloy; Jagjeet Singh; Slawomir Bak; Andrew Hartnett; De Wang; Peter Carr; Simon Lucey; Deva Ramanan; James Hays | 4994 | |
86 | 09:24 | Side Window Filtering | Hui Yin; Yuanhao Gong; Guoping Qiu | 5176 | |
87 | 09:29 | Defense Against Adversarial Images Using Web-Scale Nearest-Neighbor Search | Abhimanyu Dubey; Laurens van der Maaten; Zeki Yalniz; Yixuan Li; Dhruv Mahajan | 2319 | |
88 | 09:34 | Incremental Object Learning From Contiguous Views | Stefan Stojanov; Samarth Mishra; Ngoc Anh Thai; Nikhil Dhanda; Ahmad Humayun; Chen Yu; Linda B. Smith; James M. Rehg | 1519 | |
89 | 09:42 | IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition | Xiaoping Wu; Chi Zhan; Yu-Kun Lai; Ming-Ming Cheng; Jufeng Yang | 3627 | |
90 | 09:47 | CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification | Zheng Tang; Milind Naphade; Ming-Yu Liu; Xiaodong Yang; Stan Birchfield; Shuo Wang; Ratnesh Kumar; David Anastasiu; Jenq-Neng Hwang | 6334 | |
91 | 09:52 | Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence | Amir Zadeh; Michael Chan; Paul Pu Liang; Edmund Tong; Louis-Philippe Morency | 6439 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Segmentation & Grouping | 55 | 08:30 | UPSNet: A Unified Panoptic Segmentation Network | Yuwen Xiong; Renjie Liao; Hengshuang Zhao; Rui Hu; Min Bai; Ersin Yumer; Raquel Urtasun | 3471 |
56 | 08:35 | JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds With Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields | Quang-Hieu Pham; Thanh Nguyen; Binh-Son Hua; Gemma Roig; Sai-Kit Yeung | 5828 | |
57 | 08:40 | Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth | Davy Neven; Bert De Brabandere; Marc Proesmans; Luc Van Gool | 1303 | |
58 | 08:48 | DeepCO3: Deep Instance Co-Segmentation by Co-Peak Search and Co-Saliency Detection | Kuang-Jui Hsu; Yen-Yu Lin; Yung-Yu Chuang | 153 | |
59 | 08:53 | Improving Semantic Segmentation via Video Propagation and Label Relaxation | Yi Zhu; Karan Sapra; Fitsum A. Reda; Kevin J. Shih; Shawn Newsam; Andrew Tao; Bryan Catanzaro | 4250 | |
60 | 08:58 | Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video | Samvit Jain; Xin Wang; Joseph E. Gonzalez | 3121 | |
61 | 09:06 | Shape2Motion: Joint Analysis of Motion Parts and Attributes From 3D Shapes | Xiaogang Wang; Bin Zhou; Yahao Shi; Xiaowu Chen; Qinping Zhao; Kai Xu | 4311 | |
62 | 09:11 | Semantic Correlation Promoted Shape-Variant Context for Segmentation | Henghui Ding; Xudong Jiang; Bing Shuai; Ai Qun Liu; Gang Wang | 256 | |
63 | 09:16 | Relation-Shape Convolutional Neural Network for Point Cloud Analysis | Yongcheng Liu; Bin Fan; Shiming Xiang; Chunhong Pan | 2493 | |
64 | 09:24 | Enhancing Diversity of Defocus Blur Detectors via Cross-Ensemble Network | Wenda Zhao; Bowen Zheng; Qiuhua Lin; Huchuan Lu | 1233 | |
65 | 09:29 | BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames | Brent A. Griffin; Jason J. Corso | 2793 | |
66 | 09:34 | Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images | Wuyang Chen; Ziyu Jiang; Zhangyang Wang; Kexin Cui; Xiaoning Qian | 3074 | |
67 | 09:42 | Efficient Parameter-Free Clustering Using First Neighbor Relations | Saquib Sarfraz; Vivek Sharma; Rainer Stiefelhagen | 3203 | |
68 | 09:47 | Learning Personalized Modular Network Guided by Structured Knowledge | Xiaodan Liang | 3859 | |
69 | 09:52 | A Generative Appearance Model for End-To-End Video Object Segmentation | Joakim Johnander; Martin Danelljan; Emil Brissman; Fahad Shahbaz Khan; Michael Felsberg | 5616 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 10:00 | A Flexible Convolutional Solver for Fast Style Transfers | Gilles Puy; Patrick Pérez | 4200 |
2 | 10:00 | Cross Domain Model Compression by Structurally Weight Sharing | Shangqian Gao; Cheng Deng; Heng Huang | 4258 | |
3 | 10:00 | TraVeLGAN: Image-To-Image Translation by Transformation Vector Learning | Matthew Amodio; Smita Krishnaswamy | 4331 | |
4 | 10:00 | Deep Robust Subjective Visual Property Prediction in Crowdsourcing | Qianqian Xu; Zhiyong Yang; Yangbangyan Jiang; Xiaochun Cao; Qingming Huang; Yuan Yao | 4381 | |
5 | 10:00 | Transferable AutoML by Model Sharing Over Grouped Datasets | Chao Xue; Junchi Yan; Rong Yan; Stephen M. Chu; Yonggang Hu; Yonghua Lin | 4421 | |
6 | 10:00 | Learning Not to Learn: Training Deep Neural Networks With Biased Data | Byungju Kim; Hyunwoo Kim; Kyungsu Kim; Sungjin Kim; Junmo Kim | 4452 | |
7 | 10:00 | IRLAS: Inverse Reinforcement Learning for Architecture Search | Minghao Guo; Zhao Zhong; Wei Wu; Dahua Lin; Junjie Yan | 4472 | |
8 | 10:00 | Learning for Single-Shot Confidence Calibration in Deep Neural Networks Through Stochastic Inferences | Seonguk Seo; Paul Hongsuck Seo; Bohyung Han | 4526 | |
9 | 10:00 | Attention-Based Adaptive Selection of Operations for Image Restoration in the Presence of Unknown Combined Distortions | Masanori Suganuma; Xing Liu; Takayuki Okatani | 4562 | |
10 | 10:00 | Fully Learnable Group Convolution for Acceleration of Deep Neural Networks | Xijun Wang; Meina Kan; Shiguang Shan; Xilin Chen | 4666 | |
11 | 10:00 | EIGEN: Ecologically-Inspired GENetic Approach for Neural Network Structure Searching From Scratch | Jian Ren; Zhe Li; Jianchao Yang; Ning Xu; Tianbao Yang; David J. Foran | 4692 | |
12 | 10:00 | Deep Incremental Hashing Network for Efficient Image Retrieval | Dayan Wu; Qi Dai; Jing Liu; Bo Li; Weiping Wang | 4727 | |
13 | 10:00 | Robustness via Curvature Regularization, and Vice Versa | Seyed-Mohsen Moosavi-Dezfooli; Alhussein Fawzi; Jonathan Uesato; Pascal Frossard | 4753 | |
14 | 10:00 | SparseFool: A Few Pixels Make a Big Difference | Apostolos Modas; Seyed-Mohsen Moosavi-Dezfooli; Pascal Frossard | 4754 | |
15 | 10:00 | Interpretable and Fine-Grained Visual Explanations for Convolutional Neural Networks | Jörg Wagner; Jan Mathias Köhler; Tobias Gindele; Leon Hetzel; Jakob Thaddäus Wiedemer; Sven Behnke | 4904 | |
16 | 10:00 | Structured Pruning of Neural Networks With Budget-Aware Regularization | Carl Lemaire; Andrew Achkar; Pierre-Marc Jodoin | 4951 | |
17 | 10:00 | MBS: Macroblock Scaling for CNN Model Reduction | Yu-Hsun Lin; Chun-Nan Chou; Edward Y. Chang | 5033 | |
18 | 10:00 | Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells | Vladimir Nekrasov; Hao Chen; Chunhua Shen; Ian Reid | 5072 | |
19 | 10:00 | Generating 3D Adversarial Point Clouds | Chong Xiang; Charles R. Qi; Bo Li | 5078 | |
20 | 10:00 | Partial Order Pruning: For Best Speed/Accuracy Trade-Off in Neural Architecture Search | Xin Li; Yiming Zhou; Zheng Pan; Jiashi Feng | 5163 | |
21 | 10:00 | Memory in Memory: A Predictive Neural Network for Learning Higher-Order Non-Stationarity From Spatiotemporal Dynamics | Yunbo Wang; Jianjin Zhang; Hongyu Zhu; Mingsheng Long; Jianmin Wang; Philip S. Yu | 5165 | |
22 | 10:00 | Variational Information Distillation for Knowledge Transfer | Sungsoo Ahn; Shell Xu Hu; Andreas Damianou; Neil D. Lawrence; Zhenwen Dai | 5177 | |
23 | 10:00 | You Look Twice: GaterNet for Dynamic Filter Selection in CNNs | Zhourong Chen; Yang Li; Samy Bengio; Si Si | 5193 | |
24 | 10:00 | SpherePHD: Applying CNNs on a Spherical PolyHeDron Representation of 360° Images | Yeonkun Lee; Jaeseok Jeong; Jongseob Yun; Wonjune Cho; Kuk-Jin Yoon | 5194 | |
25 | 10:00 | ESPNetv2: A Light-Weight, Power Efficient, and General Purpose Convolutional Neural Network | Sachin Mehta; Mohammad Rastegari; Linda Shapiro; Hannaneh Hajishirzi | 5206 | |
26 | 10:00 | Assisted Excitation of Activations: A Learning Technique to Improve Object Detectors | Mohammad Mahdi Derakhshani; Saeed Masoudnia; Amir Hossein Shaker; Omid Mersa; Mohammad Amin Sadeghi; Mohammad Rastegari; Babak N. Araabi | 5208 | |
27 | 10:00 | Exploiting Edge Features for Graph Neural Networks | Liyu Gong; Qiang Cheng | 5245 | |
28 | 10:00 | Propagation Mechanism for Deep and Wide Neural Networks | Dejiang Xu; Mong Li Lee; Wynne Hsu | 5286 | |
29 | 10:00 | Catastrophic Child's Play: Easy to Perform, Hard to Defend Adversarial Attacks | Chih-Hui Ho; Brandon Leung; Erik Sandström; Yen Chang; Nuno Vasconcelos | 5348 | |
30 | 10:00 | Embedding Complementary Deep Networks for Image Classification | Qiuyu Chen; Wei Zhang; Jun Yu; Jianping Fan | 5353 | |
Recognition | 31 | 10:00 | Deep Multimodal Clustering for Unsupervised Audiovisual Learning | Di Hu; Feiping Nie; Xuelong Li | 3380 |
32 | 10:00 | Dense Classification and Implanting for Few-Shot Learning | Yann Lifchitz; Yannis Avrithis; Sylvaine Picard; Andrei Bursuc | 4215 | |
33 | 10:00 | Class-Balanced Loss Based on Effective Number of Samples | Yin Cui; Menglin Jia; Tsung-Yi Lin; Yang Song; Serge Belongie | 4223 | |
34 | 10:00 | Discovering Visual Patterns in Art Collections With Spatially-Consistent Feature Learning | Xi Shen; Alexei A. Efros; Mathieu Aubry | 4295 | |
35 | 10:00 | Min-Max Statistical Alignment for Transfer Learning | Samitha Herath; Mehrtash Harandi; Basura Fernando; Richard Nock | 4345 | |
36 | 10:00 | Spatial-Aware Graph Relation Network for Large-Scale Object Detection | Hang Xu; Chenhan Jiang; Xiaodan Liang; Zhenguo Li | 4388 | |
37 | 10:00 | Deformable ConvNets V2: More Deformable, Better Results | Xizhou Zhu; Han Hu; Stephen Lin; Jifeng Dai | 4390 | |
38 | 10:00 | Interaction-And-Aggregation Network for Person Re-Identification | Ruibing Hou; Bingpeng Ma; Hong Chang; Xinqian Gu; Shiguang Shan; Xilin Chen | 4396 | |
39 | 10:00 | Rare Event Detection Using Disentangled Representation Learning | Ryuhei Hamaguchi; Ken Sakurada; Ryosuke Nakamura | 4406 | |
40 | 10:00 | Shape Robust Text Detection With Progressive Scale Expansion Network | Wenhai Wang; Enze Xie; Xiang Li; Wenbo Hou; Tong Lu; Gang Yu; Shuai Shao | 4610 | |
41 | 10:00 | Dual Encoding for Zero-Example Video Retrieval | Jianfeng Dong; Xirong Li; Chaoxi Xu; Shouling Ji; Yuan He; Gang Yang; Xun Wang | 4657 | |
42 | 10:00 | MaxpoolNMS: Getting Rid of NMS Bottlenecks in Two-Stage Object Detectors | Lile Cai; Bin Zhao; Zhe Wang; Jie Lin; Chuan Sheng Foo; Mohamed Sabry Aly; Vijay Chandrasekhar | 4678 | |
43 | 10:00 | Character Region Awareness for Text Detection | Youngmin Baek; Bado Lee; Dongyoon Han; Sangdoo Yun; Hwalsuk Lee | 4706 | |
44 | 10:00 | Effective Aesthetics Prediction With Multi-Level Spatially Pooled Features | Vlad Hosu; Bastian Goldlücke; Dietmar Saupe | 4722 | |
45 | 10:00 | Attentive Region Embedding Network for Zero-Shot Learning | Guo-Sen Xie; Li Liu; Xiaobo Jin; Fan Zhu; Zheng Zhang; Jie Qin; Yazhou Yao; Ling Shao | 4750 | |
46 | 10:00 | Explicit Spatial Encoding for Deep Local Descriptors | Arun Mukundan; Giorgos Tolias; Ondřej Chum | 4755 | |
47 | 10:00 | Panoptic Segmentation | Alexander Kirillov; Kaiming He; Ross Girshick; Carsten Rother; Piotr Dollár | 4761 | |
48 | 10:00 | You Reap What You Sow: Using Videos to Generate High Precision Object Proposals for Weakly-Supervised Object Detection | Krishna Kumar Singh; Yong Jae Lee | 4780 | |
49 | 10:00 | Explore-Exploit Graph Traversal for Image Retrieval | Cheng Chang; Guangwei Yu; Chundi Liu; Maksims Volkovs | 4836 | |
50 | 10:00 | Dissimilarity Coefficient Based Weakly Supervised Object Detection | Aditya Arun; C.V. Jawahar; M. Pawan Kumar | 4837 | |
51 | 10:00 | Kernel Transformer Networks for Compact Spherical Convolution | Yu-Chuan Su; Kristen Grauman | 4887 | |
52 | 10:00 | Object Detection With Location-Aware Deformable Convolution and Backward Attention Filtering | Chen Zhang; Joohee Kim | 4977 | |
53 | 10:00 | Variational Prototyping-Encoder: One-Shot Learning With Prototypical Images | Junsik Kim; Tae-Hyun Oh; Seokju Lee; Fei Pan; In So Kweon | 5304 | |
54 | 10:00 | Unsupervised Domain Adaptation Using Feature-Whitening and Consensus Loss | Subhankar Roy; Aliaksandr Siarohin; Enver Sangineto; Samuel Rota Bulò; Nicu Sebe; Elisa Ricci | 5319 | |
Segmentation, Grouping, & Shape | 55 | 10:00 | UPSNet: A Unified Panoptic Segmentation Network | Yuwen Xiong; Renjie Liao; Hengshuang Zhao; Rui Hu; Min Bai; Ersin Yumer; Raquel Urtasun | 3471 |
56 | 10:00 | JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds With Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields | Quang-Hieu Pham; Thanh Nguyen; Binh-Son Hua; Gemma Roig; Sai-Kit Yeung | 5828 | |
57 | 10:00 | Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth | Davy Neven; Bert De Brabandere; Marc Proesmans; Luc Van Gool | 1303 | |
58 | 10:00 | DeepCO3: Deep Instance Co-Segmentation by Co-Peak Search and Co-Saliency Detection | Kuang-Jui Hsu; Yen-Yu Lin; Yung-Yu Chuang | 153 | |
59 | 10:00 | Improving Semantic Segmentation via Video Propagation and Label Relaxation | Yi Zhu; Karan Sapra; Fitsum A. Reda; Kevin J. Shih; Shawn Newsam; Andrew Tao; Bryan Catanzaro | 4250 | |
60 | 10:00 | Accel: A Corrective Fusion Network for Efficient Semantic Segmentation on Video | Samvit Jain; Xin Wang; Joseph E. Gonzalez | 3121 | |
61 | 10:00 | Shape2Motion: Joint Analysis of Motion Parts and Attributes From 3D Shapes | Xiaogang Wang; Bin Zhou; Yahao Shi; Xiaowu Chen; Qinping Zhao; Kai Xu | 4311 | |
62 | 10:00 | Semantic Correlation Promoted Shape-Variant Context for Segmentation | Henghui Ding; Xudong Jiang; Bing Shuai; Ai Qun Liu; Gang Wang | 256 | |
63 | 10:00 | Relation-Shape Convolutional Neural Network for Point Cloud Analysis | Yongcheng Liu; Bin Fan; Shiming Xiang; Chunhong Pan | 2493 | |
64 | 10:00 | Enhancing Diversity of Defocus Blur Detectors via Cross-Ensemble Network | Wenda Zhao; Bowen Zheng; Qiuhua Lin; Huchuan Lu | 1233 | |
65 | 10:00 | BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames | Brent A. Griffin; Jason J. Corso | 2793 | |
66 | 10:00 | Collaborative Global-Local Networks for Memory-Efficient Segmentation of Ultra-High Resolution Images | Wuyang Chen; Ziyu Jiang; Zhangyang Wang; Kexin Cui; Xiaoning Qian | 3074 | |
67 | 10:00 | Efficient Parameter-Free Clustering Using First Neighbor Relations | Saquib Sarfraz; Vivek Sharma; Rainer Stiefelhagen | 3203 | |
68 | 10:00 | Learning Personalized Modular Network Guided by Structured Knowledge | Xiaodan Liang | 3859 | |
69 | 10:00 | A Generative Appearance Model for End-To-End Video Object Segmentation | Joakim Johnander; Martin Danelljan; Emil Brissman; Fahad Shahbaz Khan; Michael Felsberg | 5616 | |
70 | 10:00 | FEELVOS: Fast End-To-End Embedding Learning for Video Object Segmentation | Paul Voigtlaender; Yuning Chai; Florian Schroff; Hartwig Adam; Bastian Leibe; Liang-Chieh Chen | 3335 | |
71 | 10:00 | PartNet: A Recursive Part Decomposition Network for Fine-Grained and Hierarchical Shape Segmentation | Fenggen Yu; Kun Liu; Yan Zhang; Chenyang Zhu; Kai Xu | 4313 | |
72 | 10:00 | Learning Multi-Class Segmentations From Single-Class Datasets | Konstantin Dmitriev; Arie E. Kaufman | 4874 | |
73 | 10:00 | Convolutional Recurrent Network for Road Boundary Extraction | Justin Liang; Namdar Homayounfar; Wei-Chiu Ma; Shenlong Wang; Raquel Urtasun | 4944 | |
74 | 10:00 | DFANet: Deep Feature Aggregation for Real-Time Semantic Segmentation | Hanchao Li; Pengfei Xiong; Haoqiang Fan; Jian Sun | 5215 | |
75 | 10:00 | A Cross-Season Correspondence Dataset for Robust Semantic Segmentation | Måns Larsson; Erik Stenborg; Lars Hammarstrand; Marc Pollefeys; Torsten Sattler; Fredrik Kahl | 5332 | |
76 | 10:00 | ManTra-Net: Manipulation Tracing Network for Detection and Localization of Image Forgeries With Anomalous Features | Yue Wu; Wael AbdAlmageed; Premkumar Natarajan | 5356 | |
Statistics, Physics, Theory, & Datasets | 77 | 10:00 | Divergence Triangle for Joint Training of Generator Model, Energy-Based Model, and Inferential Model | Tian Han; Erik Nijkamp; Xiaolin Fang; Mitch Hill; Song-Chun Zhu; Ying Nian Wu | 2609 |
78 | 10:00 | Image Deformation Meta-Networks for One-Shot Learning | Zitian Chen; Yanwei Fu; Yu-Xiong Wang; Lin Ma; Wei Liu; Martial Hebert | 2829 | |
79 | 10:00 | Online High Rank Matrix Completion | Jicong Fan; Madeleine Udell | 4917 | |
80 | 10:00 | Multispectral Imaging for Fine-Grained Recognition of Powders on Complex Backgrounds | Tiancheng Zhi; Bernardo R. Pires; Martial Hebert; Srinivasa G. Narasimhan | 1274 | |
81 | 10:00 | ContactDB: Analyzing and Predicting Grasp Contact via Thermal Imaging | Samarth Brahmbhatt; Cusuh Ham; Charles C. Kemp; James Hays | 2138 | |
82 | 10:00 | Robust Subspace Clustering With Independent and Piecewise Identically Distributed Noise Modeling | Yuanman Li; Jiantao Zhou; Xianwei Zheng; Jinyu Tian; Yuan Yan Tang | 4535 | |
83 | 10:00 | What Correspondences Reveal About Unknown Camera and Motion Models? | Thomas Probst; Ajad Chhatkuli; Danda Pani Paudel; Luc Van Gool | 4185 | |
84 | 10:00 | Self-Calibrating Deep Photometric Stereo Networks | Guanying Chen; Kai Han; Boxin Shi; Yasuyuki Matsushita; Kwan-Yee K. Wong | 1504 | |
85 | 10:00 | Argoverse: 3D Tracking and Forecasting With Rich Maps | Ming-Fang Chang; John Lambert; Patsorn Sangkloy; Jagjeet Singh; Slawomir Bak; Andrew Hartnett; De Wang; Peter Carr; Simon Lucey; Deva Ramanan; James Hays | 4994 | |
86 | 10:00 | Side Window Filtering | Hui Yin; Yuanhao Gong; Guoping Qiu | 5176 | |
87 | 10:00 | Defense Against Adversarial Images Using Web-Scale Nearest-Neighbor Search | Abhimanyu Dubey; Laurens van der Maaten; Zeki Yalniz; Yixuan Li; Dhruv Mahajan | 2319 | |
88 | 10:00 | Incremental Object Learning From Contiguous Views | Stefan Stojanov; Samarth Mishra; Ngoc Anh Thai; Nikhil Dhanda; Ahmad Humayun; Chen Yu; Linda B. Smith; James M. Rehg | 1519 | |
89 | 10:00 | IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition | Xiaoping Wu; Chi Zhan; Yu-Kun Lai; Ming-Ming Cheng; Jufeng Yang | 3627 | |
90 | 10:00 | CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification | Zheng Tang; Milind Naphade; Ming-Yu Liu; Xiaodong Yang; Stan Birchfield; Shuo Wang; Ratnesh Kumar; David Anastasiu; Jenq-Neng Hwang | 6334 | |
91 | 10:00 | Social-IQ: A Question Answering Benchmark for Artificial Social Intelligence | Amir Zadeh; Michael Chan; Paul Pu Liang; Edmund Tong; Louis-Philippe Morency | 6439 | |
92 | 10:00 | On Zero-Shot Recognition of Generic Objects | Tristan Hascoet; Yasuo Ariki; Tetsuya Takiguchi | 4302 | |
93 | 10:00 | Explicit Bias Discovery in Visual Question Answering Models | Varun Manjunatha; Nirat Saini; Larry S. Davis | 4371 | |
94 | 10:00 | REPAIR: Removing Representation Bias by Dataset Resampling | Yi Li; Nuno Vasconcelos | 4423 | |
95 | 10:00 | Label Efficient Semi-Supervised Learning via Graph Filtering | Qimai Li; Xiao-Ming Wu; Han Liu; Xiaotong Zhang; Zhichao Guan | 4744 | |
96 | 10:00 | MVTec AD — A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection | Paul Bergmann; Michael Fauser; David Sattlegger; Carsten Steger | 4769 | |
97 | 10:00 | ABC: A Big CAD Model Dataset for Geometric Deep Learning | Sebastian Koch; Albert Matveev; Zhongshi Jiang; Francis Williams; Alexey Artemov; Evgeny Burnaev; Marc Alexa; Denis Zorin; Daniele Panozzo | 4878 | |
98 | 10:00 | Tightness-Aware Evaluation Protocol for Scene Text Detection | Yuliang Liu; Lianwen Jin; Zecheng Xie; Canjie Luo; Shuaitao Zhang; Lele Xie | 4946 | |
3D Multiview | 99 | 10:00 | PointConv: Deep Convolutional Networks on 3D Point Clouds | Wenxuan Wu; Zhongang Qi; Li Fuxin | 4220 |
100 | 10:00 | Octree Guided CNN With Spherical Kernels for 3D Point Clouds | Huan Lei; Naveed Akhtar; Ajmal Mian | 4334 | |
101 | 10:00 | VITAMIN-E: VIsual Tracking and MappINg With Extremely Dense Feature Points | Masashi Yokozuka; Shuji Oishi; Simon Thompson; Atsuhiko Banno | 4463 | |
102 | 10:00 | Conditional Single-View Shape Generation for Multi-View Stereo Reconstruction | Yi Wei; Shaohui Liu; Wang Zhao; Jiwen Lu | 4478 | |
103 | 10:00 | Learning to Adapt for Stereo | Alessio Tonioni; Oscar Rahnama; Thomas Joy; Luigi Di Stefano; Thalaiyasingam Ajanthan; Philip H.S. Torr | 4522 | |
104 | 10:00 | 3D Appearance Super-Resolution With Deep Learning | Yawei Li; Vagia Tsiminaki; Radu Timofte; Marc Pollefeys; Luc Van Gool | 4851 | |
105 | 10:00 | Radial Distortion Triangulation | Zuzana Kukelova; Viktor Larsson | 4955 | |
106 | 10:00 | Robust Point Cloud Based Reconstruction of Large-Scale Outdoor Scenes | Ziquan Lan; Zi Jian Yew; Gim Hee Lee | 5331 | |
3D Single View & RGBD | 107 | 10:00 | Minimal Solvers for Mini-Loop Closures in 3D Multi-Scan Alignment | Pedro Miraldo; Surojit Saha; Srikumar Ramalingam | 4180 |
108 | 10:00 | Volumetric Capture of Humans With a Single RGBD Camera via Semi-Parametric Learning | Rohit Pandey; Anastasia Tkach; Shuoran Yang; Pavel Pidlypenskyi; Jonathan Taylor; Ricardo Martin-Brualla; Andrea Tagliasacchi; George Papandreou; Philip Davidson; Cem Keskin; Shahram Izadi; Sean Fanello | 4204 | |
109 | 10:00 | Joint Face Detection and Facial Motion Retargeting for Multiple Faces | Bindita Chaudhuri; Noranart Vesdapunt; Baoyuan Wang | 4442 | |
110 | 10:00 | Monocular Depth Estimation Using Relative Depth Maps | Jae-Han Lee; Chang-Su Kim | 4450 | |
111 | 10:00 | Unsupervised Primitive Discovery for Improved 3D Generative Modeling | Salman H. Khan; Yulan Guo; Munawar Hayat; Nick Barnes | 4635 | |
112 | 10:00 | Learning to Explore Intrinsic Saliency for Stereoscopic Video | Qiudan Zhang; Xu Wang; Shiqi Wang; Shikai Li; Sam Kwong; Jianmin Jiang | 4679 | |
113 | 10:00 | Spherical Regression: Learning Viewpoints, Surface Normals and 3D Rotations on N-Spheres | Shuai Liao; Efstratios Gavves; Cees G. M. Snoek | 4686 | |
114 | 10:00 | Refine and Distill: Exploiting Cycle-Inconsistency and Knowledge Distillation for Unsupervised Monocular Depth Estimation | Andrea Pilzer; Stéphane Lathuilière; Nicu Sebe; Elisa Ricci | 4818 | |
115 | 10:00 | Learning View Priors for Single-View 3D Reconstruction | Hiroharu Kato; Tatsuya Harada | 4999 | |
116 | 10:00 | Geometry-Aware Symmetric Domain Adaptation for Monocular Depth Estimation | Shanshan Zhao; Huan Fu; Mingming Gong; Dacheng Tao | 5090 | |
117 | 10:00 | Learning Monocular Depth Estimation Infusing Traditional Stereo Knowledge | Fabio Tosi; Filippo Aleotti; Matteo Poggi; Stefano Mattoccia | 5256 | |
118 | 10:00 | SIGNet: Semantic Instance Aided Unsupervised 3D Geometry Perception | Yue Meng; Yongxi Lu; Aman Raj; Samuel Sunarjo; Rui Guo; Tara Javidi; Gaurav Bansal; Dinesh Bharadia | 5263 | |
Face & Body | 119 | 10:00 | 3D Guided Fine-Grained Face Manipulation | Zhenglin Geng; Chen Cao; Sergey Tulyakov | 4183 |
120 | 10:00 | Neuro-Inspired Eye Tracking With Eye Movement Dynamics | Kang Wang; Hui Su; Qiang Ji | 4206 | |
121 | 10:00 | Facial Emotion Distribution Learning by Exploiting Low-Rank Label Correlations Locally | Xiuyi Jia; Xiang Zheng; Weiwei Li; Changqing Zhang; Zechao Li | 4436 | |
122 | 10:00 | Unsupervised Face Normalization With Extreme Pose and Expression in the Wild | Yichen Qian; Weihong Deng; Jiani Hu | 4531 | |
123 | 10:00 | Semantic Component Decomposition for Face Attribute Manipulation | Ying-Cong Chen; Xiaohui Shen; Zhe Lin; Xin Lu; I-Ming Pao; Jiaya Jia | 3603 | |
124 | 10:00 | R³ Adversarial Network for Cross Model Face Recognition | Ken Chen; Yichao Wu; Haoyu Qin; Ding Liang; Xuebo Liu; Junjie Yan | 4625 | |
125 | 10:00 | Disentangling Latent Hands for Image Synthesis and Pose Estimation | Linlin Yang; Angela Yao | 4791 | |
126 | 10:00 | Generating Multiple Hypotheses for 3D Human Pose Estimation With Mixture Density Network | Chen Li; Gim Hee Lee | 5180 | |
127 | 10:00 | CrossInfoNet: Multi-Task Information Sharing Based Hand Pose Estimation | Kuo Du; Xiangbo Lin; Yi Sun; Xiaohong Ma | 5207 | |
128 | 10:00 | P2SGrad: Refined Gradients for Optimizing Deep Face Models | Xiao Zhang; Rui Zhao; Junjie Yan; Mengya Gao; Yu Qiao; Xiaogang Wang; Hongsheng Li | 5285 | |
Action & Video | 129 | 10:00 | Action Recognition From Single Timestamp Supervision in Untrimmed Videos | Davide Moltisanti; Sanja Fidler; Dima Damen | 4560 |
130 | 10:00 | Time-Conditioned Action Anticipation in One Shot | Qiuhong Ke; Mario Fritz; Bernt Schiele | 4749 | |
131 | 10:00 | Dance With Flow: Two-In-One Stream Action Detection | Jiaojiao Zhao; Cees G. M. Snoek | 4882 | |
132 | 10:00 | Representation Flow for Action Recognition | AJ Piergiovanni; Michael S. Ryoo | 4973 | |
133 | 10:00 | LSTA: Long Short-Term Attention for Egocentric Action Recognition | Swathikiran Sudhakaran; Sergio Escalera; Oswald Lanz | 5009 | |
134 | 10:00 | Learning Actor Relation Graphs for Group Activity Recognition | Jianchao Wu; Limin Wang; Li Wang; Jie Guo; Gangshan Wu | 5028 | |
135 | 10:00 | A Structured Model for Action Detection | Yubo Zhang; Pavel Tokmakov; Martial Hebert; Cordelia Schmid | 5156 | |
136 | 10:00 | Out-Of-Distribution Detection for Generalized Zero-Shot Action Recognition | Devraj Mandal; Sanath Narayan; Sai Kumar Dwivedi; Vikram Gupta; Shuaib Ahmed; Fahad Shahbaz Khan; Ling Shao | 5185 | |
Motion & Biometrics | 137 | 10:00 | Object Discovery in Videos as Foreground Motion Clustering | Christopher Xie; Yu Xiang; Zaid Harchaoui; Dieter Fox | 4179 |
138 | 10:00 | Towards Natural and Accurate Future Motion Prediction of Humans and Animals | Zhenguang Liu; Shuang Wu; Shuyuan Jin; Qi Liu; Shijian Lu; Roger Zimmermann; Li Cheng | 4338 | |
139 | 10:00 | Automatic Face Aging in Videos via Deep Reinforcement Learning | Chi Nhan Duong; Khoa Luu; Kha Gia Quach; Nghia Nguyen; Eric Patterson; Tien D. Bui; Ngan Le | 4366 | |
140 | 10:00 | Multi-Adversarial Discriminative Deep Domain Generalization for Face Presentation Attack Detection | Rui Shao; Xiangyuan Lan; Jiawei Li; Pong C. Yuen | 5369 | |
Synthesis | 141 | 10:00 | A Content Transformation Block for Image Style Transfer | Dmytro Kotovenko; Artsiom Sanakoyeu; Pingchuan Ma; Sabine Lang; Björn Ommer | 4284 |
142 | 10:00 | BeautyGlow: On-Demand Makeup Transfer Framework With Reversible Generative Network | Hung-Jen Chen; Ka-Ming Hui; Szu-Yu Wang; Li-Wu Tsao; Hong-Han Shuai; Wen-Huang Cheng | 4378 | |
143 | 10:00 | Style Transfer by Relaxed Optimal Transport and Self-Similarity | Nicholas Kolkin; Jason Salavon; Gregory Shakhnarovich | 4740 | |
144 | 10:00 | Inserting Videos Into Videos | Donghoon Lee; Tomas Pfister; Ming-Hsuan Yang | 4906 | |
145 | 10:00 | Learning Image and Video Compression Through Spatial-Temporal Energy Compaction | Zhengxue Cheng; Heming Sun; Masaru Takeuchi; Jiro Katto | 5027 | |
146 | 10:00 | Event-Based High Dynamic Range Image and Very High Frame Rate Video Generation Using Conditional Generative Adversarial Networks | Lin Wang; S. Mohammad Mostafavi I.; Yo-Sung Ho; Kuk-Jin Yoon | 5141 | |
147 | 10:00 | Enhancing TripleGAN for Semi-Supervised Conditional Instance Synthesis and Classification | Si Wu; Guangchang Deng; Jichang Li; Rui Li; Zhiwen Yu; Hau-San Wong | 5293 | |
Computational Photography & Graphics | 148 | 10:00 | Capture, Learning, and Synthesis of 3D Speaking Styles | Daniel Cudeiro; Timo Bolkart; Cassidy Laidlaw; Anurag Ranjan; Michael J. Black | 4443 |
149 | 10:00 | Nesti-Net: Normal Estimation for Unstructured 3D Point Clouds Using Convolutional Neural Networks | Yizhak Ben-Shabat; Michael Lindenbaum; Anath Fischer | 4550 | |
150 | 10:00 | Ray-Space Projection Model for Light Field Camera | Qi Zhang; Jinbo Ling; Qing Wang; Jingyi Yu | 4624 | |
151 | 10:00 | Deep Geometric Prior for Surface Reconstruction | Francis Williams; Teseo Schneider; Claudio Silva; Denis Zorin; Joan Bruna; Daniele Panozzo | 4806 | |
152 | 10:00 | Analysis of Feature Visibility in Non-Line-Of-Sight Measurements | Xiaochun Liu; Sebastian Bauer; Andreas Velten | 4814 | |
153 | 10:00 | Hyperspectral Imaging With Random Printed Mask | Yuanyuan Zhao; Hui Guo; Zhan Ma; Xun Cao; Tao Yue; Xuemei Hu | 5081 | |
154 | 10:00 | All-Weather Deep Outdoor Lighting Estimation | Jinsong Zhang; Kalyan Sunkavalli; Yannick Hold-Geoffroy; Sunil Hadap; Jonathan Eisenman; Jean-François Lalonde | 5128 | |
Low-Level & Optimization | 155 | 10:00 | A Variational EM Framework With Adaptive Edge Selection for Blind Motion Deblurring | Liuge Yang; Hui Ji | 4186 |
156 | 10:00 | Viewport Proposal CNN for 360° Video Quality Assessment | Chen Li; Mai Xu; Lai Jiang; Shanyi Zhang; Xiaoming Tao | 4482 | |
157 | 10:00 | Beyond Gradient Descent for Regularized Segmentation Losses | Dmitrii Marin; Meng Tang; Ismail Ben Ayed; Yuri Boykov | 4685 | |
158 | 10:00 | MAGSAC: Marginalizing Sample Consensus | Daniel Barath; Jiří Matas; Jana Noskova | 4694 | |
159 | 10:00 | Understanding and Visualizing Deep Visual Saliency Models | Sen He; Hamed R. Tavakoli; Ali Borji; Yang Mi; Nicolas Pugeault | 4725 | |
160 | 10:00 | Divergence Prior and Vessel-Tree Reconstruction | Zhongwen Zhang; Dmitrii Marin; Egor Chesakov; Marc Moreno Maza; Maria Drangova; Yuri Boykov | 4783 | |
161 | 10:00 | Unsupervised Domain-Specific Deblurring via Disentangled Representations | Boyu Lu; Jun-Cheng Chen; Rama Chellappa | 4819 | |
162 | 10:00 | Douglas-Rachford Networks: Learning Both the Image Prior and Data Fidelity Terms for Blind Image Deconvolution | Raied Aljadaany; Dipan K. Pal; Marios Savvides | 4897 | |
163 | 10:00 | Speed Invariant Time Surface for Learning to Detect Corner Points With Event-Based Cameras | Jacques Manderscheid; Amos Sironi; Nicolas Bourdis; Davide Migliore; Vincent Lepetit | 4920 | |
164 | 10:00 | Training Deep Learning Based Image Denoisers From Undersampled Measurements Without Ground Truth and Without Image Prior | Magauiya Zhussip; Shakarim Soltanayev; Se Young Chun | 4975 | |
165 | 10:00 | A Variational Pan-Sharpening With Local Gradient Constraints | Xueyang Fu; Zihuang Lin; Yue Huang; Xinghao Ding | 5294 | |
Scenes & Representation | 166 | 10:00 | F-VAEGAN-D2: A Feature Generating Framework for Any-Shot Learning | Yongqin Xian; Saurabh Sharma; Bernt Schiele; Zeynep Akata | 3745 |
167 | 10:00 | Sliced Wasserstein Discrepancy for Unsupervised Domain Adaptation | Chen-Yu Lee; Tanmay Batra; Mohammad Haris Baig; Daniel Ulbricht | 4238 | |
168 | 10:00 | Graph Attention Convolution for Point Cloud Semantic Segmentation | Lei Wang; Yuchun Huang; Yaolin Hou; Shenman Zhang; Jie Shan | 4649 | |
169 | 10:00 | Normalized Diversification | Shaohui Liu; Xiao Zhang; Jianqiao Wangni; Jianbo Shi | 4658 | |
170 | 10:00 | Learning to Localize Through Compressed Binary Maps | Xinkai Wei; Ioan Andrei Bârsan; Shenlong Wang; Julieta Martinez; Raquel Urtasun | 4890 | |
171 | 10:00 | A Parametric Top-View Representation of Complex Road Scenes | Ziyan Wang; Buyu Liu; Samuel Schulter; Manmohan Chandraker | 4914 | |
172 | 10:00 | Self-Supervised Spatiotemporal Learning via Video Clip Order Prediction | Dejing Xu; Jun Xiao; Zhou Zhao; Jian Shao; Di Xie; Yueting Zhuang | 5096 | |
173 | 10:00 | Superquadrics Revisited: Learning 3D Shape Parsing Beyond Cuboids | Despoina Paschalidou; Ali Osman Ulusoy; Andreas Geiger | 5106 | |
174 | 10:00 | Unsupervised Disentangling of Appearance and Geometry by Deformable Generator Network | Xianglei Xing; Tian Han; Ruiqi Gao; Song-Chun Zhu; Ying Nian Wu | 5227 | |
175 | 10:00 | Self-Supervised Representation Learning by Rotation Feature Decoupling | Zeyu Feng; Chang Xu; Dacheng Tao | 5242 | |
176 | 10:00 | Weakly Supervised Deep Image Hashing Through Tag Embeddings | Vijetha Gattupalli; Yaoxin Zhuo; Baoxin Li | 5248 | |
177 | 10:00 | Improved Road Connectivity by Joint Learning of Orientation and Segmentation | Anil Batra; Suriya Singh; Guan Pang; Saikat Basu; C.V. Jawahar; Manohar Paluri | 5278 | |
178 | 10:00 | Deep Supervised Cross-Modal Retrieval | Liangli Zhen; Peng Hu; Xu Wang; Dezhong Peng | 5287 | |
179 | 10:00 | A Theoretically Sound Upper Bound on the Triplet Loss for Improving the Efficiency of Deep Distance Metric Learning | Thanh-Toan Do; Toan Tran; Ian Reid; Vijay Kumar; Tuan Hoang; Gustavo Carneiro | 5310 | |
180 | 10:00 | Data Representation and Learning With Graph Diffusion-Embedding Networks | Bo Jiang; Doudou Lin; Jin Tang; Bin Luo | 5341 | |
Language & Reasoning | 181 | 10:00 | Video Relationship Reasoning Using Gated Spatio-Temporal Energy Graph | Yao-Hung Hubert Tsai; Santosh Divvala; Louis-Philippe Morency; Ruslan Salakhutdinov; Ali Farhadi | 4277 |
182 | 10:00 | Image-Question-Answer Synergistic Network for Visual Dialog | Dalu Guo; Chang Xu; Dacheng Tao | 4385 | |
183 | 10:00 | Not All Frames Are Equal: Weakly-Supervised Video Grounding With Contextual Similarity and Visual Clustering Losses | Jing Shi; Jia Xu; Boqing Gong; Chenliang Xu | 4732 | |
184 | 10:00 | Inverse Cooking: Recipe Generation From Food Images | Amaia Salvador; Michal Drozdzal; Xavier Giro-i-Nieto; Adriana Romero | 4781 | |
185 | 10:00 | Adversarial Semantic Alignment for Improved Image Captions | Pierre Dognin; Igor Melnyk; Youssef Mroueh; Jerret Ross; Tom Sercu | 4833 | |
186 | 10:00 | Answer Them All! Toward Universal Visual Question Answering Models | Robik Shrestha; Kushal Kafle; Christopher Kanan | 4919 | |
187 | 10:00 | Unsupervised Multi-Modal Neural Machine Translation | Yuanhang Su; Kai Fan; Nguyen Bach; C.-C. Jay Kuo; Fei Huang | 4967 | |
188 | 10:00 | Multi-Task Learning of Hierarchical Vision-Language Representation | Duy-Kien Nguyen; Takayuki Okatani | 4987 | |
189 | 10:00 | Cross-Modal Self-Attention Network for Referring Image Segmentation | Linwei Ye; Mrigank Rochan; Zhi Liu; Yang Wang | 5052 | |
Applications, Medical, & Robotics | 190 | 10:00 | Holistic and Comprehensive Annotation of Clinically Significant Findings on Diverse CT Images: Learning From Radiology Reports and Label Ontology | Ke Yan; Yifan Peng; Veit Sandfort; Mohammadhadi Bagheri; Zhiyong Lu; Ronald M. Summers | 2180 |
191 | 10:00 | Robust Histopathology Image Analysis: To Label or to Synthesize? | Le Hou; Ayush Agarwal; Dimitris Samaras; Tahsin M. Kurc; Rajarsi R. Gupta; Joel H. Saltz | 4246 | |
192 | 10:00 | Data Augmentation Using Learned Transformations for One-Shot Medical Image Segmentation | Amy Zhao; Guha Balakrishnan; Frédo Durand; John V. Guttag; Adrian V. Dalca | 6477 | |
193 | 10:00 | Shifting More Attention to Video Salient Object Detection | Deng-Ping Fan; Wenguan Wang; Ming-Ming Cheng; Jianbing Shen | 1853 | |
194 | 10:00 | Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration | De-An Huang; Suraj Nair; Danfei Xu; Yuke Zhu; Animesh Garg; Li Fei-Fei; Silvio Savarese; Juan Carlos Niebles | 864 | |
195 | 10:00 | Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry | Fei Xue; Xin Wang; Shunkai Li; Qiuyuan Wang; Junqiu Wang; Hongbin Zha | 1296 | |
196 | 10:00 | Image Generation From Layout | Bo Zhao; Lili Meng; Weidong Yin; Leonid Sigal | 3139 | |
197 | 10:00 | Multimodal Explanations by Predicting Counterfactuality in Videos | Atsushi Kanehira; Kentaro Takemoto; Sho Inayoshi; Tatsuya Harada | 4603 | |
198 | 10:00 | Learning to Explain With Complemental Examples | Atsushi Kanehira; Tatsuya Harada | 4606 | |
199 | 10:00 | HAQ: Hardware-Aware Automated Quantization With Mixed Precision | Kuan Wang; Zhijian Liu; Yujun Lin; Ji Lin; Song Han | 3441 | |
200 | 10:00 | Content Authentication for Neural Imaging Pipelines: End-To-End Optimization of Photo Provenance in Complex Distribution Channels | Pawel Korus; Nasir Memon | 4965 | |
201 | 10:00 | Inverse Procedural Modeling of Knitwear | Elena Trunz; Sebastian Merzbach; Jonathan Klein; Thomas Schulze; Michael Weinmann; Reinhard Klein | 5712 | |
202 | 10:00 | Estimating 3D Motion and Forces of Person-Object Interactions From Monocular Video | Zongmian Li; Jiri Sedlar; Justin Carpentier; Ivan Laptev; Nicolas Mansard; Josef Sivic | 2857 | |
203 | 10:00 | DeepMapping: Unsupervised Map Estimation From Multiple Point Clouds | Li Ding; Chen Feng | 4235 | |
204 | 10:00 | End-To-End Interpretable Neural Motion Planner | Wenyuan Zeng; Wenjie Luo; Simon Suo; Abbas Sadat; Bin Yang; Sergio Casas; Raquel Urtasun | 4880 | |
205 | 10:00 | DuDoNet: Dual Domain Network for CT Metal Artifact Reduction | Wei-An Lin; Haofu Liao; Cheng Peng; Xiaohang Sun; Jingdan Zhang; Jiebo Luo; Rama Chellappa; Shaohua Kevin Zhou | 4261 | |
206 | 10:00 | Fast Spatio-Temporal Residual Network for Video Super-Resolution | Sheng Li; Fengxiang He; Bo Du; Lefei Zhang; Yonghao Xu; Dacheng Tao | 4384 | |
207 | 10:00 | Complete the Look: Scene-Based Complementary Product Recommendation | Wang-Cheng Kang; Eric Kim; Jure Leskovec; Charles Rosenberg; Julian McAuley | 4419 | |
208 | 10:00 | Selective Sensor Fusion for Neural Visual-Inertial Odometry | Changhao Chen; Stefano Rosa; Yishu Miao; Chris Xiaoxuan Lu; Wei Wu; Andrew Markham; Niki Trigoni | 4628 | |
209 | 10:00 | Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes | Chengquan Zhang; Borong Liang; Zuming Huang; Mengyi En; Junyu Han; Errui Ding; Xinghao Ding | 4675 | |
210 | 10:00 | Learning Binary Code for Personalized Fashion Recommendation | Zhi Lu; Yang Hu; Yunchao Jiang; Yan Chen; Bing Zeng | 4765 | |
211 | 10:00 | Attention Based Glaucoma Detection: A Large-Scale Database and CNN Model | Liu Li; Mai Xu; Xiaofei Wang; Lai Jiang; Hanruo Liu | 4789 | |
212 | 10:00 | Privacy Protection in Street-View Panoramas Using Depth and Multi-View Imagery | Ries Uittenbogaard; Clint Sebastian; Julien Vijverberg; Bas Boom; Dariu M. Gavrila; Peter H.N. de With | 4829 | |
213 | 10:00 | Grounding Human-To-Vehicle Advice for Self-Driving Vehicles | Jinkyu Kim; Teruhisa Misu; Yi-Ting Chen; Ashish Tawari; John Canny | 4838 | |
214 | 10:00 | Multi-Step Prediction of Occupancy Grid Maps With Recurrent Neural Networks | Nima Mohajerin; Mohsen Rohani | 4855 | |
215 | 10:00 | Connecting Touch and Vision via Cross-Modal Prediction | Yunzhu Li; Jun-Yan Zhu; Russ Tedrake; Antonio Torralba | 4903 | |
216 | 10:00 | X2CT-GAN: Reconstructing CT From Biplanar X-Rays With Generative Adversarial Networks | Xingde Ying; Heng Guo; Kai Ma; Jian Wu; Zhengxin Weng; Yefeng Zheng | 5238 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 13:30 | Practical Full Resolution Learned Lossless Image Compression | Fabian Mentzer; Eirikur Agustsson; Michael Tschannen; Radu Timofte; Luc Van Gool | 3041 |
2 | 13:35 | Image-To-Image Translation via Group-Wise Deep Whitening-And-Coloring Transformation | Wonwoong Cho; Sungha Choi; David Keetae Park; Inkyu Shin; Jaegul Choo | 6930 | |
3 | 13:40 | Max-Sliced Wasserstein Distance and Its Use for GANs | Ishan Deshpande; Yuan-Ting Hu; Ruoyu Sun; Ayis Pyrros; Nasir Siddiqui; Sanmi Koyejo; Zhizhen Zhao; David Forsyth; Alexander G. Schwing | 6265 | |
4 | 13:48 | Meta-Learning With Differentiable Convex Optimization | Kwonjoon Lee; Subhransu Maji; Avinash Ravichandran; Stefano Soatto | 3073 | |
5 | 13:53 | RePr: Improved Training of Convolutional Filters | Aaditya Prakash; James Storer; Dinei Florencio; Cha Zhang | 6188 | |
6 | 13:58 | Tangent-Normal Adversarial Regularization for Semi-Supervised Learning | Bing Yu; Jingfeng Wu; Jinwen Ma; Zhanxing Zhu | 6332 | |
7 | 14:06 | Auto-Encoding Scene Graphs for Image Captioning | Xu Yang; Kaihua Tang; Hanwang Zhang; Jianfei Cai | 3306 | |
8 | 14:11 | Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech | Aditya Deshpande; Jyoti Aneja; Liwei Wang; Alexander G. Schwing; David Forsyth | 6218 | |
9 | 14:16 | Attention Branch Network: Learning of Attention Mechanism for Visual Explanation | Hiroshi Fukui; Tsubasa Hirakawa; Takayoshi Yamashita; Hironobu Fujiyoshi | 6105 | |
10 | 14:24 | Cascaded Projection: End-To-End Network Compression and Acceleration | Breton Minnehan; Andreas Savakis | 3796 | |
11 | 14:29 | DeepCaps: Going Deeper With Capsule Networks | Jathushan Rajasegaran; Vinoj Jayasundara; Sandaru Jayasekara; Hirunima Jayasekara; Suranga Seneviratne; Ranga Rodrigo | 5721 | |
12 | 14:34 | FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search | Bichen Wu; Xiaoliang Dai; Peizhao Zhang; Yanghan Wang; Fei Sun; Yiming Wu; Yuandong Tian; Peter Vajda; Yangqing Jia; Kurt Keutzer | 6240 | |
13 | 14:42 | APDrawingGAN: Generating Artistic Portrait Drawings From Face Photos With Hierarchical GANs | Ran Yi; Yong-Jin Liu; Yu-Kun Lai; Paul L. Rosin | 5032 | |
14 | 14:47 | Constrained Generative Adversarial Networks for Interactive Image Generation | Eric Heim | 6431 | |
15 | 14:52 | WarpGAN: Automatic Caricature Generation | Yichun Shi; Debayan Deb; Anil K. Jain | 6807 | |
16 | 15:00 | Explainability Methods for Graph Convolutional Neural Networks | Phillip E. Pope; Soheil Kolouri; Mohammad Rostami; Charles E. Martin; Heiko Hoffmann | 5199 | |
17 | 15:05 | A Generative Adversarial Density Estimator | M. Ehsan Abbasnejad; Qinfeng Shi; Anton van den Hengel; Lingqiao Liu | 5502 | |
18 | 15:10 | SoDeep: A Sorting Deep Net to Learn Ranking Loss Surrogates | Martin Engilberge; Louis Chevallier; Patrick Pérez; Matthieu Cord | 5921 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Face & Body | 92 | 13:30 | High-Quality Face Capture Using Anatomical Muscles | Michael Bao; Matthew Cong; Stéphane Grabli; Ronald Fedkiw | 4 |
93 | 13:35 | FML: Face Model Learning From Videos | Ayush Tewari; Florian Bernard; Pablo Garrido; Gaurav Bharaj; Mohamed Elgharib; Hans-Peter Seidel; Patrick Pérez; Michael Zollhöfer; Christian Theobalt | 2408 | |
94 | 13:40 | AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations | Xiao Zhang; Rui Zhao; Yu Qiao; Xiaogang Wang; Hongsheng Li | 4483 | |
95 | 13:48 | 3D Hand Shape and Pose Estimation From a Single RGB Image | Liuhao Ge; Zhou Ren; Yuncheng Li; Zehao Xue; Yingying Wang; Jianfei Cai; Junsong Yuan | 387 | |
96 | 13:53 | 3D Hand Shape and Pose From Images in the Wild | Adnane Boukhayma; Rodrigo de Bem; Philip H.S. Torr | 647 | |
97 | 13:58 | Self-Supervised 3D Hand Pose Estimation Through Training by Fitting | Chengde Wan; Thomas Probst; Luc Van Gool; Angela Yao | 843 | |
98 | 14:06 | CrowdPose: Efficient Crowded Scenes Pose Estimation and a New Benchmark | Jiefeng Li; Can Wang; Hao Zhu; Yihuan Mao; Hao-Shu Fang; Cewu Lu | 1497 | |
99 | 14:11 | Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in a Triadic Interaction | Hanbyul Joo; Tomas Simon; Mina Cikara; Yaser Sheikh | 3233 | |
100 | 14:16 | HoloPose: Holistic 3D Human Reconstruction In-The-Wild | Rıza Alp Güler; Iasonas Kokkinos | 6947 | |
101 | 14:24 | Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation | Xipeng Chen; Kwan-Yee Lin; Wentao Liu; Chen Qian; Liang Lin | 2239 | |
102 | 14:29 | In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations | Ikhsanul Habibie; Weipeng Xu; Dushyant Mehta; Gerard Pons-Moll; Christian Theobalt | 3999 | |
103 | 14:34 | Slim DensePose: Thrifty Learning From Sparse Annotations and Motion Cues | Natalia Neverova; James Thewlis; Rıza Alp Güler; Iasonas Kokkinos; Andrea Vedaldi | 33 | |
104 | 14:42 | Self-Supervised Representation Learning From Videos for Facial Action Unit Detection | Yong Li; Jiabei Zeng; Shiguang Shan; Xilin Chen | 2859 | |
105 | 14:47 | Combining 3D Morphable Models: A Large Scale Face-And-Head Model | Stylianos Ploumpis; Haoyang Wang; Nick Pears; William A. P. Smith; Stefanos Zafeiriou | 4558 | |
106 | 14:52 | Boosting Local Shape Matching for Dense 3D Face Correspondence | Zhenfeng Fan; Xiyuan Hu; Chen Chen; Silong Peng | 4364 | |
107 | 15:00 | Unsupervised Part-Based Disentangling of Object Shape and Appearance | Dominik Lorenz; Leonard Bereska; Timo Milbich; Björn Ommer | 2886 | |
108 | 15:05 | Monocular Total Capture: Posing Face, Body, and Hands in the Wild | Donglai Xiang; Hanbyul Joo; Yaser Sheikh | 2922 | |
109 | 15:10 | Expressive Body Capture: 3D Hands, Face, and Body From a Single Image | Georgios Pavlakos; Vasileios Choutas; Nima Ghorbani; Timo Bolkart; Ahmed A. A. Osman; Dimitrios Tzionas; Michael J. Black | 3128 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Low-Level & Optimization | 147 | 13:30 | Neural RGB®D Sensing: Depth and Uncertainty From a Video Camera | Chao Liu; Jinwei Gu; Kihwan Kim; Srinivasa G. Narasimhan; Jan Kautz | 707 |
148 | 13:35 | DAVANet: Stereo Deblurring With View Aggregation | Shangchen Zhou; Jiawei Zhang; Wangmeng Zuo; Haozhe Xie; Jinshan Pan; Jimmy S. Ren | 1006 | |
149 | 13:40 | DVC: An End-To-End Deep Video Compression Framework | Guo Lu; Wanli Ouyang; Dong Xu; Xiaoyun Zhang; Chunlei Cai; Zhiyong Gao | 3657 | |
150 | 13:48 | SOSNet: Second Order Similarity Regularization for Local Descriptor Learning | Yurun Tian; Xin Yu; Bin Fan; Fuchao Wu; Huub Heijnen; Vassileios Balntas | 1098 | |
151 | 13:53 | “Double-DIP”: Unsupervised Image Decomposition via Coupled Deep-Image-Priors | Yosef Gandelsman; Assaf Shocher; Michal Irani | 2154 | |
152 | 13:58 | Unprocessing Images for Learned Raw Denoising | Tim Brooks; Ben Mildenhall; Tianfan Xue; Jiawen Chen; Dillon Sharlet; Jonathan T. Barron | 2579 | |
153 | 14:06 | Residual Networks for Light Field Image Super-Resolution | Shuo Zhang; Youfang Lin; Hao Sheng | 3342 | |
154 | 14:11 | Modulating Image Restoration With Continual Levels via Adaptive Feature Modification Layers | Jingwen He; Chao Dong; Yu Qiao | 3959 | |
155 | 14:16 | Second-Order Attention Network for Single Image Super-Resolution | Tao Dai; Jianrui Cai; Yongbing Zhang; Shu-Tao Xia; Lei Zhang | 5318 | |
156 | 14:24 | Devil Is in the Edges: Learning Semantic Boundaries From Noisy Annotations | David Acuna; Amlan Kar; Sanja Fidler | 2599 | |
157 | 14:29 | Path-Invariant Map Networks | Zaiwei Zhang; Zhenxiao Liang; Lemeng Wu; Xiaowei Zhou; Qixing Huang | 3097 | |
158 | 14:34 | FilterReg: Robust and Efficient Probabilistic Point-Set Registration Using Gaussian Filter and Twist Parameterization | Wei Gao; Russ Tedrake | 5608 | |
159 | 14:42 | Probabilistic Permutation Synchronization Using the Riemannian Structure of the Birkhoff Polytope | Tolga Birdal; Umut Şimşekli | 108 | |
160 | 14:47 | Lifting Vectorial Variational Problems: A Natural Formulation Based on Geometric Measure Theory and Discrete Exterior Calculus | Thomas Möllenhoff; Daniel Cremers | 190 | |
161 | 14:52 | A Sufficient Condition for Convergences of Adam and RMSProp | Fangyu Zou; Li Shen; Zequn Jie; Weizhong Zhang; Wei Liu | 1428 | |
162 | 15:00 | Guaranteed Matrix Completion Under Multiple Linear Transformations | Chao Li; Wei He; Longhao Yuan; Zhun Sun; Qibin Zhao | 5959 | |
163 | 15:05 | MAP Inference via Block-Coordinate Frank-Wolfe Algorithm | Paul Swoboda; Vladimir Kolmogorov | 4802 | |
164 | 15:10 | A Convex Relaxation for Multi-Graph Matching | Paul Swoboda; Dagmar Kainm¨uller; Ashkan Mokarian; Christian Theobalt; Florian Bernard | 5321 |
Session Title/Poster Group |
Poster # |
Presentation Time |
Title |
Author(s) |
Paper ID |
---|---|---|---|---|---|
Deep Learning | 1 | 15:20 | Practical Full Resolution Learned Lossless Image Compression | Fabian Mentzer; Eirikur Agustsson; Michael Tschannen; Radu Timofte; Luc Van Gool | 3041 |
2 | 15:20 | Image-To-Image Translation via Group-Wise Deep Whitening-And-Coloring Transformation | Wonwoong Cho; Sungha Choi; David Keetae Park; Inkyu Shin; Jaegul Choo | 6930 | |
3 | 15:20 | Max-Sliced Wasserstein Distance and Its Use for GANs | Ishan Deshpande; Yuan-Ting Hu; Ruoyu Sun; Ayis Pyrros; Nasir Siddiqui; Sanmi Koyejo; Zhizhen Zhao; David Forsyth; Alexander G. Schwing | 6265 | |
4 | 15:20 | Meta-Learning With Differentiable Convex Optimization | Kwonjoon Lee; Subhransu Maji; Avinash Ravichandran; Stefano Soatto | 3073 | |
5 | 15:20 | RePr: Improved Training of Convolutional Filters | Aaditya Prakash; James Storer; Dinei Florencio; Cha Zhang | 6188 | |
6 | 15:20 | Tangent-Normal Adversarial Regularization for Semi-Supervised Learning | Bing Yu; Jingfeng Wu; Jinwen Ma; Zhanxing Zhu | 6332 | |
7 | 15:20 | Auto-Encoding Scene Graphs for Image Captioning | Xu Yang; Kaihua Tang; Hanwang Zhang; Jianfei Cai | 3306 | |
8 | 15:20 | Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech | Aditya Deshpande; Jyoti Aneja; Liwei Wang; Alexander G. Schwing; David Forsyth | 6218 | |
9 | 15:20 | Attention Branch Network: Learning of Attention Mechanism for Visual Explanation | Hiroshi Fukui; Tsubasa Hirakawa; Takayoshi Yamashita; Hironobu Fujiyoshi | 6105 | |
10 | 15:20 | Cascaded Projection: End-To-End Network Compression and Acceleration | Breton Minnehan; Andreas Savakis | 3796 | |
11 | 15:20 | DeepCaps: Going Deeper With Capsule Networks | Jathushan Rajasegaran; Vinoj Jayasundara; Sandaru Jayasekara; Hirunima Jayasekara; Suranga Seneviratne; Ranga Rodrigo | 5721 | |
12 | 15:20 | FBNet: Hardware-Aware Efficient ConvNet Design via Differentiable Neural Architecture Search | Bichen Wu; Xiaoliang Dai; Peizhao Zhang; Yanghan Wang; Fei Sun; Yiming Wu; Yuandong Tian; Peter Vajda; Yangqing Jia; Kurt Keutzer | 6240 | |
13 | 15:20 | APDrawingGAN: Generating Artistic Portrait Drawings From Face Photos With Hierarchical GANs | Ran Yi; Yong-Jin Liu; Yu-Kun Lai; Paul L. Rosin | 5032 | |
14 | 15:20 | Constrained Generative Adversarial Networks for Interactive Image Generation | Eric Heim | 6431 | |
15 | 15:20 | WarpGAN: Automatic Caricature Generation | Yichun Shi; Debayan Deb; Anil K. Jain | 6807 | |
16 | 15:20 | Explainability Methods for Graph Convolutional Neural Networks | Phillip E. Pope; Soheil Kolouri; Mohammad Rostami; Charles E. Martin; Heiko Hoffmann | 5199 | |
17 | 15:20 | A Generative Adversarial Density Estimator | M. Ehsan Abbasnejad; Qinfeng Shi; Anton van den Hengel; Lingqiao Liu | 5502 | |
18 | 15:20 | SoDeep: A Sorting Deep Net to Learn Ranking Loss Surrogates | Martin Engilberge; Louis Chevallier; Patrick Pérez; Matthieu Cord | 5921 | |
19 | 15:20 | Pixel-Adaptive Convolutional Neural Networks | Hang Su; Varun Jampani; Deqing Sun; Orazio Gallo; Erik Learned-Miller; Jan Kautz | 89 | |
20 | 15:20 | Single-Frame Regularization for Temporally Stable CNNs | Gabriel Eilertsen; Rafal K. Mantiuk; Jonas Unger | 5526 | |
21 | 15:20 | An End-To-End Network for Generating Social Relationship Graphs | Arushi Goel; Keng Teck Ma; Cheston Tan | 5560 | |
22 | 15:20 | Meta-Learning Convolutional Neural Architectures for Multi-Target Concrete Defect Classification With the COncrete DEfect BRidge IMage Dataset | Martin Mundt; Sagnik Majumder; Sreenivas Murali; Panagiotis Panetsos; Visvanathan Ramesh | 5561 | |
23 | 15:20 | ECC: Platform-Independent Energy-Constrained Deep Neural Network Compression via a Bilinear Regression Model | Haichuan Yang; Yuhao Zhu; Ji Liu | 5758 | |
24 | 15:20 | SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization | Shijie Cao; Lingxiao Ma; Wencong Xiao; Chen Zhang; Yunxin Liu; Lintao Zhang; Lanshun Nie; Zhi Yang | 5807 | |
25 | 15:20 | Defending Against Adversarial Attacks by Randomized Diversification | Olga Taran; Shideh Rezaeifar; Taras Holotyak; Slava Voloshynovskiy | 5842 | |
26 | 15:20 | Rob-GAN: Generator, Discriminator, and Adversarial Attacker | Xuanqing Liu; Cho-Jui Hsieh | 5879 | |
27 | 15:20 | Learning From Noisy Labels by Regularized Estimation of Annotator Confusion | Ryutaro Tanno; Ardavan Saeedi; Swami Sankaranarayanan; Daniel C. Alexander; Nathan Silberman | 5939 | |
28 | 15:20 | Task-Free Continual Learning | Rahaf Aljundi; Klaas Kelchtermans; Tinne Tuytelaars | 6022 | |
29 | 15:20 | Importance Estimation for Neural Network Pruning | Pavlo Molchanov; Arun Mallya; Stephen Tyree; Iuri Frosio; Jan Kautz | 6102 | |
30 | 15:20 | Detecting Overfitting of Deep Generative Networks via Latent Recovery | Ryan Webster; Julien Rabin; Loïc Simon; Frédéric Jurie | 6103 | |
31 | 15:20 | Coloring With Limited Data: Few-Shot Colorization via Memory Augmented Networks | Seungjoo Yoo; Hyojin Bahng; Sunghyo Chung; Junsoo Lee; Jaehyuk Chang; Jaegul Choo | 6148 | |
32 | 15:20 | Characterizing and Avoiding Negative Transfer | Zirui Wang; Zihang Dai; Barnabás Póczos; Jaime Carbonell | 6169 | |
33 | 15:20 | Building Efficient Deep Neural Networks With Unitary Group Convolutions | Ritchie Zhao; Yuwei Hu; Jordan Dotzel; Christopher De Sa; Zhiru Zhang | 6173 | |
34 | 15:20 | Semi-Supervised Learning With Graph Learning-Convolutional Networks | Bo Jiang; Ziyan Zhang; Doudou Lin; Jin Tang; Bin Luo | 6180 | |
35 | 15:20 | Learning to Remember: A Synaptic Plasticity Driven Framework for Continual Learning | Oleksiy Ostapenko; Mihai Puscas; Tassilo Klein; Patrick Jähnichen; Moin Nabi | 6280 | |
36 | 15:20 | AIRD: Adversarial Learning Framework for Image Repurposing Detection | Ayush Jaiswal; Yue Wu; Wael AbdAlmageed; Iacopo Masi; Premkumar Natarajan | 6318 | |
37 | 15:20 | A Kernelized Manifold Mapping to Diminish the Effect of Adversarial Perturbations | Saeid Asgari Taghanaki; Kumar Abhishek; Shekoofeh Azizi; Ghassan Hamarneh | 6322 | |
38 | 15:20 | Trust Region Based Adversarial Attack on Neural Networks | Zhewei Yao; Amir Gholami; Peng Xu; Kurt Keutzer; Michael W. Mahoney | 6365 | |
39 | 15:20 | PEPSI : Fast Image Inpainting With Parallel Decoding Network | Min-cheol Sagong; Yong-goo Shin; Seung-wook Kim; Seung Park; Sung-jea Ko | 6375 | |
40 | 15:20 | Model-Blind Video Denoising via Frame-To-Frame Training | Thibaud Ehret; Axel Davy; Jean-Michel Morel; Gabriele Facciolo; Pablo Arias | 6455 | |
41 | 15:20 | End-To-End Efficient Representation Learning via Cascading Combinatorial Optimization | Yeonwoo Jeong; Yoonsung Kim; Hyun Oh Song | 6535 | |
42 | 15:20 | Sim-Real Joint Reinforcement Transfer for 3D Indoor Navigation | Fengda Zhu; Linchao Zhu; Yi Yang | 6570 | |
43 | 15:20 | ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation | Xiaoliang Dai; Peizhao Zhang; Bichen Wu; Hongxu Yin; Fei Sun; Yanghan Wang; Marat Dukhan; Yunqing Hu; Yiming Wu; Yangqing Jia; Peter Vajda; Matt Uyttendaele; Niraj K. Jha | 6629 | |
44 | 15:20 | Regularizing Activation Distribution for Training Binarized Deep Networks | Ruizhou Ding; Ting-Wu Chin; Zeye Liu; Diana Marculescu | 6646 | |
45 | 15:20 | Robustness Verification of Classification Deep Neural Networks via Linear Programming | Wang Lin; Zhengfeng Yang; Xin Chen; Qingye Zhao; Xiangkun Li; Zhiming Liu; Jifeng He | 6687 | |
46 | 15:20 | Additive Adversarial Learning for Unbiased Authentication | Jian Liang; Yuren Cao; Chenbin Zhang; Shiyu Chang; Kun Bai; Zenglin Xu | 6976 | |
47 | 15:20 | Simultaneously Optimizing Weight and Quantizer of Ternary Neural Network Using Truncated Gaussian Approximation | Zhezhi He; Deliang Fan | 7079 | |
48 | 15:20 | Adversarial Defense by Stratified Convolutional Sparse Coding | Bo Sun; Nian-Hsuan Tsai; Fangchen Liu; Ronald Yu; Hao Su | 7112 | |
Recognition | 49 | 15:20 | Exploring Object Relation in Mean Teacher for Cross-Domain Detection | Qi Cai; Yingwei Pan; Chong-Wah Ngo; Xinmei Tian; Lingyu Duan; Ting Yao | 5422 |
50 | 15:20 | Hierarchical Disentanglement of Discriminative Latent Features for Zero-Shot Learning | Bin Tong; Chao Wang; Martin Klinkigt; Yoshiyuki Kobayashi; Yuuichi Nonaka | 5582 | |
51 | 15:20 | R²GAN: Cross-Modal Recipe Retrieval With Generative Adversarial Network | Bin Zhu; Chong-Wah Ngo; Jingjing Chen; Yanbin Hao | 5694 | |
52 | 15:20 | Rethinking Knowledge Graph Propagation for Zero-Shot Learning | Michael Kampffmeyer; Yinbo Chen; Xiaodan Liang; Hao Wang; Yujia Zhang; Eric P. Xing | 5745 | |
53 | 15:20 | Learning to Learn Image Classifiers With Visual Analogy | Linjun Zhou; Peng Cui; Shiqiang Yang; Wenwu Zhu; Qi Tian | 5863 | |
54 | 15:20 | Where's Wally Now? Deep Generative and Discriminative Embeddings for Novelty Detection | Philippe Burlina; Neil Joshi; I-Jeng Wang | 5895 | |
55 | 15:20 | Weakly Supervised Image Classification Through Noise Regularization | Mengying Hu; Hu Han; Shiguang Shan; Xilin Chen | 5972 | |
56 | 15:20 | Data-Driven Neuron Allocation for Scale Aggregation Networks | Yi Li; Zhanghui Kuang; Yimin Chen; Wayne Zhang | 5986 | |
57 | 15:20 | Graphical Contrastive Losses for Scene Graph Parsing | Ji Zhang; Kevin J. Shih; Ahmed Elgammal; Andrew Tao; Bryan Catanzaro | 6075 | |
58 | 15:20 | Deep Transfer Learning for Multiple Class Novelty Detection | Pramuditha Perera; Vishal M. Patel | 6203 | |
59 | 15:20 | QATM: Quality-Aware Template Matching for Deep Learning | Jiaxin Cheng; Yue Wu; Wael AbdAlmageed; Premkumar Natarajan | 6347 | |
60 | 15:20 | Retrieval-Augmented Convolutional Neural Networks Against Adversarial Examples | Jake Zhao (Junbo); Kyunghyun Cho | 6528 | |
61 | 15:20 | Learning Cross-Modal Embeddings With Adversarial Networks for Cooking Recipes and Food Images | Hao Wang; Doyen Sahoo; Chenghao Liu; Ee-peng Lim; Steven C. H. Hoi | 6538 | |
62 | 15:20 | FastDraw: Addressing the Long Tail of Lane Detection by Adapting a Sequential Prediction Network | Jonah Philion | 7022 | |
63 | 15:20 | Weakly Supervised Video Moment Retrieval From Text Queries | Niluthpol Chowdhury Mithun; Sujoy Paul; Amit K. Roy-Chowdhury | 7086 | |
Segmentation, Grouping, & Shape | 64 | 15:20 | Content-Aware Multi-Level Guidance for Interactive Instance Segmentation | Soumajit Majumder; Angela Yao | 5499 |
65 | 15:20 | Greedy Structure Learning of Hierarchical Compositional Models | Adam Kortylewski; Aleksander Wieczorek; Mario Wieser; Clemens Blumer; Sonali Parbhoo; Andreas Morel-Forster; Volker Roth; Thomas Vetter | 5805 | |
66 | 15:20 | Interactive Full Image Segmentation by Considering All Regions Jointly | Eirikur Agustsson; Jasper R. R. Uijlings; Vittorio Ferrari | 6354 | |
67 | 15:20 | Learning Active Contour Models for Medical Image Segmentation | Xu Chen; Bryan M. Williams; Srinivasa R. Vallabhaneni; Gabriela Czanner; Rachel Williams; Yalin Zheng | 6376 | |
68 | 15:20 | Customizable Architecture Search for Semantic Segmentation | Yiheng Zhang; Zhaofan Qiu; Jingen Liu; Ting Yao; Dong Liu; Tao Mei | 6887 | |
Statistics, Physics, Theory, & Datasets | 69 | 15:20 | Local Features and Visual Words Emerge in Activations | Oriane Siméoni; Yannis Avrithis; Ondřej Chum | 5468 |
70 | 15:20 | Hyperspectral Image Super-Resolution With Optimized RGB Guidance | Ying Fu; Tao Zhang; Yinqiang Zheng; Debing Zhang; Hua Huang | 5509 | |
71 | 15:20 | Adaptive Confidence Smoothing for Generalized Zero-Shot Learning | Yuval Atzmon; Gal Chechik | 5671 | |
72 | 15:20 | PMS-Net: Robust Haze Removal Based on Patch Map for Single Images | Wei-Ting Chen; Jian-Jiun Ding; Sy-Yen Kuo | 5748 | |
73 | 15:20 | Deep Spherical Quantization for Image Search | Sepehr Eghbali; Ladan Tahvildari | 5905 | |
74 | 15:20 | Large-Scale Interactive Object Segmentation With Human Annotators | Rodrigo Benenson; Stefan Popov; Vittorio Ferrari | 6117 | |
75 | 15:20 | A Poisson-Gaussian Denoising Dataset With Real Fluorescence Microscopy Images | Yide Zhang; Yinhao Zhu; Evan Nichols; Qingfei Wang; Siyuan Zhang; Cody Smith; Scott Howard | 6327 | |
76 | 15:20 | Task Agnostic Meta-Learning for Few-Shot Learning | Muhammad Abdullah Jamal; Guo-Jun Qi | 6484 | |
77 | 15:20 | Progressive Ensemble Networks for Zero-Shot Recognition | Meng Ye; Yuhong Guo | 6589 | |
78 | 15:20 | Direct Object Recognition Without Line-Of-Sight Using Optical Coherence | Xin Lei; Liangyu He; Yixuan Tan; Ken Xingze Wang; Xinggang Wang; Yihan Du; Shanhui Fan; Zongfu Yu | 6749 | |
79 | 15:20 | Atlas of Digital Pathology: A Generalized Hierarchical Histological Tissue Type-Annotated Database for Deep Learning | Mahdi S. Hosseini; Lyndon Chan; Gabriel Tse; Michael Tang; Jun Deng; Sajad Norouzi; Corwyn Rowsell; Konstantinos N. Plataniotis; Savvas Damaskinos | 6981 | |
3D Multiview | 80 | 15:20 | Perturbation Analysis of the 8-Point Algorithm: A Case Study for Wide FoV Cameras | Thiago L. T. da Silveira; Claudio R. Jung | 5565 |
81 | 15:20 | Robustness of 3D Deep Learning in an Adversarial Setting | Matthew Wicker; Marta Kwiatkowska | 6014 | |
82 | 15:20 | SceneCode: Monocular Dense Semantic Reconstruction Using Learned Encoded Scene Representations | Shuaifeng Zhi; Michael Bloesch; Stefan Leutenegger; Andrew J. Davison | 6328 | |
83 | 15:20 | StereoDRNet: Dilated Residual StereoNet | Rohan Chabra; Julian Straub; Christopher Sweeney; Richard Newcombe; Henry Fuchs | 6433 | |
84 | 15:20 | The Alignment of the Spheres: Globally-Optimal Spherical Mixture Alignment for Camera Pose Estimation | Dylan Campbell; Lars Petersson; Laurent Kneip; Hongdong Li; Stephen Gould | 6633 | |
3D Single View & RGBD | 85 | 15:20 | Learning Joint Reconstruction of Hands and Manipulated Objects | Yana Hasson; Gül Varol; Dimitrios Tzionas; Igor Kalevatykh; Michael J. Black; Ivan Laptev; Cordelia Schmid | 88 |
86 | 15:20 | Deep Single Image Camera Calibration With Radial Distortion | Manuel López; Roger Marí; Pau Gargallo; Yubin Kuang; Javier Gonzalez-Jimenez; Gloria Haro | 5538 | |
87 | 15:20 | CAM-Convs: Camera-Aware Multi-Scale Convolutions for Single-View Depth | Jose M. Facil; Benjamin Ummenhofer; Huizhong Zhou; Luis Montesano; Thomas Brox; Javier Civera | 5655 | |
88 | 15:20 | Translate-to-Recognize Networks for RGB-D Scene Recognition | Dapeng Du; Limin Wang; Huiling Wang; Kai Zhao; Gangshan Wu | 5730 | |
89 | 15:20 | Re-Identification Supervised Texture Generation | Jian Wang; Yunshan Zhong; Yachun Li; Chi Zhang; Yichen Wei | 6024 | |
90 | 15:20 | Action4D: Online Action Recognition in the Crowd and Clutter | Quanzeng You; Hao Jiang | 6276 | |
91 | 15:20 | Monocular 3D Object Detection Leveraging Accurate Proposals and Shape Reconstruction | Jason Ku; Alex D. Pon; Steven L. Waslander | 6696 | |
Face & Body | 92 | 15:20 | High-Quality Face Capture Using Anatomical Muscles | Michael Bao; Matthew Cong; Stéphane Grabli; Ronald Fedkiw | 4 |
93 | 15:20 | FML: Face Model Learning From Videos | Ayush Tewari; Florian Bernard; Pablo Garrido; Gaurav Bharaj; Mohamed Elgharib; Hans-Peter Seidel; Patrick Pérez; Michael Zollhöfer; Christian Theobalt | 2408 | |
94 | 15:20 | AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations | Xiao Zhang; Rui Zhao; Yu Qiao; Xiaogang Wang; Hongsheng Li | 4483 | |
95 | 15:20 | 3D Hand Shape and Pose Estimation From a Single RGB Image | Liuhao Ge; Zhou Ren; Yuncheng Li; Zehao Xue; Yingying Wang; Jianfei Cai; Junsong Yuan | 387 | |
96 | 15:20 | 3D Hand Shape and Pose From Images in the Wild | Adnane Boukhayma; Rodrigo de Bem; Philip H.S. Torr | 647 | |
97 | 15:20 | Self-Supervised 3D Hand Pose Estimation Through Training by Fitting | Chengde Wan; Thomas Probst; Luc Van Gool; Angela Yao | 843 | |
98 | 15:20 | CrowdPose: Efficient Crowded Scenes Pose Estimation and a New Benchmark | Jiefeng Li; Can Wang; Hao Zhu; Yihuan Mao; Hao-Shu Fang; Cewu Lu | 1497 | |
99 | 15:20 | Towards Social Artificial Intelligence: Nonverbal Social Signal Prediction in a Triadic Interaction | Hanbyul Joo; Tomas Simon; Mina Cikara; Yaser Sheikh | 3233 | |
100 | 15:20 | HoloPose: Holistic 3D Human Reconstruction In-The-Wild | Rıza Alp Güler; Iasonas Kokkinos | 6947 | |
101 | 15:20 | Weakly-Supervised Discovery of Geometry-Aware Representation for 3D Human Pose Estimation | Xipeng Chen; Kwan-Yee Lin; Wentao Liu; Chen Qian; Liang Lin | 2239 | |
102 | 15:20 | In the Wild Human Pose Estimation Using Explicit 2D Features and Intermediate 3D Representations | Ikhsanul Habibie; Weipeng Xu; Dushyant Mehta; Gerard Pons-Moll; Christian Theobalt | 3999 | |
103 | 15:20 | Slim DensePose: Thrifty Learning From Sparse Annotations and Motion Cues | Natalia Neverova; James Thewlis; Rıza Alp Güler; Iasonas Kokkinos; Andrea Vedaldi | 33 | |
104 | 15:20 | Self-Supervised Representation Learning From Videos for Facial Action Unit Detection | Yong Li; Jiabei Zeng; Shiguang Shan; Xilin Chen | 2859 | |
105 | 15:20 | Combining 3D Morphable Models: A Large Scale Face-And-Head Model | Stylianos Ploumpis; Haoyang Wang; Nick Pears; William A. P. Smith; Stefanos Zafeiriou | 4558 | |
106 | 15:20 | Boosting Local Shape Matching for Dense 3D Face Correspondence | Zhenfeng Fan; Xiyuan Hu; Chen Chen; Silong Peng | 4364 | |
107 | 15:20 | Unsupervised Part-Based Disentangling of Object Shape and Appearance | Dominik Lorenz; Leonard Bereska; Timo Milbich; Björn Ommer | 2886 | |
108 | 15:20 | Monocular Total Capture: Posing Face, Body, and Hands in the Wild | Donglai Xiang; Hanbyul Joo; Yaser Sheikh | 2922 | |
109 | 15:20 | Expressive Body Capture: 3D Hands, Face, and Body From a Single Image | Georgios Pavlakos; Vasileios Choutas; Nima Ghorbani; Timo Bolkart; Ahmed A. A. Osman; Dimitrios Tzionas; Michael J. Black | 3128 | |
110 | 15:20 | Attribute-Aware Face Aging With Wavelet-Based Generative Adversarial Networks | Yunfan Liu; Qi Li; Zhenan Sun | 5458 | |
111 | 15:20 | Noise-Tolerant Paradigm for Training Face Recognition CNNs | Wei Hu; Yangyu Huang; Fan Zhang; Ruirui Li | 5518 | |
112 | 15:20 | Low-Rank Laplacian-Uniform Mixed Model for Robust Face Recognition | Jiayu Dong; Huicheng Zheng; Lina Lian | 5529 | |
113 | 15:20 | Generalizing Eye Tracking With Bayesian Adversarial Learning | Kang Wang; Rui Zhao; Hui Su; Qiang Ji | 6040 | |
114 | 15:20 | Local Relationship Learning With Person-Specific Shape Regularization for Facial Action Unit Detection | Xuesong Niu; Hu Han; Songfan Yang; Yan Huang; Shiguang Shan | 6085 | |
115 | 15:20 | Point-To-Pose Voting Based Hand Pose Estimation Using Residual Permutation Equivariant Layer | Shile Li; Dongheui Lee | 6096 | |
116 | 15:20 | Improving Few-Shot User-Specific Gaze Adaptation via Gaze Redirection Synthesis | Yu Yu; Gang Liu; Jean-Marc Odobez | 6510 | |
117 | 15:20 | AdaptiveFace: Adaptive Margin and Sampling for Face Recognition | Hao Liu; Xiangyu Zhu; Zhen Lei; Stan Z. Li | 6610 | |
118 | 15:20 | Disentangled Representation Learning for 3D Face Shape | Zi-Hang Jiang; Qianyi Wu; Keyu Chen; Juyong Zhang | 6817 | |
119 | 15:20 | LBS Autoencoder: Self-Supervised Fitting of Articulated Meshes to Point Clouds | Chun-Liang Li; Tomas Simon; Jason Saragih; Barnabás Póczos; Yaser Sheikh | 6834 | |
120 | 15:20 | PifPaf: Composite Fields for Human Pose Estimation | Sven Kreiss; Lorenzo Bertoni; Alexandre Alahi | 6964 | |
Action & Video | 121 | 15:20 | TACNet: Transition-Aware Context Network for Spatio-Temporal Action Detection | Lin Song; Shiwei Zhang; Gang Yu; Hongbin Sun | 5442 |
122 | 15:20 | Learning Regularity in Skeleton Trajectories for Anomaly Detection in Videos | Romero Morais; Vuong Le; Truyen Tran; Budhaditya Saha; Moussa Mansour; Svetha Venkatesh | 5576 | |
123 | 15:20 | Local Temporal Bilinear Pooling for Fine-Grained Action Parsing | Yan Zhang; Siyu Tang; Krikamol Muandet; Christian Jarvers; Heiko Neumann | 5661 | |
124 | 15:20 | Improving Action Localization by Progressive Cross-Stream Cooperation | Rui Su; Wanli Ouyang; Luping Zhou; Dong Xu | 5677 | |
125 | 15:20 | Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition | Lei Shi; Yifan Zhang; Jian Cheng; Hanqing Lu | 5734 | |
126 | 15:20 | A Neural Network Based on SPD Manifold Learning for Skeleton-Based Hand Gesture Recognition | Xuan Son Nguyen; Luc Brun; Olivier Lézoray; Sébastien Bougleux | 5851 | |
127 | 15:20 | Large-Scale Weakly-Supervised Pre-Training for Video Action Recognition | Deepti Ghadiyaram; Du Tran; Dhruv Mahajan | 5874 | |
128 | 15:20 | Learning Spatio-Temporal Representation With Local and Global Diffusion | Zhaofan Qiu; Ting Yao; Chong-Wah Ngo; Xinmei Tian; Tao Mei | 6155 | |
129 | 15:20 | Unsupervised Learning of Action Classes With Continuous Temporal Embedding | Anna Kukleva; Hilde Kuehne; Fadime Sener; Jürgen Gall | 6348 | |
130 | 15:20 | Double Nuclear Norm Based Low Rank Representation on Grassmann Manifolds for Clustering | Xinglin Piao; Yongli Hu; Junbin Gao; Yanfeng Sun; Baocai Yin | 7172 | |
Motion & Biometrics | 131 | 15:20 | SR-LSTM: State Refinement for LSTM Towards Pedestrian Trajectory Prediction | Pu Zhang; Wanli Ouyang; Pengfei Zhang; Jianru Xue; Nanning Zheng | 5803 |
132 | 15:20 | Unsupervised Deep Epipolar Flow for Stationary or Dynamic Scenes | Yiran Zhong; Pan Ji; Jianyuan Wang; Yuchao Dai; Hongdong Li | 6115 | |
133 | 15:20 | An Efficient Schmidt-EKF for 3D Visual-Inertial SLAM | Patrick Geneva; James Maley; Guoquan Huang | 6223 | |
134 | 15:20 | A Neural Temporal Model for Human Motion Prediction | Anand Gopalakrishnan; Ankur Mali; Dan Kifer; Lee Giles; Alexander G. Ororbia | 6640 | |
135 | 15:20 | Multi-Agent Tensor Fusion for Contextual Trajectory Prediction | Tianyang Zhao; Yifei Xu; Mathew Monfort; Wongun Choi; Chris Baker; Yibiao Zhao; Yizhou Wang; Ying Nian Wu | 6954 | |
Synthesis | 136 | 15:20 | Coordinate-Based Texture Inpainting for Pose-Guided Human Image Generation | Artur Grigorev; Artem Sevastopolsky; Alexander Vakhitov; Victor Lempitsky | 5430 |
137 | 15:20 | On Stabilizing Generative Adversarial Training With Noise | Simon Jenni; Paolo Favaro | 5596 | |
138 | 15:20 | Self-Supervised GANs via Auxiliary Rotation Loss | Ting Chen; Xiaohua Zhai; Marvin Ritter; Mario Lucic; Neil Houlsby | 5940 | |
139 | 15:20 | Texture Mixer: A Network for Controllable Synthesis and Interpolation of Texture | Ning Yu; Connelly Barnes; Eli Shechtman; Sohrab Amirghodsi; Michal Lukáč | 5947 | |
140 | 15:20 | Object-Driven Text-To-Image Synthesis via Adversarial Training | Wenbo Li; Pengchuan Zhang; Lei Zhang; Qiuyuan Huang; Xiaodong He; Siwei Lyu; Jianfeng Gao | 6172 | |
141 | 15:20 | Zoom-In-To-Check: Boosting Video Interpolation via Instance-Level Discrimination | Liangzhe Yuan; Yibo Chen; Hantian Liu; Tao Kong; Jianbo Shi | 6289 | |
142 | 15:20 | Disentangling Latent Space for VAE by Label Relevant/Irrelevant Dimensions | Zhilin Zheng; Li Sun | 6308 | |
Computational Photography & Graphics | 143 | 15:20 | Spectral Reconstruction From Dispersive Blur: A Novel Light Efficient Spectral Imager | Yuanyuan Zhao; Xuemei Hu; Hui Guo; Zhan Ma; Tao Yue; Xun Cao | 5055 |
144 | 15:20 | Quasi-Unsupervised Color Constancy | Simone Bianco; Claudio Cusano | 5370 | |
145 | 15:20 | Deep Defocus Map Estimation Using Domain Adaptation | Junyong Lee; Sungkil Lee; Sunghyun Cho; Seungyong Lee | 5571 | |
146 | 15:20 | Using Unknown Occluders to Recover Hidden Scenes | Adam B. Yedidia; Manel Baradad; Christos Thrampoulidis; William T. Freeman; Gregory W. Wornell | 6650 | |
Low-Level & Optimization | 147 | 15:20 | Neural RGB®D Sensing: Depth and Uncertainty From a Video Camera | Chao Liu; Jinwei Gu; Kihwan Kim; Srinivasa G. Narasimhan; Jan Kautz | 707 |
148 | 15:20 | DAVANet: Stereo Deblurring With View Aggregation | Shangchen Zhou; Jiawei Zhang; Wangmeng Zuo; Haozhe Xie; Jinshan Pan; Jimmy S. Ren | 1006 | |
149 | 15:20 | DVC: An End-To-End Deep Video Compression Framework | Guo Lu; Wanli Ouyang; Dong Xu; Xiaoyun Zhang; Chunlei Cai; Zhiyong Gao | 3657 | |
150 | 15:20 | SOSNet: Second Order Similarity Regularization for Local Descriptor Learning | Yurun Tian; Xin Yu; Bin Fan; Fuchao Wu; Huub Heijnen; Vassileios Balntas | 1098 | |
151 | 15:20 | “Double-DIP”: Unsupervised Image Decomposition via Coupled Deep-Image-Priors | Yosef Gandelsman; Assaf Shocher; Michal Irani | 2154 | |
152 | 15:20 | Unprocessing Images for Learned Raw Denoising | Tim Brooks; Ben Mildenhall; Tianfan Xue; Jiawen Chen; Dillon Sharlet; Jonathan T. Barron | 2579 | |
153 | 15:20 | Residual Networks for Light Field Image Super-Resolution | Shuo Zhang; Youfang Lin; Hao Sheng | 3342 | |
154 | 15:20 | Modulating Image Restoration With Continual Levels via Adaptive Feature Modification Layers | Jingwen He; Chao Dong; Yu Qiao | 3959 | |
155 | 15:20 | Second-Order Attention Network for Single Image Super-Resolution | Tao Dai; Jianrui Cai; Yongbing Zhang; Shu-Tao Xia; Lei Zhang | 5318 | |
156 | 15:20 | Devil Is in the Edges: Learning Semantic Boundaries From Noisy Annotations | David Acuna; Amlan Kar; Sanja Fidler | 2599 | |
157 | 15:20 | Path-Invariant Map Networks | Zaiwei Zhang; Zhenxiao Liang; Lemeng Wu; Xiaowei Zhou; Qixing Huang | 3097 | |
158 | 15:20 | FilterReg: Robust and Efficient Probabilistic Point-Set Registration Using Gaussian Filter and Twist Parameterization | Wei Gao; Russ Tedrake | 5608 | |
159 | 15:20 | Probabilistic Permutation Synchronization Using the Riemannian Structure of the Birkhoff Polytope | Tolga Birdal; Umut Şimşekli | 108 | |
160 | 15:20 | Lifting Vectorial Variational Problems: A Natural Formulation Based on Geometric Measure Theory and Discrete Exterior Calculus | Thomas Möllenhoff; Daniel Cremers | 190 | |
161 | 15:20 | A Sufficient Condition for Convergences of Adam and RMSProp | Fangyu Zou; Li Shen; Zequn Jie; Weizhong Zhang; Wei Liu | 1428 | |
162 | 15:20 | Guaranteed Matrix Completion Under Multiple Linear Transformations | Chao Li; Wei He; Longhao Yuan; Zhun Sun; Qibin Zhao | 5959 | |
163 | 15:20 | MAP Inference via Block-Coordinate Frank-Wolfe Algorithm | Paul Swoboda; Vladimir Kolmogorov | 4802 | |
164 | 15:20 | A Convex Relaxation for Multi-Graph Matching | Paul Swoboda; Dagmar Kainm¨uller; Ashkan Mokarian; Christian Theobalt; Florian Bernard | 5321 | |
165 | 15:20 | Competitive Collaboration: Joint Unsupervised Learning of Depth, Camera Motion, Optical Flow and Motion Segmentation | Anurag Ranjan; Varun Jampani; Lukas Balles; Kihwan Kim; Deqing Sun; Jonas Wulff; Michael J. Black | 92 | |
166 | 15:20 | Learning Parallax Attention for Stereo Image Super-Resolution | Longguang Wang; Yingqian Wang; Zhengfa Liang; Zaiping Lin; Jungang Yang; Wei An; Yulan Guo | 5558 | |
167 | 15:20 | Knowing When to Stop: Evaluation and Verification of Conformity to Output-Size Specifications | Chenglong Wang; Rudy Bunel; Krishnamurthy Dvijotham; Po-Sen Huang; Edward Grefenstette; Pushmeet Kohli | 5817 | |
168 | 15:20 | Spatial Attentive Single-Image Deraining With a High Quality Real Rain Dataset | Tianyu Wang; Xin Yang; Ke Xu; Shaozhe Chen; Qiang Zhang; Rynson W.H. Lau | 5924 | |
169 | 15:20 | Focus Is All You Need: Loss Functions for Event-Based Vision | Guillermo Gallego; Mathias Gehrig; Davide Scaramuzza | 6049 | |
170 | 15:20 | Scalable Convolutional Neural Network for Image Compressed Sensing | Wuzhen Shi; Feng Jiang; Shaohui Liu; Debin Zhao | 6090 | |
171 | 15:20 | Event Cameras, Contrast Maximization and Reward Functions: An Analysis | Timo Stoffregen; Lindsay Kleeman | 6183 | |
172 | 15:20 | Convolutional Neural Networks Can Be Deceived by Visual Illusions | Alexander Gomez-Villa; Adrian Martín; Javier Vazquez-Corral; Marcelo Bertalmío | 6251 | |
173 | 15:20 | PDE Acceleration for Active Contours | Anthony Yezzi; Ganesh Sundaramoorthi; Minas Benyamin | 6423 | |
174 | 15:20 | Dichromatic Model Based Temporal Color Constancy for AC Light Sources | Jun-Sang Yoo; Jong-Ok Kim | 6493 | |
175 | 15:20 | Semantic Attribute Matching Networks | Seungryong Kim; Dongbo Min; Somi Jeong; Sunok Kim; Sangryul Jeon; Kwanghoon Sohn | 6564 | |
176 | 15:20 | Skin-Based Identification From Multispectral Image Data Using CNNs | Takeshi Uemori; Atsushi Ito; Yusuke Moriuchi; Alexander Gatto; Jun Murayama | 6653 | |
177 | 15:20 | Large-Scale Distributed Second-Order Optimization Using Kronecker-Factored Approximate Curvature for Deep Convolutional Neural Networks | Kazuki Osawa; Yohei Tsuji; Yuichiro Ueno; Akira Naruse; Rio Yokota; Satoshi Matsuoka | 7061 | |
Scenes & Representation | 178 | 15:20 | Putting Humans in a Scene: Learning Affordance in 3D Indoor Environments | Xueting Li; Sifei Liu; Kihwan Kim; Xiaolong Wang; Ming-Hsuan Yang; Jan Kautz | 100 |
179 | 15:20 | PIEs: Pose Invariant Embeddings | Chih-Hui Ho; Pedro Morgado; Amir Persekian; Nuno Vasconcelos | 5042 | |
180 | 15:20 | Representation Similarity Analysis for Efficient Task Taxonomy & Transfer Learning | Kshitij Dwivedi; Gemma Roig | 5878 | |
181 | 15:20 | Object Counting and Instance Segmentation With Image-Level Supervision | Hisham Cholakkal; Guolei Sun; Fahad Shahbaz Khan; Ling Shao | 6020 | |
182 | 15:20 | Variational Autoencoders Pursue PCA Directions (by Accident) | Michal Rolínek; Dominik Zietlow; Georg Martius | 6226 | |
183 | 15:20 | A Relation-Augmented Fully Convolutional Network for Semantic Segmentation in Aerial Scenes | Lichao Mou; Yuansheng Hua; Xiao Xiang Zhu | 6246 | |
184 | 15:20 | Temporal Transformer Networks: Joint Learning of Invariant and Discriminative Time Warping | Suhas Lohit; Qiao Wang; Pavan Turaga | 6250 | |
185 | 15:20 | PCAN: 3D Attention Map Learning Using Contextual Information for Point Cloud Based Retrieval | Wenxiao Zhang; Chunxia Xiao | 6500 | |
186 | 15:20 | Depth Coefficients for Depth Completion | Saif Imran; Yunfei Long; Xiaoming Liu; Daniel Morris | 6770 | |
187 | 15:20 | Diversify and Match: A Domain Adaptive Representation Learning Paradigm for Object Detection | Taekyung Kim; Minki Jeong; Seunghyeon Kim; Seokeon Choi; Changick Kim | 6933 | |
Language & Reasoning | 188 | 15:20 | Good News, Everyone! Context Driven Entity-Aware Captioning for News Images | Ali Furkan Biten; Lluis Gomez; Marçal Rusiñol; Dimosthenis Karatzas | 5508 |
189 | 15:20 | Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding | Hassan Akbari; Svebor Karaman; Surabhi Bhargava; Brian Chen; Carl Vondrick; Shih-Fu Chang | 5592 | |
190 | 15:20 | Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning | Nayyer Aafaq; Naveed Akhtar; Wei Liu; Syed Zulqarnain Gilani; Ajmal Mian | 5609 | |
191 | 15:20 | Pointing Novel Objects in Image Captioning | Yehao Li; Ting Yao; Yingwei Pan; Hongyang Chao; Tao Mei | 5669 | |
192 | 15:20 | Informative Object Annotations: Tell Me Something I Don't Know | Lior Bracha; Gal Chechik | 5957 | |
193 | 15:20 | Engaging Image Captioning via Personality | Kurt Shuster; Samuel Humeau; Hexiang Hu; Antoine Bordes; Jason Weston | 5995 | |
194 | 15:20 | Vision-Based Navigation With Language-Based Assistance via Imitation Learning With Indirect Intervention | Khanh Nguyen; Debadeepta Dey; Chris Brockett; Bill Dolan | 6028 | |
195 | 15:20 | TOUCHDOWN: Natural Language Navigation and Spatial Reasoning in Visual Street Environments | Howard Chen; Alane Suhr; Dipendra Misra; Noah Snavely; Yoav Artzi | 6459 | |
196 | 15:20 | A Simple Baseline for Audio-Visual Scene-Aware Dialog | Idan Schwartz; Alexander G. Schwing; Tamir Hazan | 6902 | |
Applications, Medical, & Robotics | 197 | 15:20 | End-To-End Learned Random Walker for Seeded Image Segmentation | Lorenzo Cerrone; Alexander Zeilmann; Fred A. Hamprecht | 5379 |
198 | 15:20 | Efficient Neural Network Compression | Hyeji Kim; Muhammad Umar Karim Khan; Chong-Min Kyung | 5380 | |
199 | 15:20 | Cascaded Generative and Discriminative Learning for Microcalcification Detection in Breast Mammograms | Fandong Zhang; Ling Luo; Xinwei Sun; Zhen Zhou; Xiuli Li; Yizhou Yu; Yizhou Wang | 5462 | |
200 | 15:20 | C3AE: Exploring the Limits of Compact Model for Age Estimation | Chao Zhang; Shuaicheng Liu; Xun Xu; Ce Zhu | 5488 | |
201 | 15:20 | Adaptive Weighting Multi-Field-Of-View CNN for Semantic Segmentation in Pathology | Hiroki Tokunaga; Yuki Teramoto; Akihiko Yoshizawa; Ryoma Bise | 5549 | |
202 | 15:20 | In Defense of Pre-Trained ImageNet Architectures for Real-Time Semantic Segmentation of Road-Driving Images | Marin Oršić; Ivan Krešo; Petra Bevandić; Siniša Šegvic | 5792 | |
203 | 15:20 | Context-Aware Visual Compatibility Prediction | Guillem Cucurull; Perouz Taslakian; David Vazquez | 5796 | |
204 | 15:20 | Sim-To-Real via Sim-To-Sim: Data-Efficient Robotic Grasping via Randomized-To-Canonical Adaptation Networks | Stephen James; Paul Wohlhart; Mrinal Kalakrishnan; Dmitry Kalashnikov; Alex Irpan; Julian Ibarz; Sergey Levine; Raia Hadsell; Konstantinos Bousmalis | 5847 | |
205 | 15:20 | Multiview 2D/3D Rigid Registration via a Point-Of-Interest Network for Tracking and Triangulation | Haofu Liao; Wei-An Lin; Jiarui Zhang; Jingdan Zhang; Jiebo Luo; S. Kevin Zhou | 5882 | |
206 | 15:20 | Context-Aware Spatio-Recurrent Curvilinear Structure Segmentation | Feigege Wang; Yue Gu; Wenxi Liu; Yuanlong Yu; Shengfeng He; Jia Pan | 5904 | |
207 | 15:20 | An Alternative Deep Feature Approach to Line Level Keyword Spotting | George Retsinas; Georgios Louloudis; Nikolaos Stamatopoulos; Giorgos Sfikas; Basilis Gatos | 5911 | |
208 | 15:20 | Dynamics Are Important for the Recognition of Equine Pain in Video | Sofia Broomé; Karina Bech Gleerup; Pia Haubro Andersen; Hedvig Kjellström | 5993 | |
209 | 15:20 | LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving | Gregory P. Meyer; Ankit Laddha; Eric Kee; Carlos Vallespi-Gonzalez; Carl K. Wellington | 6119 | |
210 | 15:20 | Machine Vision Guided 3D Medical Image Compression for Efficient Transmission and Accurate Segmentation in the Clouds | Zihao Liu; Xiaowei Xu; Tao Liu; Qi Liu; Yanzhi Wang; Yiyu Shi; Wujie Wen; Meiping Huang; Haiyun Yuan; Jian Zhuang | 6366 | |
211 | 15:20 | PointPillars: Fast Encoders for Object Detection From Point Clouds | Alex H. Lang; Sourabh Vora; Holger Caesar; Lubing Zhou; Jiong Yang; Oscar Beijbom | 6374 | |
212 | 15:20 | Motion Estimation of Non-Holonomic Ground Vehicles From a Single Feature Correspondence Measured Over N Views | Kun Huang; Yifu Wang; Laurent Kneip | 6388 | |
213 | 15:20 | From Coarse to Fine: Robust Hierarchical Localization at Large Scale | Paul-Edouard Sarlin; Cesar Cadena; Roland Siegwart; Marcin Dymczyk | 6575 | |
214 | 15:20 | Large Scale High-Resolution Land Cover Mapping With Multi-Resolution Data | Caleb Robinson; Le Hou; Kolya Malkin; Rachel Soobitsky; Jacob Czawlytko; Bistra Dilkina; Nebojsa Jojic | 6731 | |
215 | 15:20 | Leveraging Heterogeneous Auxiliary Tasks to Assist Crowd Counting | Muming Zhao; Jian Zhang; Chongyang Zhang; Wenjun Zhang | 6803 |