Abstract: We present a self-supervised learning approach to learning monocular 3D face reconstruction with a pose guidance network (PGN). First, we unveil the bottleneck of pose estimation in prior parametric 3D face learning methods, and propose to utilize 3D face landmarks for estimating pose parameters. With our specially designed PGN, our model can learn from both faces with fully labeled 3D landmarks and unlimited unlabeled in-the-wild face images. Our network is further augmented with a self-supervised learning scheme, which exploits face geometry information embedded in multiple frames of the same person, to alleviate the ill-posed nature of regressing 3D face geometry from a single image. These three insights yield a single approach that combines the complementary strengths of parametric model learning and data-driven learning techniques. We conduct a rigorous evaluation on the challenging AFLW2000-3D, Florence and FaceWarehouse datasets, and show that our method outperforms the state-of-the-art for all metrics.

SlidesLive

Similar Papers

3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings
Giorgia Pitteri (Université de Bordeaux, LaBRI)*, Aureélie Bugeau (University of Bordeaux), Slobodan Ilic (Siemens AG), Vincent Lepetit (Ecole des Ponts ParisTech)
Human Motion Deblurring using Localized Body Prior
Jonathan Samuel Lumentut (Inha University), Joshua Santoso (Inha University), In Kyu Park (Inha University)*
Adaptive Spotting: Deep Reinforcement Object Search in 3D Point Clouds
Onkar Krishna (NTT Corporation, Japan)*, Go Irie (NTT Corporation), Xiaomeng Wu (NTT Corporation), Takahito Kawanishi (NTT Corporation), Kunio Kashino (NTT Communication Science Laboratories)