Implicit Feature Networks (IF-Nets)

For 3D Shape Reconstruction and Completion

Julian Chibane^1,2, Thiemo Alldieck^1,3, Gerard Pons-Moll¹

¹Max Planck Institute for Informatics, Saarland Informatics Campus, Germany
²Julius-Maximilians-Univeristät Würzburg, Germany
³Computer Graphics Lab, TU Braunschweig, Germany

CVPR 2020, Seattle, Washington, USA

Paper, Supplementary, Code, Video, Arxiv

For Texture Completion

Julian Chibane, Gerard Pons-Moll

Max Planck Institute for Informatics, Saarland Informatics Campus, Germany

ECCV 2020, SHARP Workshop, Glasgow, Scotland (Online)

Paper, Code, Arxiv

1st Place Winner in all Categories of SHARP Challange

Abstract

While many works focus on 3D reconstruction from images, in this paper, we focus on 3D shape reconstruction and completion from a variety of 3D inputs, which are deficient in some respect: low and high resolution voxels, sparse and dense point clouds, complete or incomplete. Processing of such 3D inputs is an increasingly important problem as they are the output of 3D scanners, which are becoming more accessible, and are the intermediate output of 3D computer vision algorithms. Recently, learned implicit functions have shown great promise as they produce continuous reconstructions. However, we identified two limitations in reconstruction from 3D inputs: 1) details present in the input data are not retained, and 2) poor reconstruction of articulated humans. To solve this, we propose Implicit Feature Networks (IF-Nets), which deliver continuous outputs, can handle multiple topologies, and complete shapes for missing or sparse input data retaining the nice properties of recent learned implicit functions, but critically they can also retain detail when it is present in the input data, and can reconstruct articulated humans. Our work differs from prior work in two crucial aspects. First, instead of using a single vector to encode a 3D shape, we extract a learnable 3-dimensional multi-scale tensor of deep features, which is aligned with the original Euclidean space embedding the shape. Second, instead of classifying x-y-z point coordinates directly, we classify deep features extracted from the tensor at a continuous query point. We show that this forces our model to make decisions based on global and local shape structure, as opposed to point coordinates, which are arbitrary under Euclidean transformations. Experiments demonstrate that IF-Nets clearly outperform prior work in 3D object reconstruction in ShapeNet, and obtain significantly more accurate 3D human reconstructions.

Single-View Human Reconstruction

Single-View Video Reconstruction

Shape and Texture Completion of Arbitrary Objects

Citations

@inproceedings{chibane20ifnet,
    title = {Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion},
    author = {Chibane, Julian and Alldieck, Thiemo and Pons-Moll, Gerard},
    booktitle = {{IEEE} Conference on Computer Vision and Pattern Recognition (CVPR)},
    month = {jun},
    organization = {{IEEE}},
    year = {2020},
}

@inproceedings{chibane2020ifnet_texture,
    title = {Implicit Feature Networks for Texture Completion from Partial 3D Data},
    author = {Chibane, Julian and Pons-Moll, Gerard},
    booktitle = {European Conference on Computer Vision (ECCV) Workshops},
    month = {August},
    organization = {{Springer}},
    year = {2020},
}