Faithfully modeling the space of articulations is a crucial task that allows recovery and generation of realistic poses, and remains a notorious challenge. To this end, we introduce Neural Riemannian Distance Fields (NRDFs), data-driven priors modeling the space of plausible articulations, represented as the zero-level-set of a neural field in a high-dimensional product-quaternion space. To train NRDFs only on positive examples, we introduce a new sampling algorithm, ensuring that the geodesic distances follow a desired distribution, yielding a principled distance field learning paradigm. We then devise a projection algorithm to map any random pose onto the level-set by an adaptive-step Riemannian optimizer, adhering to the product manifold of joint rotations at all times. NRDFs can compute the Riemannian gradient via backpropagation and by mathematical analogy, are related to Riemannian flow matching, a recent generative model. We conduct a comprehensive evaluation of NRDF against other pose priors in various downstream tasks, i.e., pose generation, image-based pose estimation, and solving inverse kinematics, highlighting NRDF’s superior performance. Besides humans, NRDF’s versatility extends to hand and animal poses, as it can effectively represent any articulation.
In media: NRDF was featured as CVPR'24 highlights in Computer Vision News magazine.
We can denoise unrealistic poses by projecting the noisy pose onto the learned manifold. Use the slider here to see the projection process.
Noisy Pose
Projected Pose
Noisy Pose
Projected Pose
Noisy Pose
Projected Pose
Noisy Pose
Projected Pose
Noisy Pose
Projected Pose
Noisy Pose
Projected Pose
Given partial observation (yellow markers), we perform 3D pose completion. We observe that VPoser (pink) based optimization generates realistic, yet fixed and less diverse poses. NRDF (blue) generates diverse and realistic poses in all setups.
Top: Results from SMPLer-X Bottom: We refine the network prediction using NRDF based optimization pipeline. As highlighted, refined poses align better with the observation.
@inproceedings{he24nrdf,
title = {NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors},
author = {He, Yannan and Tiwari, Garvita and Birdal, Tolga and Lenssen, Jan Eric and Pons-Moll, Gerard},
booktitle = {Conference on Computer Vision and Pattern Recognition ({CVPR})},
year = {2024},
}