- University of Tübingen
Maria-von-Linden Strasse 6
72076 - 20-5/A19 (tba)
- +49 681 9325 2135
- gerard.pons-moll@uni-tuebingen.de
For all enquiries please contact:
Jessi Endress
Administrative Assistant
+ 49 7071 29 70803
jessica.endress@uni-tuebingen.de
Short Bio
Gerard Pons-Moll is a Professor at the University of Tübingen endowed by the Carl Zeiss Foundation, at the department of Computer Science. He is also core faculty at the Tübingen AI Center and head of the Emmy Noether independent research group "Real Virtual Humans", senior researcher at the Max Planck for Informatics (MPII) in Saarbrücken, Germany, and faculty at the IMPRS-IS (International Max Planck Research School - Intelligent Systems in Tübingen) and faculty at Saarland Informatics Campus. His research lies at the intersection of computer vision, computer graphics and machine learning -- with special focus on analyzing people in videos, and creating virtual human models by "looking" at real ones. His research has produced some of the most advanced statistical human body models of pose, shape, soft-tissue and clothing (which are currently used for a number of applications in industry and research), as well as algorithms to track and reconstruct 3D people models from images, video, depth, and IMUs.
His work has received several awards including the prestigious Emmy Noether Grant (2018), a Google Faculty Research Award (2019), a Facebook Reality Labs Faculty Award (2018), and recently the German Pattern Recognition Award (2019), which is given annually by the German Pattern Recognition Society to one outstanding researcher in the fields of Computer Vision and Machine Learning. In 2020 he received a Snap-Research gift. His work got Best Papers Awards BMVC’13, Eurographics’17, 3DV'18 and CVPR'20 and has been published at the top venues and journals including CVPR, ICCV, Siggraph, Eurographics, 3DV, IJCV and PAMI. He served as Area Chair for ECCV'18, 3DV'19, SCA'18'19, FG'20, ECCV'20. He will serve as Area Chair for CVPR'21, IJCAI'21 and 3DV'20.
Recent tweets
Awards and grants:
Winner of all tracks of the ECCV'20 Challenge with J. Chibane | 2020 | |
Best Student Paper Award Honorable Mention of CVPR'20 (with M. Habermann, W. Xu, M. Zhollhoefer, Christian Theobalt) | 2020 | |
Snap Research Gift | 2020 | |
German Pattern Recognition Award | 2019 | |
Outstanding Reviewer Award of CVPR'19 | 2019 | |
Google Faculty Research Award | 2019 | |
Emmy Noether starting grant | 2018 | |
Facebook Reality Labs Faculty Research Award | 2018 | |
Best Student Paper Award of 3DV'18 (with M. Omran, C.Lassner, P. Gehler, B. Schiele) | (pdf) | 2018 |
Best Paper Award at Eurographics'17 (with T. von Marcard, B. Rosenhahn and M.J. Black) | (pdf) | 2017 |
Starting grant from the Zentrum Digitialisierung Bayern (ZD.B) (Declined) | 2017 | |
Best Paper Award at BMVC'13 (with J. Taylor, J. Shotton, A. Hertzmann & A. Fitzgibbon) | (pdf) | 2017 |
Ph.D degree "with distinction" for the first time in 10 years at the Institute for Information Processing at the Leibniz University of Hannover | 2014 | |
Vodafone Foundation Fellowship. Awarded to outstanding graduates at UPC to conduct graduate research at USA | 2007 |
Professional Activities:
Program Chair of 3DV 2021 | 2021 |
Area Chair CVPR | 2021 |
Area Chair IJCAI | 2021 |
Area Chair ECCV | 2020 |
Area Chair 3DV | 2020 |
Area Chair FG (Face and Gesture) | 2020 |
Area Chair 3DV | 2019 |
Area Chair ECCV | 2018 |
International program committee* Pacific Graphics (PG), Motion Interaction and Games (MIG) * equivalent to Area Chair in vision conferences | 2019 |
International program committee SCA (Symposium on Computer Animation), CASA (Computer Animation and Social Agents) | 2017, 2018 |
Workshop chair ICCV'17, ECCV'18, ECCV'20 | 2017, 2018 |
Tutorial organization ICCV'15, Siggraph'16 | 2015, 2016 |
Reviewer Siggraph, Siggraph Asia | since 2016 | Reviewer of Eurographics | 2018, 2019 |
Program committee CVPR, ICCV, ECCV | since 2012 |
Program committee NIPS | 2017 |
Program committee ICML | 2018 |
Reviewer of T-PAMI, IJCV | since 2013 |
Reviewer of Transactions on Visualization and Computer Graphics (TVCG) | since 2013 |
Reviewer of ISMAR, BMVC, ACCV, Transactions on Circuits and Systems for Video Technology (TCSVT) | since 2013 (some years only) |
Grant Reviewer for German Research Foundation (DFG), French National Research Agency (ANR), Israel Science Foundation (ISF). | since 2017 |
Numerous invited talks. Examples: workshops of CVPR'18, CVPR'19, ICCV'19, ICCV'11, VR-days'18, and many research institutions, see talks | |
PhD committee: INRIA'17,'19, EPFL'18, Sapienza'19, Braunschweig'20 | |
Associate Member of Center for Learning Systems (ETH-MPI-IS) | 2015-2017 |
Positions:
Since 12/2018 | Emmy Noether Independent Group Leader & Senior researcher at Max Planck Institute for Informatics
(MPI-Inf), Saarbruecken. Junior Faculty at Saarland Informatics Campus (SIC) |
09/2017 - 09/2018 | Research Leader, Max Planck Institute for Informatics, Saarbruecken, head of Real Virtual Humans |
09/2015 - 09/2017 | Research Scientist, Max Planck Institute for Intelligent Systems (MPI-IS), Tuebingen |
03/2014 - 09/2015 | PostDoc, Max Planck Institute for Intelligent Systems, Tuebingen |
02/2014 | PhD degree (with distinction) from the Leibniz University of Hannover. Thesis: ``Human Pose Estimation from Video and Inertial Sensors''. |
10/2012 - 02/2014 | PhD student (until 09/2013 at Leibniz U. of Hannover, from 09/2013 until 02/2014 at MPI-IS) |
07/2012 - 10/2012 | Research Intern, Microsoft Research Cambridge |
01/2012 - 06/2012 | Research Visitor, University of Toronto |
02/2009 - 01/2012 | PhD student at the Leibniz University of Hannover. |
01/2007 - 08/2008 | Master Thesis at Northeastern University, Boston, USA |
[2002 - 08/2008] | Telecommunications Engineering B.S. and M.Sc. Technical University of Catalonia. Thesis: ``4D Cardiac MRI Segmentation and Surface Reconstruction''. |
Publications
FORCE: Dataset and Method for Intuitive Physics Guided Human-object Interaction
in International Conference on 3D Vision (3DV), 2025.
Unimotion: Unifying 3D Human Motion Synthesis and Understanding
in International Conference on 3D Vision (3DV), 2025.
HMD^2: Environment-aware Motion Generation from Single Egocentric Head-Mounted Device
in International Conference on 3D Vision (3DV), 2025.
InterTrack: Tracking Human Object Interaction without Object Templates
in International Conference on 3D Vision (3DV), 2025.
Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models
in Advances in Neural Information Processing Systems (NeurIPS), 2024.
Neural Localizer Fields for Continuous 3D Human Pose and Shape Estimation
in Advances in Neural Information Processing Systems (NeurIPS), 2024.
NICP: Neural ICP for 3D Human Registration at Scale
in European Conference on Computer Vision, 2024.
Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators
in Arxiv, 2024.
Blendify - Python rendering framework for Blender
in Arxiv, 2024.
Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors
in Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Highlight, 11.9% of accepted papers
GEARS: Local Geometry-aware Hand-object Interaction Synthesis
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
SCENIC: Scene-aware Semantic Navigation with Instruction-guided Control
in Arxiv, 2024.
Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Highlight, 11.9% of accepted papers
Recent Trends in 3D Reconstruction of General Non-Rigid Scenes
in Eurographics, 2024.
Interaction Replica: Tracking human–object interaction and scene changes from human motion
in International Conference on 3D Vision (3DV), 2024.
Generating Continual Human Motion in Diverse 3D Scenes
in International Conference on 3D Vision (3DV), 2024.
CloSe: A 3D Clothing Segmentation Dataset and Model
in International Conference on 3D Vision (3DV), 2024.
GAN-Avatar: Controllable Personalized GAN-based Human Head Avatar
in International Conference on 3D Vision (3DV), 2024.
NSF: Neural Surface Fields for Human Modeling from Monocular Depth
in ICCV, 2023.
HDHumans: A Hybrid Approach for High-fidelity Digital Humans
in Symposium on Computer Animation(SCA), 2023.
Object pop-up: Can we infer 3D objects and their poses from human interactions alone?
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
Visibility Aware Human-Object Interaction Tracking from Single RGB Camera
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
Control-NeRF: Editable Feature Volumes for Scene Rendering and Manipulation
in Winter Conference on Applications of Computer Vision (WACV), 2023.
Adjoint Rigid Transform Network: Task-conditioned Alignment of 3D Shapes
in 2022 International Conference on 3D Vision (3DV), 2022.
Any-Shot GIN: Generalizing Implicit Networks for Reconstructing Novel Classes
in 2022 International Conference on 3D Vision (3DV), 2022.
Oral - Best Paper Honourable Mention
Box2Mask: Weakly Supervised 3D Semantic Instance Segmentation Using Bounding Boxes
in European Conference on Computer Vision (ECCV), 2022.
Oral
CHORE: Contact, Human and Object REconstruction from a single RGB image
in European Conference on Computer Vision (ECCV), 2022.
COUCH: Towards Controllable Human-Chair Interactions
in European Conference on Computer Vision (ECCV), 2022.
Learned Vertex Descent: A New Direction for 3D Human Model Fitting
in European Conference on Computer Vision (ECCV), 2022.
Pose-NDF: Modeling Human Pose Manifolds with Neural Distance Fields
in European Conference on Computer Vision (ECCV), 2022.
Oral - Best Paper Honourable Mention
Skeleton-free Pose Transfer for Stylized 3D Characters
in European Conference on Computer Vision (ECCV), 2022.
TOCH: Spatio-Temporal Object-to-Hand Correspondence for Motion Refinement
in European Conference on Computer Vision (ECCV), 2022.
BEHAVE: Dataset and Method for Tracking Human Object Interactions
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
Learning Speech-driven 3D Conversational Gestures from Video
in ACM International Conference on Intelligent Virtual Agents (IVA), 2021.
Best Paper Award
Neural-GIF: Neural Generalized Implicit Functions for Animating People in Clothing
in International Conference on Computer Vision (ICCV), 2021.
Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-localization in Large Scenes from Body-Mounted Sensors
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
(First two authors contributed equally)
Oral, Best paper finalist
Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
SMPLicit: Topology-aware Generative Model for Clothed People
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
D-NeRF: Neural Radiance Fields for Dynamic Scenes
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
Neural Unsigned Distance Fields for Implicit Function Learning
in Advances in Neural Information Processing Systems (NeurIPS), 2020.
LoopReg: Self-supervised Learning of Implicit Surface Correspondences, Pose and Shape for 3D Human Mesh Registration
in Advances in Neural Information Processing Systems (NeurIPS), 2020.
Oral
SelfPose: 3D Egocentric Pose Estimation from a Headset Mounted Camera
in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020.
Body Shape Privacy in Images: Understanding Privacy and Preventing Automatic Shape Extraction
in European Conference on Computer Vision (ECCV), Workshops, 2020.
Implicit Feature Networks for Texture Completion from Partial 3D Data
in European Conference on Computer Vision (ECCV), Workshops, 2020.
1st Place Winner in all Categories of SHARP Challange
NASA: Neural Articulated Shape Approximation
in The European Conference on Computer Vision (ECCV), 2020.
SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing
in European Conference on Computer Vision (ECCV), 2020.
Oral
Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction
in European Conference on Computer Vision (ECCV), 2020.
Oral
Unsupervised Shape and Pose Disentanglement for 3D Meshes
in European Conference on Computer Vision (ECCV), 2020.
Kinematic 3D Object Detection in Monocular Video
in The European Conference on Computer Vision (ECCV), 2020.
XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera
in ACM Transactions on Graphics, (Proc. SIGGRAPH), vol. 39, no. 4, 2020.
Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
TailorNet: Predicting Clothing in 3D as a Function of Human Pose, Shape and Garment Style
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
Oral
Learning to Transfer Texture from Clothing Images to 3D Humans
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
Learning to Dress 3D People in Generative Clothing
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
DeepCap: Monocular Human Performance Capture Using Weak Supervision
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
Oral - Best Student Paper Honorable Mention
Multi-Garment Net: Learning to Dress 3D People from Images
in IEEE International Conference on Computer Vision (ICCV), 2019.
Tex2Shape: Detailed Full Human Body Geometry from a Single Image
in IEEE International Conference on Computer Vision (ICCV), 2019.
AMASS: Archive of Motion Capture as Surface Shapes
in IEEE International Conference on Computer Vision (ICCV), 2019.
360-Degree Textures of People in Clothing from a Single Image
in International Conference on 3D Vision (3DV), 2019.
DoubleFusion: Real-time Capture of Human Performance with Inner Body Shape from a Depth Sensor
in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019.
LiveCap: Real-time Human Performance Capture from Monocular Video
in ACM Transactions on Graphics, (Proc. SIGGRAPH), 2019.
In the Wild Human Pose Estimation using Explicit 2D Features and Intermediate 3D Representations
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
Oral
Learning to Reconstruct People in Clothing from a Single RGB Camera
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
SimulCap : Single-View Human Performance Capture with Cloth Simulation
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
Fashion is Taking Shape: Understanding Clothing Preference Based on Body Shape From Online Sources
in IEEE Winter Conference on Applications of Computer Vision (WACV 2019), 2019.
Deep Inertial Poser: Learning to Reconstruct Human Pose from Sparse Inertial Measurements in Real Time
in ACM Transactions on Graphics, (Proc. SIGGRAPH Asia), vol. 37, no. 6, 185:1-185:15 2018.
NRST: Non-rigid Surface Tracking from Monocular Video
in German Conference on Pattern Recognition (GCPR), 2018.
Oral
Detailed Human Avatars from Monocular Video
in International Conference on 3D Vision (3DV), 2018.
Neural Body Fitting: Unifying Deep Learning and Model Based Human Pose and Shape Estimation
in International Conference on 3D Vision (3DV), 2018.
Oral, 3DV Best Student Paper Award
Recovering Accurate 3D Human Pose in The Wild Using IMUs and a Moving Camera
in European Conference on Computer Vision (ECCV), 2018.
3D Poses in the Wild (3DPW) dataset available to download!
Single-Shot Multi-Person 3D Pose Estimation From Monocular RGB
in International Conference on 3D Vision (3DV), 2018.
DoubleFusion: Real-time Capture of Human Performance with Inner Body Shape from a Depth Sensor
in IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2018.
Oral
Video Based Reconstruction of 3D People Models
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
Spotlight
Single-Shot Multi-Person 3D Body Pose Estimation From Monocular RGB Input
in arXiv preprint arXiv:1712.03453, 2018.
A Generative Model of People in Clothing
in Proceedings IEEE International Conference on Computer Vision (ICCV), IEEE, 2017.
Spotlight
ClothCap: Seamless 4D Clothing Capture and Retargeting
in ACM Transactions on Graphics (SIGGRAPH), vol. 36, no. 4, 2017.
Data-Driven Physics for Human Soft Tissue Animation
in ACM Transactions on Graphics, (Proc. SIGGRAPH), vol. 36, no. 4, 2017.
Detailed, accurate, human shape estimation from clothed 3D scan sequences
in IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2017.
Spotlight
Dynamic FAUST: Registering Human Bodies in Motion
in IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2017.
Oral
Sparse Inertial Poser: Automatic 3D Human Pose Estimation from Sparse IMUs
in Computer Graphics Forum 36(2), Proceedings of the 38th Annual Conference of the European Association for Computer Graphics (Eurographics), 349-360 2017.
Best Paper Award
Human Pose Estimation from Video and IMUs
in Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2016.
SMPL: A Skinned Multi-Person Linear Model
in ACM Trans. Graphics (Proc. SIGGRAPH Asia), vol. 34, no. 6, ACM, 248:1-248:16 2015.
Dyna: A Model of Dynamic Human Shape in Motion
in ACM Transactions on Graphics, (Proc. SIGGRAPH), vol. 34, no. 4, ACM, 120:1-120:14 2015.
Metric Regression Forests for Correspondence Estimation
in International Journal of Computer Vision (IJCV), 1-13 2015.
Posebits for Monocular Human Pose Estimation
in Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2345-2352 2014.
Metric Regression Forests for Human Pose Estimation
in British Machine Vision Conference (BMVC), BMVA Press, 2013.
Best Paper Award
PCA-enhanced stochastic optimization methods
in German Conference on Pattern Recognition (GCPR), 2012.
Branch-and-price global optimization for multi-view multi-object tracking
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012.
Data-driven Manifolds for Outdoor Motion Capture
in Outdoor and Large-Scale Real-World Scene Analysis, Springer, 305-328 2012.
Exploiting pedestrian interaction via global optimization and social behaviors
in Theoretic Foundations of Computer Vision: Outdoor and Large-Scale Real-World Scene Analysis, Springer, 2012.
Everybody needs somebody: modeling social and grouping behavior on a linear programming multiple people tracker
in IEEE International Conference on Computer Vision Workshops (IICCVW), 2011.
Outdoor Human Motion Capture using Inverse Kinematics and von Mises-Fisher Sampling
in IEEE International Conference on Computer Vision (ICCV), 1243-1250 2011.
Efficient and Robust Shape Matching for Model Based Human Motion Capture
in German Conference on Pattern Recognition (GCPR), 416-425 2011.
Oral
Analyzing and Evaluating Markerless Motion Tracking Using Inertial Sensors
in European Conference on Computer Vision (ECCV Workshops), 2010.
Multisensor-Fusion for 3D Full-Body Human Motion Capture
in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2010.
Ball Joints for Marker-less Human Motion Capture
in IEEE Workshop on Applications of Computer Vision (WACV), 2009.
4D Cardiac Segmentation of the Epicardium and Left Ventricle
in World Congress of Medical Physics and Biomedical Engineering (WC), 2009.
Parametric Modeling of the Beating Heart with Respiratory Motion Extracted from Magnetic Resonance Images
in IEEE Computers in Cardiology (CINC), 2009.