PuzzleAvatar: Assembling 3D Avatars from Personal Albums

Open Access
Authors
  • M.J. Black
Publication date 12-2024
Journal ACM Transactions on Graphics
Article number 283
Volume | Issue number 43 | 6
Number of pages 15
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Generating personalized 3D avatars is crucial for AR/VR. However, recent text-to-3D methods that generate avatars for celebrities or fictional characters struggle with everyday people. Methods for faithful reconstruction typically require full-body images in controlled settings. What if users could just upload their personal “OOTD” (Outfit Of The Day) photo collection and get a faithful avatar in return? The challenge is that such casual photo collections contain diverse poses, challenging viewpoints, cropped views, and occlusion (albeit with a consistent outfit, accessories, and hairstyle). We address this novel “Album2Human” task by developing PuzzleAvatar, a novel model that generates a faithful 3D avatar (in a canonical pose) from a personal OOTD album, bypassing the challenging estimation of body and camera pose. To this end, we fine-tune a foundational vision-language model (VLM) on such photos, encoding the appearance, identity, garments, hairstyles, and accessories of a person into separate learned tokens and thereby instilling these cues into the VLM. In effect, we exploit the learned tokens as “puzzle pieces” from which we assemble a faithful, personalized 3D avatar. Importantly, we can customize avatars by simply interchanging tokens. As a benchmark for this new task, we create a new dataset, called PuzzleIOI, with 41 subjects in a total of nearly 1k OOTD configurations, in challenging partial photos with paired ground-truth 3D bodies. Evaluation shows that PuzzleAvatar not only achieves high reconstruction accuracy, outperforming TeCH and MVDreamBooth, but also scales uniquely well to album photos and demonstrates strong robustness. Our code and data are publicly available for research purposes at puzzleavatar.is.tue.mpg.de.
Document type Article
Language English
Published at https://doi.org/10.48550/arXiv.2405.14869 https://doi.org/10.1145/3687771
Published at https://arxiv.org/abs/2405.14869
Other links https://puzzleavatar.is.tue.mpg.de
Downloads
PuzzleAvatar (Final published version)