Reconstructing Signing Avatars from Video Using Linguistic Priors

M.-P. Forte; P. Kulits; C.-H. Huang; V. Choutas; D. Tzionas; K.J. Kuchenbecker; M.J. Black

doi:https://doi.org/10.48550/arXiv.2304.10482

Reconstructing Signing Avatars from Video Using Linguistic Priors

Authors	M.-P. Forte P. Kulits C.-H. Huang V. Choutas D. Tzionas K.J. Kuchenbecker M.J. Black
Publication date	2023
Book title	CVPR 2023
Book subtitle	proceedings: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition : Vancouver, Canada : 18-22 June 2023
ISBN	9798350301304
ISBN (electronic)	9798350301298
Event	IEEE/CVF Conf. on Computer Vision and Pattern Recognition (CVPR) 2023
Pages (from-to)	12791-12801
Publisher	Los Alamitos, California: IEEE Computer Society
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Sign language (SL) is the primary method of communication for the 70 million Deaf people around the world. Video dictionaries of isolated signs are a core SL learning tool. Replacing these with 3D avatars can aid learning and enable AR/VR applications, improving access to technology and online media. However, little work has attempted to estimate expressive 3D avatars from SL video; occlusion, noise, and motion blur make this task difficult. We address this by introducing novel linguistic priors that are universally applicable to SL and provide constraints on 3D hand pose that help resolve ambiguities within isolated signs. Our method, SGNify, captures fine-grained hand pose, facial expression, and body movement fully automatically from in-the-wild monocular SL videos. We evaluate SGNify quantitatively by using a commercial motion-capture system to compute 3D avatars synchronized with monocular video. SGNify outperforms state-of-the-art 3D body-pose- and shape-estimation methods on SL videos. A perceptual study shows that SGNify's 3D reconstructions are significantly more comprehensible and natural than those of previous methods and are on par with the source videos. Code and data are available at sgnify.is.tue.mpg.de.
Document type	Conference contribution
Note	With supplemental material.
Language	English
Published at	https://doi.org/10.48550/arXiv.2304.10482 https://doi.org/10.1109/CVPR52729.2023.01230
Published at	https://openaccess.thecvf.com/content/CVPR2023/html/Forte_Reconstructing_Signing_Avatars_From_Video_Using_Linguistic_Priors_CVPR_2023_paper.html
Other links	https://sgnify.is.tue.mpg.de https://www.proceedings.com/70184.html
Downloads	2304.10482 (Accepted author manuscript)
Supplementary materials	Forte_Reconstructing_Signing_Avatars_CVPR_2023_supplemental
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Reconstructing Signing Avatars from Video Using Linguistic Priors