Data Structure
The dataset contains images with ground-truth shape for bodies, hands, and faces together.
Data Collection
We begin with the SMPL+H dataset [52], obtaining one full-body RGB image per frame. We then align SMPL-X to the 4D scans following [68]. An expert annotator manually curated the dataset to select 100 frames that can be confidently considered pseudo ground-truth, based on alignment quality and on interesting hand poses and facial expressions. The pseudo ground-truth meshes allow the use of a stricter vertex-to-vertex (v2v) error metric [48, 62], in contrast to the common paradigm of reporting 3D joint error, which does not capture surface errors or rotations along the bones.
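The difference between the two metrics can be illustrated with a minimal sketch. It assumes that the predicted and ground-truth meshes share the same topology, so that row i in both arrays refers to the same vertex, and it omits any rigid alignment step that an evaluation protocol may apply before measuring error. The function names and the toy "bone" below are hypothetical and are not part of the dataset or its tooling.

```python
import numpy as np


def mean_point_error(pred, gt):
    """Mean Euclidean distance between corresponding 3D points.

    Both inputs are (N, 3) arrays with row-wise correspondence.
    """
    pred = np.asarray(pred, dtype=float)
    gt = np.asarray(gt, dtype=float)
    if pred.shape != gt.shape or pred.ndim != 2 or pred.shape[1] != 3:
        raise ValueError("expected two arrays of identical shape (N, 3)")
    return float(np.linalg.norm(pred - gt, axis=1).mean())


def v2v_error(pred_vertices, gt_vertices):
    """Vertex-to-vertex error: mean distance over all mesh vertices."""
    return mean_point_error(pred_vertices, gt_vertices)


def joint_error(pred_joints, gt_joints):
    """3D joint error: mean distance over skeleton joints only."""
    return mean_point_error(pred_joints, gt_joints)


# Toy example (hypothetical data): one bone along the x axis with
# four surface vertices placed around its midpoint.
gt_joints = np.array([[0.0, 0.0, 0.0],
                      [1.0, 0.0, 0.0]])
gt_vertices = np.array([[0.5, 0.1, 0.0],
                        [0.5, -0.1, 0.0],
                        [0.5, 0.0, 0.1],
                        [0.5, 0.0, -0.1]])

# A 90-degree rotation about the bone axis (the x axis).
rot_x_90 = np.array([[1.0, 0.0, 0.0],
                     [0.0, 0.0, -1.0],
                     [0.0, 1.0, 0.0]])

# The "prediction" is the ground truth twisted around the bone.
pred_joints = gt_joints @ rot_x_90.T
pred_vertices = gt_vertices @ rot_x_90.T

joint_err = joint_error(pred_joints, gt_joints)
v2v_err = v2v_error(pred_vertices, gt_vertices)
print(f"joint error: {joint_err:.4f}")
print(f"v2v error:   {v2v_err:.4f}")
```

In this toy case the twist leaves both joints in place, so the joint error is zero, while every surface vertex moves and the v2v error is about 0.141 (in the same units as the input coordinates). This is the kind of rotation along a bone that a joint-only metric cannot detect.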
[48]: Matthew Loper, Naureen Mahmood, Javier Romero, Gerard Pons-Moll, and Michael J. Black. SMPL: A skinned multi-person linear model. ACM Transactions on Graphics (Proc. SIGGRAPH Asia), 34(6):248:1–248:16, Oct. 2015.
[52]: MANO and SMPL+H website: models, dataset, and code. http://mano.is.tue.mpg.de.
[62]: Georgios Pavlakos, Luyang Zhu, Xiaowei Zhou, and Kostas Daniilidis. Learning to estimate 3D human pose and shape from a single color image. In CVPR, 2018.
[68]: Javier Romero, Dimitrios Tzionas, and Michael J. Black. Embodied hands: Modeling and capturing hands and bodies together. ACM Transactions on Graphics (TOG), 2017.
