Shortcuts

Body 3D Keypoint




H36m Dataset


Motionbert + Motionbert on H36m

MotionBERT (2022)
 @misc{Zhu_Ma_Liu_Liu_Wu_Wang_2022,
 title={Learning Human Motion Representations: A Unified Perspective},
 author={Zhu, Wentao and Ma, Xiaoxuan and Liu, Zhaoyang and Liu, Libin and Wu, Wayne and Wang, Yizhou},
 year={2022},
 month={Oct},
 language={en-US}
 }
Human3.6M (TPAMI'2014)
@article{h36m_pami,
author = {Ionescu, Catalin and Papava, Dragos and Olaru, Vlad and Sminchisescu, Cristian},
title = {Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
publisher = {IEEE Computer Society},
volume = {36},
number = {7},
pages = {1325-1339},
month = {jul},
year = {2014}
}

Results on Human3.6M dataset with ground truth 2D detections

Arch MPJPE average MPJPE P-MPJPE ckpt
MotionBERT* 34.5 34.6 27.1 ckpt
MotionBERT-finetuned* 26.9 26.8 21.0 ckpt

Results on Human3.6M dataset converted from the official repo1 with ground truth 2D detections

Arch MPJPE average MPJPE P-MPJPE ckpt log
MotionBERT* 39.8 39.2 33.4 ckpt /
MotionBERT-finetuned* 37.7 37.2 32.2 ckpt /

1 By default, we test models with Human 3.6m dataset processed by MMPose. The official repo’s dataset includes more data and applies a different pre-processing technique. To achieve the same result with the official repo, please download the test annotation file, train annotation file and factors under $MMPOSE/data/h36m/annotation_body3d/fps50 and test with the configs we provided.

Models with * are converted from the official repo. The config files of these models are only for validation. We don’t ensure these config files’ training accuracy and welcome you to contribute your reproduction results.


Image Pose Lift + Simplebaseline3d on H36m

SimpleBaseline3D (ICCV'2017)
@inproceedings{martinez_2017_3dbaseline,
  title={A simple yet effective baseline for 3d human pose estimation},
  author={Martinez, Julieta and Hossain, Rayat and Romero, Javier and Little, James J.},
  booktitle={ICCV},
  year={2017}
}
Human3.6M (TPAMI'2014)
@article{h36m_pami,
  author = {Ionescu, Catalin and Papava, Dragos and Olaru, Vlad and Sminchisescu,  Cristian},
  title = {Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments},
  journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
  publisher = {IEEE Computer Society},
  volume = {36},
  number = {7},
  pages = {1325-1339},
  month = {jul},
  year = {2014}
}

Results on Human3.6M dataset with ground truth 2D detections

Arch MPJPE P-MPJPE ckpt log
SimpleBaseline3D1 43.4 34.3 ckpt log

1 Differing from the original paper, we didn’t apply the max-norm constraint because we found this led to a better convergence and performance.


Video Pose Lift + Videopose3d on H36m

VideoPose3D (CVPR'2019)
@inproceedings{pavllo20193d,
title={3d human pose estimation in video with temporal convolutions and semi-supervised training},
author={Pavllo, Dario and Feichtenhofer, Christoph and Grangier, David and Auli, Michael},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={7753--7762},
year={2019}
}
Human3.6M (TPAMI'2014)
@article{h36m_pami,
author = {Ionescu, Catalin and Papava, Dragos and Olaru, Vlad and Sminchisescu, Cristian},
title = {Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments},
journal = {IEEE Transactions on Pattern Analysis and Machine Intelligence},
publisher = {IEEE Computer Society},
volume = {36},
number = {7},
pages = {1325-1339},
month = {jul},
year = {2014}
}

Testing results on Human3.6M dataset with ground truth 2D detections, supervised training

Arch Receptive Field MPJPE P-MPJPE ckpt log
VideoPose3D-supervised-27frm 27 40.1 30.1 ckpt log
VideoPose3D-supervised-81frm 81 39.1 29.3 ckpt log
VideoPose3D-supervised-243frm 243 37.6 28.3 ckpt log

Testing results on Human3.6M dataset with CPN 2D detections1, supervised training

Arch Receptive Field MPJPE P-MPJPE ckpt log
VideoPose3D-supervised-CPN-1frm 1 53.0 41.3 ckpt log
VideoPose3D-supervised-CPN-243frm 243 47.9 38.0 ckpt log

Testing results on Human3.6M dataset with ground truth 2D detections, semi-supervised training

Training Data Arch Receptive Field MPJPE P-MPJPE N-MPJPE ckpt log
10% S1 VideoPose3D-semi-supervised-27frm 27 57.2 42.4 54.2 ckpt log

Testing results on Human3.6M dataset with CPN 2D detections1, semi-supervised training

Training Data Arch Receptive Field MPJPE P-MPJPE N-MPJPE ckpt log
10% S1 VideoPose3D-semi-supervised-CPN-27frm 27 67.3 50.4 63.6 ckpt log

1 CPN 2D detections are provided by official repo. The reformatted version used in this repository can be downloaded from train_detection and test_detection.