北京邮电大学学报

  • EI核心期刊

北京邮电大学学报 ›› 2024, Vol. 47 ›› Issue (5): 35-43.

• 论文 • 上一篇    下一篇

联合静动态关节关系的3D人体姿态估计

刘琼1,何建航2,温嘉校2   

  1. 1. 华南理工大学软件学院
    2. 华南理工大学
  • 收稿日期:2023-07-21 修回日期:2023-09-17 出版日期:2024-10-28 发布日期:2024-11-10
  • 通讯作者: 刘琼 E-mail:liuqiong@scut.edu.cn
  • 基金资助:
    广东省基础和应用基础研究基金

Modeling Static and Dynamic Joint Relationship for 3D Pose Estimation

  • Received:2023-07-21 Revised:2023-09-17 Online:2024-10-28 Published:2024-11-10
  • Contact: Qiong LIU E-mail:liuqiong@scut.edu.cn

摘要: 人体的非刚性和遮挡是3D姿态估计面临的重要挑战,关节关系建模是解决这类挑战的重要技术途径。基于物理约束建模静态关节关系的方法,因灵活性不足制约姿态估计性能,尤其难以应对遮挡和奇异致的关节缺失或偏移。提取姿态语义属性建模动态关节关系能优化3D姿态估计,但是,忽视人体结构的全局性和层次性很难达到预期。提出联合建模静动态关节关系的3D姿态估计方法。利用互信息计算获取人体关节关系图谱,藉此分组人体关节,进而,按三级人体关节自由度分级归集已划分的关节分组。设计级联估计及关节分组特征共享网络建模静态关节关系;设计多分组注意力机制提取姿态语义特征建模动态关节关系。为了强化模型的鲁棒性,辅以类别平衡姿态重组策略拓展数据多样性。基于Human3.6M、MPI-INF-3DHP和MPII等数据集进行实验,结果表明,较同类先进模型,本文模型平均误差至少降低0.2mm;精度至少提高0.2%;经数据增强,本文模型性能显著提高。

关键词: 深度歧义, 关节关系建模, 注意力机制, 3D人体姿态估计

Abstract: There are enormous challenges for 3D pose estimation due to non-rigid characteristics and occlusion. And modeling joint relationship is a key technology for that. It is difficult to deal with joint missing or skewing caused by occlusion or singularity if only modeling static joint relationships from human physical construction. It is beneficial to improve 3D pose estimation when extracting pose semantics and modeling dynamic joint relationship but the wholeness and hierarchy on human joint relationship can’t be neglected. This work proposes modeling static and dynamic joint relationships together for 3D pose estimation. Mutual information algorithm is used for obtaining a joint relationship map which used to group human joints. Then accumulate the human joint groups based on three level human joint freedoms level by level. A cascading estimation and group joint feature sharing networks are designed to model static joint relationship. Multi-group attention mechanisms are present for each level for extrating pose semantic feature to model dynamic joint relationships. A data enhancement policy by category balancing and pose reorganizing is presented further for improving model robustness. The experiments are carried out based on Human3.6M, MPI-INF-3DHP and MPII datasets extensively. The results show that the average error of our model is reduced 0.2 mm and the average accuracy is increased 0.2% at least when comparing with other advanced models. And the model performance is improved significantly when our data enhancement policy is carried out.

Key words: depth ambiguity, joint relationship modeling, attention mechanism, 3D human pose estimation

中图分类号: