Trusted guidance pyramid network for human parsing

Publication Type:
Conference Proceeding
MM 2018 - Proceedings of the 2018 ACM Multimedia Conference, 2018, pp. 654 - 662
Issue Date:
Filename Description Size
MM2018-manuscript.pdfAccepted Manuscript version1.11 MB
Adobe PDF
Full metadata record
© 2018 Association for Computing Machinery. Human parsing has a wide range of applications. However, none of the existing methods can productively solve the issue of label parsing fragmentation due to confused and complicated annotations. In this paper, we propose a novel Trusted Guidance Pyramid Network (TGPNet) to address this limitation. Based on a pyramid architecture, we design a Pyramid Residual Pooling (PRP) module setting at the end of a bottom-up approach to capture both global and local level context. In the top-down approach, we propose a Trusted Guidance Multi-scale Supervision (TGMS) that efficiently integrates and supervises multi-scale contextual information. Furthermore, we present a simple yet powerful Trusted Guidance Framework (TGF) which imposes global-level semantics into parsing results directly without extra ground truth labels in model training. Extensive experiments on two public human parsing benchmarks well demonstrate that our TGPNet has a strong ability in solving label parsing fragmentation problem and has an obtained improvement than other methods.
Please use this identifier to cite or link to this item: