Research Collection School Of Computing and Information Systems

Mask-guided deformation adaptive network for human parsing

Publication Type

Journal Article

Version

publishedVersion

Publication Date

3-2022

Abstract

Due to the challenges of densely compacted body parts, nonrigid clothing items, and severe overlap in crowd scenes, human parsing needs to focus more on multilevel feature representations compared to general scene parsing tasks. Based on this observation, we propose to introduce the auxiliary task of human mask and edge detection to facilitate human parsing. Different from human parsing, which exploits the discriminative features of each category, human mask and edge detection emphasizes the boundaries of semantic parsing regions and the difference between foreground humans and background clutter, which benefits the parsing predictions of crowd scenes and small human parts. Specifically, we extract human mask and edge labels from the human parsing annotations and train a shared encoder with three independent decoders for the three mutually beneficial tasks. Furthermore, the decoder feature maps of the human mask prediction branch are further exploited as attention maps, indicating human regions to facilitate the decoding process of human parsing and human edge detection. In addition to these auxiliary tasks, we further alleviate the problem of deformed clothing items under various human poses by tracking the deformation patterns with the deformable convolution. Extensive experiments show that the proposed method can achieve superior performance against state-of-the-art methods on both single and multiple human parsing datasets. Codes and trained models are available https://github.com/ViktorLiang/MGDAN.

Keywords

Human parsing, multi-task learning, deformable convolution

Discipline

Graphics and Human Computer Interfaces

Research Areas

Software and Cyber-Physical Systems

Publication

ACM Transactions on Multimedia Computing, Communications and Applications

Volume

Issue

First Page

Last Page

ISSN

1551-6857

Identifier

10.1145/3467889

Publisher

Association for Computing Machinery (ACM)

Citation

MAO, Aihua; LIANG, Yuan; JIAO, Jianbo; LIU, Yongtuo; and HE, Shengfeng. Mask-guided deformation adaptive network for human parsing. (2022). ACM Transactions on Multimedia Computing, Communications and Applications. 18, (1), 1-20.
Available at: https://ink.library.smu.edu.sg/sis_research/7873

Copyright Owner and License

Publisher

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1145/3467889

Download

Find it in your library

Included in

Graphics and Human Computer Interfaces Commons

COinS

Research Collection School Of Computing and Information Systems

Mask-guided deformation adaptive network for human parsing

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Mask-guided deformation adaptive network for human parsing

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links