End-to-end human parsing and detection optimized for resource-constrained devices

Hosen, MD; Aydin, Tarkan; Islam, Md

doi:10.1038/s41598-025-30449-9

End-to-end human parsing and detection optimized for resource-constrained devices

Hosen M. I., Aydin T., Islam M. B.

Scientific Reports, cilt.16, sa.1, 2026 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 16 Sayı: 1
Basım Tarihi: 2026
Doi Numarası: 10.1038/s41598-025-30449-9
Dergi Adı: Scientific Reports
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, BIOSIS, Chemical Abstracts Core, MEDLINE, Directory of Open Access Journals
Anahtar Kelimeler: Multi-human parsing, Polygons annotation, Resource-constrained devices, Self-attention
İstanbul Ticaret Üniversitesi Adresli: Evet

Özet

Human parsing, a vital task in human-centric analysis, involves segmenting clothing and body parts for individual association. Existing methods often rely on auxiliary inputs like detection and edge prediction, limiting their suitability for resource-constrained devices. To address this, we propose an end-to-end framework that integrates a transformer based self-attention module to enhance contextual understanding while being optimized for low-resource environments. We also introduce bounding-polygon annotations to facilitate simultaneous detection and parsing. Our method achieves fine-grained results in a single pass, significantly improving inference speed without sacrificing accuracy. Real-world validation on Raspberry Pi demonstrates its effectiveness and efficiency in resource-constrained scenarios.