End-to-end human parsing and detection optimized for resource-constrained devices


Hosen M. I., Aydin T., Islam M. B.

Scientific Reports, vol.16, no.1, 2026 (SCI-Expanded, Scopus)

  • Publication Type: Article / Full Article
  • Volume: 16, Issue: 1
  • Publication Date: 2026
  • DOI: 10.1038/s41598-025-30449-9
  • Journal Name: Scientific Reports
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, BIOSIS, Chemical Abstracts Core, MEDLINE, Directory of Open Access Journals
  • Keywords: Multi-human parsing, Polygons annotation, Resource-constrained devices, Self-attention
  • Affiliated with İstanbul Ticaret Üniversitesi: Yes

Abstract

Human parsing, a vital task in human-centric analysis, involves segmenting clothing and body parts for individual association. Existing methods often rely on auxiliary inputs like detection and edge prediction, limiting their suitability for resource-constrained devices. To address this, we propose an end-to-end framework that integrates a transformer-based self-attention module to enhance contextual understanding while being optimized for low-resource environments. We also introduce bounding-polygon annotations to facilitate simultaneous detection and parsing. Our method achieves fine-grained results in a single pass, significantly improving inference speed without sacrificing accuracy. Real-world validation on a Raspberry Pi demonstrates its effectiveness and efficiency in resource-constrained scenarios.