|
JCSE, vol. 19, no. 2, pp.72-80, 2025
DOI: http://dx.doi.org/10.5626/JCSE.2025.19.2.72
SESAME: Selective Enhancement of Soft-argmax Estimation for Challenging Facial Landmark Detection via Adaptive Masking-Based Representation Learning
Yeeun Choi, Hyeonmin Jeong, Chaemin Yoo, Gwanghee Lee, and Kyoungson Jhang
Chungnam National University, Daejeon, Korea
Abstract: Facial landmark detection is a task that involves estimating landmark points such as eyes, nose, and mouth in facial images, and is utilized in various applications including facial recognition, emotion analysis, expression recognition, and person identification. Among implementation methods, coordinate regression faces a significant challenge with notably lower prediction accuracy in specific areas such as the jaw and below the ears. We propose a selective enhancement of soft-argmax estimation (SESAME) as a technique to address these limitations. SESAME consists of three stages. In the preliminary step, we train a randomly initialized facial landmark detection model using the WFLW facial landmark dataset to identify areas of poor performance. During the pre-training step, we focus on enhancing representation learning by applying intensive masking to poorly performing landmark regions in random facial input images from WFLW, followed by reconstruction. In the fine-tuning step, we further train the enhanced model using the WFLW dataset. Experimental results show significant improvement in the prediction accuracy of traditionally challenging areas, particularly in jawline regions where the normalized mean error (NME) substantially decreased from 6.547% to 6.184%. The overall average NME across all regions improved from 4.397% to 4.351%, demonstrating enhanced overall performance.
Keyword:
Deep learning; Computer vision; Facial landmark detection
Full Paper: 1 Downloads, 15 View
|