Multiscale Facial Detection using RetinaFace Architecture with Loss Function

Irma Amelia Dewi; Nadhiva Adzra Tsania Maryadi

doi:10.47709/cnahpc.v7i3.6161

Authors

Irma Amelia Dewi, Department of Informatics, Institut Teknologi Nasional Bandung, Indonesia
Nadhiva Adzra Tsania Maryadi, Department of Informatics, Institut Teknologi Nasional Bandung, Indonesia

Keywords:

ArcFace loss; Facial Recognition; RetinaFace; SphereFace loss; WIDER FACE

Abstract

Facial recognition technology has become increasingly prevalent in modern applications, from security systems to social media platforms. One of the most significant challenges in this field remains the accurate detection of faces across varying scales, orientations, and image qualities. Traditional face detection methods often struggle when faces appear at different sizes within the same image or when the imagery is low-resolution, which leads to inconsistent performance and can compromise system reliability. The RetinaFace architecture is a promising solution to these multiscale detection challenges. By incorporating a Feature Pyramid Network (FPN), it builds a hierarchical feature representation that enables detection of faces regardless of their size in the image. The FPN combines low-resolution, semantically strong features with high-resolution, semantically weak features, producing a feature pyramid that captures facial characteristics at multiple scales simultaneously. Context modules within RetinaFace further improve detection by supplying additional contextual information that helps distinguish faces from background noise and other objects. This allows the system to maintain high accuracy even in challenging scenarios where faces appear small, partially occluded, or at unusual angles. A comparative analysis of the ArcFace and SphereFace loss functions provides insight into optimization strategies for facial recognition systems. Experimental results on the WIDER FACE dataset show strong performance, with the RetinaFace-ResNet152-SphereFace combination achieving 94% accuracy. These findings highlight the importance of architectural choices and loss function selection in developing robust facial recognition systems capable of handling real-world deployment challenges.
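
The way a feature pyramid merges coarse, semantically strong features into finer, higher-resolution ones can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes single-channel feature maps stored as nested lists, nearest-neighbour 2x upsampling, and plain element-wise addition, and it omits the lateral 1x1 and smoothing 3x3 convolutions that a full FPN applies. The names `upsample_nearest_2x`, `add_maps`, and `top_down_pyramid`, and the toy values, are hypothetical.

```python
from typing import List

# Single-channel H x W feature map (a simplification assumed for illustration).
FeatureMap = List[List[float]]


def upsample_nearest_2x(fmap: FeatureMap) -> FeatureMap:
    """Double height and width by repeating each value (nearest neighbour)."""
    out: FeatureMap = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column twice
        out.append(wide)
        out.append(list(wide))                     # repeat each row twice
    return out


def add_maps(a: FeatureMap, b: FeatureMap) -> FeatureMap:
    """Element-wise sum of two maps that must have identical shape."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("feature maps must have the same shape")
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def top_down_pyramid(backbone_maps: List[FeatureMap]) -> List[FeatureMap]:
    """Merge backbone maps, ordered fine to coarse, into pyramid levels.

    The coarsest map is used as-is. Each finer level is its own backbone
    map plus the upsampled pyramid level directly above it.
    """
    pyramid = [backbone_maps[-1]]
    for lateral in reversed(backbone_maps[:-1]):
        merged = add_maps(lateral, upsample_nearest_2x(pyramid[0]))
        pyramid.insert(0, merged)
    return pyramid


# Toy backbone outputs: each level halves the spatial resolution.
c3 = [[0.5] * 4 for _ in range(4)]  # 4x4, fine detail, weak semantics
c4 = [[1.0, 2.0], [3.0, 4.0]]       # 2x2
c5 = [[10.0]]                       # 1x1, coarse, strong semantics

p3, p4, p5 = top_down_pyramid([c3, c4, c5])
print(p4)     # [[11.0, 12.0], [13.0, 14.0]]
print(p3[0])  # [11.5, 11.5, 12.5, 12.5]
```

In this sketch every pyramid level carries the contribution of the coarsest map, so a detector head attached to the finest level (where small faces are resolved) still sees the high-level signal.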
