UNICA IRIS Institutional Research Information System

Large-scale face recognition datasets are collected by crawling the Internet and without individuals' consent, raising legal, ethical, and privacy concerns. With the recent advances in generative models, recently several works proposed generating synthetic face recognition datasets to mitigate concerns in web-crawled face recognition datasets. This paper presents the summary of the Synthetic Data for Face Recognition (SDFR) Competition held in conjunction with the 18th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2024) and established to investigate the use of synthetic data for training face recognition models. The SDFR competition was split into two tasks, allowing participants to train face recognition systems using new synthetic datasets and/or existing ones. In the first task, the face recognition backbone was fixed and the dataset size was limited, while the second task provided almost complete freedom on the model backbone, the dataset, and the training pipeline. The submitted models were trained on existing and also new synthetic datasets and used clever methods to improve training with synthetic data. The submissions were evaluated and ranked on a diverse set of seven benchmarking datasets. The paper gives an overview of the submitted face recognition models and reports achieved performance compared to baseline models trained on real and synthetic datasets. Furthermore, the evaluation of submissions is extended to bias assessment across different demography groups. Lastly, an outlook on the current state of the research in training face recognition models using synthetic data is presented, and existing problems as well as potential future directions are also discussed.

SDFR: Synthetic Data for Face Recognition competition

Shahreza, Hatef Otroshi;Ecabert, Christophe;George, Anjith;Unnervik, Alexander;Marcel, Sébastien;Di Domenico, Nicolò;Borghi, Guido;Maltoni, Davide;Boutros, Fadi;Vogel, Julia;Damer, Naser;Sánchez-Pérez, Ángela;Mas-Candela, Enrique;Calvo-Zaragoza, Jorge;Biesseck, Bernardo;Vidal, Pedro;Granada, Roger;Menotti, David;DeAndres-Tame, Ivan;La Cava, Simone Maurizio;Concas, Sara;Melzi, Pietro;Tolosana, Ruben;Vera-Rodriguez, Ruben;Perelli, Gianpaolo;Orru', Giulia;Marcialis, Gian Luca;Fierrez, Julian

2024-01-01

Abstract

Large-scale face recognition datasets are collected by crawling the Internet and without individuals' consent, raising legal, ethical, and privacy concerns. With the recent advances in generative models, recently several works proposed generating synthetic face recognition datasets to mitigate concerns in web-crawled face recognition datasets. This paper presents the summary of the Synthetic Data for Face Recognition (SDFR) Competition held in conjunction with the 18th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2024) and established to investigate the use of synthetic data for training face recognition models. The SDFR competition was split into two tasks, allowing participants to train face recognition systems using new synthetic datasets and/or existing ones. In the first task, the face recognition backbone was fixed and the dataset size was limited, while the second task provided almost complete freedom on the model backbone, the dataset, and the training pipeline. The submitted models were trained on existing and also new synthetic datasets and used clever methods to improve training with synthetic data. The submissions were evaluated and ranked on a diverse set of seven benchmarking datasets. The paper gives an overview of the submitted face recognition models and reports achieved performance compared to baseline models trained on real and synthetic datasets. Furthermore, the evaluation of submissions is extended to bias assessment across different demography groups. Lastly, an outlook on the current state of the research in training face recognition models using synthetic data is presented, and existing problems as well as potential future directions are also discussed.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Codice ISBN
	
				979-8-3503-9494-8
979-8-3503-9495-5
			
	Parole chiave
	
				Training; Ethics; Data privacy; Law; Face recognition; Gesture recognition; Data models
			
	Tipologia:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
SDFR_Synthetic_Data_for_Face_Recognition_Competition.pdf Solo gestori archivio Descrizione: VoR Tipologia: versione editoriale (VoR) Dimensione 2.05 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	2.05 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
SDFR_AAM.pdf accesso aperto Descrizione: AAM Tipologia: versione post-print (AAM) Dimensione 2.77 MB Formato Adobe PDF Visualizza/Apri	2.77 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11584/456508

Citazioni

ND

29

6

social impact