Face Synthesis with a Focus on Facial Attributes Translation using Attention Mechanisms

IRIS

Synthesis of face images by translating facial attributes is an important problem in computer vision and biometrics and has a wide range of applications in forensics, entertainment, etc. Recent advances in deep generative networks have made progress in synthesizing face images with certain target facial attributes. However, visualizing and interpreting generative adversarial networks (GANs) is a relatively unexplored area and generative models are still being employed as black-box tools. This paper takes the first step to visually interpret conditional GANs for facial attribute translation by using a gradient-based attention mechanism. Next, a key innovation is to include new learning objectives for knowledge distillation using attention in generative adversarial training, which result in improved synthesized face results, reduced visual confusions and boosted training for GANs in a positive way. Firstly, visual attentions are calculated to provide interpretations for GANs. Secondly, gradient-based visual attentions are used as knowledge to be distilled in a teacher-student paradigm for face synthesis with focus on facial attributes translation tasks in order to improve the performance of the model. Finally, it is shown how “pseudo”-attentions knowledge distillation can be employed during the training of face synthesis networks when teacher and student networks are trained to generate different facial attributes. The approach is validated on facial attribute translation and human expression synthesis with both qualitative and quantitative results being presented.

Face Synthesis with a Focus on Facial Attributes Translation using Attention Mechanisms / Li, R.; Fontanini, T.; Prati, A.; Bhanu, B.. - In: IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE. - ISSN 2637-6407. - (2022). [10.1109/TBIOM.2022.3199707]

Face Synthesis with a Focus on Facial Attributes Translation using Attention Mechanisms

Li R.;Fontanini T.;Prati A.;Bhanu B.

2022-01-01

Abstract

Synthesis of face images by translating facial attributes is an important problem in computer vision and biometrics and has a wide range of applications in forensics, entertainment, etc. Recent advances in deep generative networks have made progress in synthesizing face images with certain target facial attributes. However, visualizing and interpreting generative adversarial networks (GANs) is a relatively unexplored area and generative models are still being employed as black-box tools. This paper takes the first step to visually interpret conditional GANs for facial attribute translation by using a gradient-based attention mechanism. Next, a key innovation is to include new learning objectives for knowledge distillation using attention in generative adversarial training, which result in improved synthesized face results, reduced visual confusions and boosted training for GANs in a positive way. Firstly, visual attentions are calculated to provide interpretations for GANs. Secondly, gradient-based visual attentions are used as knowledge to be distilled in a teacher-student paradigm for face synthesis with focus on facial attributes translation tasks in order to improve the performance of the model. Finally, it is shown how “pseudo”-attentions knowledge distillation can be employed during the training of face synthesis networks when teacher and student networks are trained to generate different facial attributes. The approach is validated on facial attribute translation and human expression synthesis with both qualitative and quantitative results being presented.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del prodotto
	
				2022
			
	Citazione
	
				Face Synthesis with a Focus on Facial Attributes Translation using Attention Mechanisms / Li, R.; Fontanini, T.; Prati, A.; Bhanu, B.. - In: IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE. - ISSN 2637-6407. - (2022). [10.1109/TBIOM.2022.3199707]
			
	Appare nelle tipologie:
	
				1.1 Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
face.pdf solo utenti autorizzati Tipologia: Versione (PDF) editoriale Licenza: NON PUBBLICO - Accesso privato/ristretto Dimensione 4.01 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	4.01 MB	Adobe PDF	Visualizza/Apri Richiedi una copia
Tbiom2022_final.pdf accesso aperto Tipologia: Documento in Post-print Licenza: Creative commons Dimensione 9.1 MB Formato Adobe PDF Visualizza/Apri	9.1 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11381/2932238

Citazioni

ND

2

0

social impact