Dataset bias exposed in face verification

López López, Eric; Pardo López, Xosé Manuel; Vázquez Regueiro, Carlos; Iglesias Rodríguez, Roberto; Estévez Casado, Fernando

doi:10.1049/iet-bmt.2018.5224

Dataset bias exposed in face verification

dc.contributor.affiliation	Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías da Información	gl
dc.contributor.affiliation	Universidade de Santiago de Compostela. Departamento de Electrónica e Computación	gl
dc.contributor.area	Área de Enxeñaría e Arquitectura
dc.contributor.author	López López, Eric
dc.contributor.author	Pardo López, Xosé Manuel
dc.contributor.author	Vázquez Regueiro, Carlos
dc.contributor.author	Iglesias Rodríguez, Roberto
dc.contributor.author	Estévez Casado, Fernando
dc.date.accessioned	2021-04-16T09:52:33Z
dc.date.available	2021-04-16T09:52:33Z
dc.date.issued	2019
dc.description	This is the peer reviewed version of the following article: López‐López, E., Pardo, X.M., Regueiro, C.V., Iglesias, R. and Casado, F.E. (2019), Dataset bias exposed in face verification. IET Biom., 8: 249-258, which has been published in final form at https://doi.org/10.1049/iet-bmt.2018.5224. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versions	gl
dc.description.abstract	Most facial verification methods assume that training and testing sets contain independent and identically distributed samples, although, in many real applications, this assumption does not hold. Whenever gathering a representative dataset in the target domain is unfeasible, it is necessary to choose one of the already available (source domain) datasets. Here, a study was performed over the differences among six public datasets, and how this impacts on the performance of the learned methods. In the considered scenario of mobile devices, the individual of interest is enrolled using a few facial images taken in the operational domain, while training impostors are drawn from one of the public available datasets. This work tried to shed light on the inherent differences among the datasets, and potential harms that should be considered when they are combined for training and testing. Results indicate that a drop in performance occurs whenever training and testing are done on different datasets compared to the case of using the same dataset in both phases. However, the decay strongly depends on the kind of features. Besides, the representation of samples in the feature space reveals insights into what extent bias is an endogenous or an exogenous factor	gl
dc.description.peerreviewed	SI	gl
dc.description.sponsorship	This work has received financial support from the Xunta de Galicia, Consellería de Cultura, Educación e Ordenación Universitaria (Accreditation 2016–2019, EDG431G/01 and ED431G/08, and reference competitive group 2014–2017, GRC2014/030), the European Union: European Social Fund (ESF), European Regional Development Fund (ERDF) and FEDER funds and (AEI/FEDER, UE) grant number TIN2017‐90135‐R. Eric López had received financial support from the Xunta de Galicia and the European Union (European Social Fund ‐ ESF)	gl
dc.identifier.citation	López‐López, E., Pardo, X.M., Regueiro, C.V., Iglesias, R. and Casado, F.E. (2019), Dataset bias exposed in face verification. IET Biom., 8: 249-258 . https://doi.org/10.1049/iet-bmt.2018.5224	gl
dc.identifier.doi	10.1049/iet-bmt.2018.5224
dc.identifier.essn	2047-4946
dc.identifier.uri	http://hdl.handle.net/10347/26000
dc.language.iso	eng	gl
dc.publisher	Wiley	gl
dc.relation.publisherversion	https://doi.org/10.1049/iet-bmt.2018.5224	gl
dc.rights	© 2019 The Institution of Engineering and Technology. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archiving	gl
dc.rights.accessRights	open access	gl
dc.subject	Face recognition	gl
dc.subject	Learning (artificial intelligence)	gl
dc.subject	Mobile devices	gl
dc.subject	Facial images	gl
dc.subject	Public available datasets	gl
dc.subject	Face verification	gl
dc.subject	Facial verification methods	gl
dc.subject	Target domain	gl
dc.subject	Source domain	gl
dc.title	Dataset bias exposed in face verification	gl
dc.type	journal article	gl
dc.type.hasVersion	AM	gl
dspace.entity.type	Publication
relation.isAuthorOfPublication	ec40b53b-a076-4895-9247-19ee9e6fbdce
relation.isAuthorOfPublication	99ba5c78-bd31-4c8b-976f-b495174c8099
relation.isAuthorOfPublication	1e9d9c35-bfa0-405f-849a-a1b61806ae85
relation.isAuthorOfPublication.latestForDiscovery	ec40b53b-a076-4895-9247-19ee9e6fbdce

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 2019_ietbmt_lopez_dataset.pdf
Size:: 8.25 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
Electrónica e Computación