Dataset bias exposed in face verification

dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías da Informacióngl
dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computacióngl
dc.contributor.areaÁrea de Enxeñaría e Arquitectura
dc.contributor.authorLópez López, Eric
dc.contributor.authorPardo López, Xosé Manuel
dc.contributor.authorVázquez Regueiro, Carlos
dc.contributor.authorIglesias Rodríguez, Roberto
dc.contributor.authorEstévez Casado, Fernando
dc.date.accessioned2021-04-16T09:52:33Z
dc.date.available2021-04-16T09:52:33Z
dc.date.issued2019
dc.descriptionThis is the peer reviewed version of the following article: López‐López, E., Pardo, X.M., Regueiro, C.V., Iglesias, R. and Casado, F.E. (2019), Dataset bias exposed in face verification. IET Biom., 8: 249-258, which has been published in final form at https://doi.org/10.1049/iet-bmt.2018.5224. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versionsgl
dc.description.abstractMost facial verification methods assume that training and testing sets contain independent and identically distributed samples, although, in many real applications, this assumption does not hold. Whenever gathering a representative dataset in the target domain is unfeasible, it is necessary to choose one of the already available (source domain) datasets. Here, a study was performed over the differences among six public datasets, and how this impacts on the performance of the learned methods. In the considered scenario of mobile devices, the individual of interest is enrolled using a few facial images taken in the operational domain, while training impostors are drawn from one of the public available datasets. This work tried to shed light on the inherent differences among the datasets, and potential harms that should be considered when they are combined for training and testing. Results indicate that a drop in performance occurs whenever training and testing are done on different datasets compared to the case of using the same dataset in both phases. However, the decay strongly depends on the kind of features. Besides, the representation of samples in the feature space reveals insights into what extent bias is an endogenous or an exogenous factorgl
dc.description.peerreviewedSIgl
dc.description.sponsorshipThis work has received financial support from the Xunta de Galicia, Consellería de Cultura, Educación e Ordenación Universitaria (Accreditation 2016–2019, EDG431G/01 and ED431G/08, and reference competitive group 2014–2017, GRC2014/030), the European Union: European Social Fund (ESF), European Regional Development Fund (ERDF) and FEDER funds and (AEI/FEDER, UE) grant number TIN2017‐90135‐R. Eric López had received financial support from the Xunta de Galicia and the European Union (European Social Fund ‐ ESF)gl
dc.identifier.citationLópez‐López, E., Pardo, X.M., Regueiro, C.V., Iglesias, R. and Casado, F.E. (2019), Dataset bias exposed in face verification. IET Biom., 8: 249-258 . https://doi.org/10.1049/iet-bmt.2018.5224gl
dc.identifier.doi10.1049/iet-bmt.2018.5224
dc.identifier.essn2047-4946
dc.identifier.urihttp://hdl.handle.net/10347/26000
dc.language.isoenggl
dc.publisherWileygl
dc.relation.publisherversionhttps://doi.org/10.1049/iet-bmt.2018.5224gl
dc.rights© 2019 The Institution of Engineering and Technology. This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Self-Archivinggl
dc.rights.accessRightsopen accessgl
dc.subjectFace recognitiongl
dc.subjectLearning (artificial intelligence)gl
dc.subjectMobile devicesgl
dc.subjectFacial imagesgl
dc.subjectPublic available datasetsgl
dc.subjectFace verificationgl
dc.subjectFacial verification methodsgl
dc.subjectTarget domaingl
dc.subjectSource domaingl
dc.titleDataset bias exposed in face verificationgl
dc.typejournal articlegl
dc.type.hasVersionAMgl
dspace.entity.typePublication
relation.isAuthorOfPublicationec40b53b-a076-4895-9247-19ee9e6fbdce
relation.isAuthorOfPublication99ba5c78-bd31-4c8b-976f-b495174c8099
relation.isAuthorOfPublication1e9d9c35-bfa0-405f-849a-a1b61806ae85
relation.isAuthorOfPublication.latestForDiscoveryec40b53b-a076-4895-9247-19ee9e6fbdce

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2019_ietbmt_lopez_dataset.pdf
Size:
8.25 MB
Format:
Adobe Portable Document Format
Description: