Cross-lingual Diachronic Distance: Application to Portuguese and Spanish

dc.contributor.affiliationUniversidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
dc.contributor.authorPichel, José Ramom
dc.contributor.authorGamallo Otero, Pablo
dc.contributor.authorAlegria Loinaz, Iñaki
dc.date.accessioned2026-01-19T13:30:08Z
dc.date.available2026-01-19T13:30:08Z
dc.date.issued2019-09-01
dc.description.abstractThe aim of this paper is to establish a corpus-based methodology for automatically measuring the cross-lingual distance between historical periods of two languages using perplexity. The corpus of both has been constructed adhoc with the closest spelling to the original representing chronologically and in a balanced way fiction and non-fiction. The methodology has been applied to two related languages, Portuguese and Spanish, and measured their diachronic distances both in original orthography and in an automatically transcribed spelling.
dc.description.peerreviewedSI
dc.description.sponsorshipPGC2018-102041-B-I00, MCIU/AEI/FEDER, UE
dc.description.sponsorshipConsellería de Cultura, Educación e Ordenación Universitaria (accreditation 2016- 2019, ED431G/08)
dc.description.sponsorshipEuropean Regional Development Fund (ERDF)
dc.identifier.doi10.26342/2019-63-8
dc.identifier.issn1135-5948
dc.identifier.urihttps://hdl.handle.net/10347/45262
dc.journal.titleProcesamiento del Lenguaje Natural
dc.language.isoeng
dc.page.final84
dc.page.initial77
dc.publisherSEPLN (Sociedad Española del Procesamiento del Lenguaje Natural)
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PGC2018-102041-B-I00/ES/TRADUCCION AUTOMATICA NEURONAL, EN DOMINIO, NO SUPERVISADA
dc.relation.publisherversionhttp://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6097
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectCorpus linguistics
dc.subjectHistorical Linguistics
dc.subjectLanguage distance
dc.subjectDevelopment of linguistic resources and tools
dc.subject.classification33 Ciencias tecnológicas
dc.titleCross-lingual Diachronic Distance: Application to Portuguese and Spanish
dc.title.alternativeDistancia diacrónica interlingüística: aplicación al portugués y el castellano
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number63
dspace.entity.typePublication
relation.isAuthorOfPublication24c27e24-a456-4990-9f2b-a669bc8a66ea
relation.isAuthorOfPublication898ee1bb-f9e8-4a75-9858-a6c9142bc99e
relation.isAuthorOfPublication.latestForDiscovery24c27e24-a456-4990-9f2b-a669bc8a66ea

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2019_pln_pichel_cross-lingual.pdf
Size:
920.62 KB
Format:
Adobe Portable Document Format