Comparing two Basic Methods for Discriminating Between Similar Languages and Varieties

dc.contributor.affiliation: Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
dc.contributor.author: Gamallo Otero, Pablo
dc.contributor.author: Pichel, José Ramom
dc.contributor.author: Alegria Loinaz, Iñaki
dc.contributor.author: Agirrezabal, Manex
dc.date.accessioned: 2026-01-12T13:15:58Z
dc.date.available: 2026-01-12T13:15:58Z
dc.date.issued: 2016-12-12
dc.description.abstract: This article describes the systems submitted by the Citius Ixa Imaxin team to the Discriminating Similar Languages Shared Task 2016. The systems are based on two different strategies: classification with ranked dictionaries and Naive Bayes classifiers. The results of the evaluation show that ranking dictionaries are more sound and stable across different domains, while basic Bayesian models perform reasonably well on in-domain datasets, but their performance drops when they are applied to out-of-domain texts.
dc.description.sponsorship: This work has been supported by the TelePares project
dc.description.sponsorship: imaxin software
dc.identifier.citation: Pablo Gamallo, Iñaki Alegria, José Ramom Pichel, and Manex Agirrezabal. 2016. Comparing Two Basic Methods for Discriminating Between Similar Languages and Varieties. In Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3), pages 170–177, Osaka, Japan. The COLING 2016 Organizing Committee.
dc.identifier.uri: https://hdl.handle.net/10347/45043
dc.language.iso: eng
dc.publisher: The COLING 2016 Organizing Committee
dc.relation.projectID: info:eu-repo/grantAgreement/MINECO//FFI2014-51978-C2-1-R/ES/TECNOLOGIAS DE LA LENGUA PARA ANALISIS DE OPINIONES EN REDES SOCIALES
dc.relation.publisherversion: https://aclanthology.org/W16-4822/
dc.rights: Attribution 4.0 International
dc.rights.accessRights: open access
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/
dc.subject: Language Identification
dc.subject.classification: 33 Ciencias tecnológicas
dc.title: Comparing two Basic Methods for Discriminating Between Similar Languages and Varieties
dc.type: book part
dc.type.hasVersion: VoR
dspace.entity.type: Publication
relation.isAuthorOfPublication: 898ee1bb-f9e8-4a75-9858-a6c9142bc99e
relation.isAuthorOfPublication: 24c27e24-a456-4990-9f2b-a669bc8a66ea
relation.isAuthorOfPublication.latestForDiscovery: 898ee1bb-f9e8-4a75-9858-a6c9142bc99e

Files

Original bundle

Name: 2016_coling_gamallo-pichel_comparing.pdf
Size: 129.95 KB
Format: Adobe Portable Document Format