Comparing two Basic Methods for Discriminating Between Similar Languages and Varieties
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS) | |
| dc.contributor.author | Gamallo Otero, Pablo | |
| dc.contributor.author | Pichel, José Ramom | |
| dc.contributor.author | Alegria Loinaz, Iñaki | |
| dc.contributor.author | Agirrezabal, Manex | |
| dc.date.accessioned | 2026-01-12T13:15:58Z | |
| dc.date.available | 2026-01-12T13:15:58Z | |
| dc.date.issued | 2016-12-12 | |
| dc.description.abstract | This article describes the systems submitted by the Citius Ixa Imaxin team to the Discriminating Similar Languages Shared Task 2016. The systems are based on two different strategies: classification with ranked dictionaries and Naive Bayes classifiers. The results of the evaluation show that ranking dictionaries are more sound and stable across different domains, while basic Bayesian models perform reasonably well on in-domain datasets but their performance drops when they are applied to out-of-domain texts. | |
| dc.description.sponsorship | This work has been supported by TelePares project | |
| dc.description.sponsorship | imaxin software | |
| dc.identifier.citation | Pablo Gamallo, Iñaki Alegria, José Ramom Pichel, and Manex Agirrezabal. 2016. Comparing Two Basic Methods for Discriminating Between Similar Languages and Varieties. In Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3), pages 170–177, Osaka, Japan. The COLING 2016 Organizing Committee. | |
| dc.identifier.uri | https://hdl.handle.net/10347/45043 | |
| dc.language.iso | eng | |
| dc.publisher | The COLING 2016 Organizing Committee | |
| dc.relation.projectID | info:eu-repo/grantAgreement/MINECO//FFI2014-51978-C2-1-R/ES/TECNOLOGIAS DE LA LENGUA PARA ANALISIS DE OPINIONES EN REDES SOCIALES | |
| dc.relation.publisherversion | https://aclanthology.org/W16-4822/ | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Language Identification | |
| dc.subject.classification | 33 Ciencias tecnológicas | |
| dc.title | Comparing two Basic Methods for Discriminating Between Similar Languages and Varieties | |
| dc.type | book part | |
| dc.type.hasVersion | VoR | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 898ee1bb-f9e8-4a75-9858-a6c9142bc99e | |
| relation.isAuthorOfPublication | 24c27e24-a456-4990-9f2b-a669bc8a66ea | |
| relation.isAuthorOfPublication.latestForDiscovery | 898ee1bb-f9e8-4a75-9858-a6c9142bc99e |
Files
Original bundle
- Name: 2016_coling_gamallo-pichel_comparing.pdf
- Size: 129.95 KB
- Format: Adobe Portable Document Format