Introducing idioms in the Galician WordNet: methods, problems and results

Publisher

De Gruyter

Abstract

This study describes the introduction of verbal idioms into Galnet, the Galician-language version of the semantic network WordNet, a network that traditionally includes few phraseological units. To enhance Galnet, a list of 803 Galician verbal idioms was compiled, and each idiom was reviewed individually to assess whether it could be added to an existing WordNet synset (a group of synonyms expressing the same concept). Of those 803 idioms, 490 (61%) could be included in the network. In addition, Galnet was enlarged with 750 further verbal idioms, most of them synonyms or variants of the former. We present the working methodology for the experiment and an analysis of the results, which helps to identify the main problems encountered when introducing idioms into Galnet. We also discuss the reasons that prevented the inclusion of some expressions and the criteria used to introduce the idioms that were ultimately added to the network.

Bibliographic citation

Álvarez de la Granja, María, Xosé María Gómez Clemente and Xavier Gómez Guinovart. "Introducing Idioms in the Galician WordNet: Methods, Problems and Results." Open Linguistics, 2.1 (2016): -. Retrieved 19 Oct. 2017, from doi:10.1515/opli-2016-0012

Sponsors

This research has been supported by the Xunta de Galicia and the European Union (under grant GRC2013/40) and by the Ministry of Economy and Competitiveness of the Spanish Government (Project TUNER: Automatic domain adaptation for semantic processing, TIN2015-65308-C5-1-R).

Rights

© 2016 María Álvarez de la Granja et al. This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License (BY-NC-ND 3.0).