Lexicographic resources and machine translation. Challenges and perspectives

Abstract

In this thesis we describe and evaluate a tool that automatically generates Spanish translations of multiword English terms from a monolingual specialized Spanish corpus compiled by web crawling. The resulting translations can be used to expand or revise existing lexicographic resources, i.e. dictionaries of languages for specific purposes, terminological databases and translation memories. We apply our approach to a small English-Spanish glossary in the legal domain, which serves both as a reference dictionary and as the source of input multiword terms, and evaluate the output with precision and recall. We analyze the results and suggest possible solutions to the problems detected, drawing on the theoretical part of this research, which covers machine translation, the use of dictionaries and corpora (non-parallel corpora in particular) for machine translation tasks, and work on automatic terminology extraction.
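The evaluation mentioned above can be illustrated with a minimal sketch. It is not the thesis's own implementation: the function name, the data layout and the exact definitions are assumptions made here for illustration. Precision is taken as the share of proposed translations found in the reference glossary, and recall as the share of glossary terms for which at least one correct translation was proposed. The sample terms are invented examples, not data from the thesis.

```python
def precision_recall(candidates, reference):
    """Score generated translations against a reference glossary.

    candidates: dict mapping an English term to a list of proposed
                Spanish translations (hypothetical tool output).
    reference:  dict mapping an English term to the set of Spanish
                translations accepted by the glossary.
    Definitions used here are assumptions, not taken from the thesis.
    """
    proposed = 0  # total translations proposed for glossary terms
    correct = 0   # proposed translations that appear in the glossary
    covered = 0   # glossary terms with at least one correct proposal

    for term, accepted in reference.items():
        outputs = candidates.get(term, [])
        hits = [t for t in outputs if t in accepted]
        proposed += len(outputs)
        correct += len(hits)
        if hits:
            covered += 1

    precision = correct / proposed if proposed else 0.0
    recall = covered / len(reference) if reference else 0.0
    return precision, recall


# Invented sample data for illustration only.
reference = {
    "power of attorney": {"poder notarial"},
    "burden of proof": {"carga de la prueba"},
    "due process": {"debido proceso"},
}
candidates = {
    "power of attorney": ["poder notarial", "poder de abogado"],
    "burden of proof": ["carga de la prueba"],
}

precision, recall = precision_recall(candidates, reference)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Because the glossary doubles as the source of input terms, a term with no proposed translation lowers recall but leaves precision unaffected under these definitions.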

Description

Master's thesis in Lexicography. Academic year 2017-2018

Rights

Attribution-NonCommercial-ShareAlike 4.0 International