Corpus PaGeS: A multifunctional resource for language learning, translation and cross-linguistic research
Loading...
Identifiers
Publication date
Advisors
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
John Benjamins
Abstract
This chapter presents the bilingual parallel corpus PaGeS, compiled by the research group SpatiAlEs from the University of Santiago de Compostela. PaGeS currently amounts to nearly 20 million tokens and consists of texts originally written in German and in Spanish and their correspondent translations into the other language, as well as a small portion of German and Spanish translations from third languages. The present contribution introduces the main characteristics of the PaGeS corpus, focusing on its design and compilation. It first explains the criteria for the selection of the texts and the details of text pre-processing, automatic alignment and manual review. It then addresses the search and display features describing the server architecture and indexing process. Finally, the intended development of the PaGeS corpus is briefly discussed.
Description
Bibliographic citation
Doval, Irene, S. Fernández Lanza, T. Jiménez Juliá, E. Liste Lamas, and B. Lübke. 2019. “Corpus PaGeS: A Multifunctional Resource for Language Learning, Translation and Cross-Linguistic Research.” In Parallel Corpora for Contrastive and Translation Studies: New Resources and Applications, edited by Irene Doval and M. T. Sánchez Nieto, 103–121. Amsterdam: John Benjamins.
Relation
Has part
Has version
Is based on
Is part of
Is referenced by
Is version of
Requires
Publisher version
https://benjamins.com/catalog/scl.90.07dovSponsors
Agencia Estatal de Investigación: Optimización de un recurso multifuncional para el aprendizaje de lenguas, la traducción y la investigación linguistico-contrastiva. PaGeS 2.0. Referencia: FFI2017-85938-R IP: Irene Doval








