Corpus PaGeS: A multifunctional resource for language learning, translation and cross-linguistic research

Loading...
Thumbnail Image
Identifiers
E-ISSN: 9789027262844
ISBN: 9789027202345

Publication date

Advisors

Tutors

Journal Title

Journal ISSN

Volume Title

Publisher

John Benjamins
Metrics
Google Scholar
lacobus
Export

Research Projects

Organizational Units

Journal Issue

Abstract

This chapter presents the bilingual parallel corpus PaGeS, compiled by the research group SpatiAlEs from the University of Santiago de Compostela. PaGeS currently amounts to nearly 20 million tokens and consists of texts originally written in German and in Spanish and their correspondent translations into the other language, as well as a small portion of German and Spanish translations from third languages. The present contribution introduces the main characteristics of the PaGeS corpus, focusing on its design and compilation. It first explains the criteria for the selection of the texts and the details of text pre-processing, automatic alignment and manual review. It then addresses the search and display features describing the server architecture and indexing process. Finally, the intended development of the PaGeS corpus is briefly discussed.

Description

Bibliographic citation

Doval, Irene, S. Fernández Lanza, T. Jiménez Juliá, E. Liste Lamas, and B. Lübke. 2019. “Corpus PaGeS: A Multifunctional Resource for Language Learning, Translation and Cross-Linguistic Research.” In Parallel Corpora for Contrastive and Translation Studies: New Resources and Applications, edited by Irene Doval and M. T. Sánchez Nieto, 103–121. Amsterdam: John Benjamins.

Relation

Has part

Has version

Is based on

Is part of

Is referenced by

Is version of

Requires

Sponsors

Agencia Estatal de Investigación: Optimización de un recurso multifuncional para el aprendizaje de lenguas, la traducción y la investigación linguistico-contrastiva. PaGeS 2.0. Referencia: FFI2017-85938-R IP: Irene Doval

Rights