Corpus PaGeS: A multifunctional resource for language learning, translation and cross-linguistic research

dc.contributor.affiliation: Universidade de Santiago de Compostela. Departamento de Filoloxía Inglesa e Alemá
dc.contributor.author: Doval Reixa, Irene
dc.contributor.author: Fernández Lanza, Santiago
dc.contributor.author: Jiménez Juliá, Tomás
dc.contributor.author: Liste Lamas, Elsa
dc.contributor.author: Lübke, Barbara
dc.contributor.editor: Doval, Irene
dc.contributor.editor: Sánchez Nieto, María Teresa
dc.date.accessioned: 2025-01-30T10:05:43Z
dc.date.available: 2025-01-30T10:05:43Z
dc.date.issued: 2019
dc.description.abstract: This chapter presents the bilingual parallel corpus PaGeS, compiled by the research group SpatiAlEs at the University of Santiago de Compostela. PaGeS currently comprises nearly 20 million tokens and consists of texts originally written in German and in Spanish and their corresponding translations into the other language, as well as a small portion of German and Spanish translations from third languages. The present contribution introduces the main characteristics of the PaGeS corpus, focusing on its design and compilation. It first explains the criteria for the selection of the texts and the details of text pre-processing, automatic alignment and manual review. It then addresses the search and display features, describing the server architecture and indexing process. Finally, the intended development of the PaGeS corpus is briefly discussed.
dc.description.sponsorship: Agencia Estatal de Investigación: Optimización de un recurso multifuncional para el aprendizaje de lenguas, la traducción y la investigación linguistico-contrastiva. PaGeS 2.0. Referencia: FFI2017-85938-R IP: Irene Doval
dc.identifier.citation: Doval, Irene, S. Fernández Lanza, T. Jiménez Juliá, E. Liste Lamas, and B. Lübke. 2019. “Corpus PaGeS: A Multifunctional Resource for Language Learning, Translation and Cross-Linguistic Research.” In Parallel Corpora for Contrastive and Translation Studies: New Resources and Applications, edited by Irene Doval and M. T. Sánchez Nieto, 103–121. Amsterdam: John Benjamins.
dc.identifier.doi: 10.1075/scl.90.07dov
dc.identifier.essn: 9789027262844
dc.identifier.isbn: 9789027202345
dc.identifier.uri: https://hdl.handle.net/10347/39334
dc.language.iso: eng
dc.publisher: John Benjamins
dc.relation.ispartofseries: Studies in Corpus Linguistics; 90
dc.relation.projectID: info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/FFI2017-85938-R/ES/PAGES 2.0. OPTIMIZACION DE UN RECURSO MULTIFUNCIONAL PARA EL APRENDIZAJE DE LENGUAS, LA TRADUCCION Y LA INVESTIGACION LINGUISTICO-CONTRASTIVA/
dc.relation.publisherversion: https://benjamins.com/catalog/scl.90.07dov
dc.rights.accessRights: open access
dc.subject: Parallel corpora
dc.subject: Corpus alignment
dc.subject: Corpus visualization
dc.subject: Spanish/German
dc.subject.classification: 5701 Lingüística aplicada
dc.title: Corpus PaGeS: A multifunctional resource for language learning, translation and cross-linguistic research
dc.type: book part
dc.type.hasVersion: AM
dspace.entity.type: Publication
relation.isAuthorOfPublication: 6dd6a8e2-0713-49d8-bd83-bb134192a00f
relation.isAuthorOfPublication: eac23782-4c9f-4d49-9af6-e3fbab4adc51
relation.isAuthorOfPublication: e78ae49c-e55a-4179-8a65-cdd9c881083f
relation.isAuthorOfPublication.latestForDiscovery: 6dd6a8e2-0713-49d8-bd83-bb134192a00f

Files

Original bundle

Name: chapter_6_180801.pdf
Size: 812.23 KB
Format: Adobe Portable Document Format