RT Book,_Section T1 An Architecture for Document Routing in Spanish: Two Language Components, PreProcessor and Parser A1 Rojo Sánchez, Guillermo A1 Álvarez, Concepción A1 Alvariño, Pilar A1 Gil, Adelaida A1 Santalla del Río, María Paula A1 Sotelo, Susana A2 Gavridilou, Maria A2 Carayannis, George A2 Markantonatou, Stella A2 Piperidis, Stelios A2 Stainhauer, Gregory K1 Document routing K1 Natural language processing K1 Syntactic analysis K1 Normalization AB This paper describes the language components of a system for Document Routing in Spanish. The system identifies relevant terms for classification within involved documents by means of natural language processing techniques. These techniques are based on the isolation and normalization of syntactic unities considered relevant for the classification, especially noun phrases, but also other constituents built around verbs, adverbs, pronouns or adjectives. After a general introduction about the research project, the second Section relates our approach to the problem with other previous and current approaches, the third one describes corpora used for evaluating the system. The linguistic analysis architecture, including pre-processing and two different levels of syntactic analysis, is described in following fourth and fifth Sections, while the last one is dedicated to a comparative analysis of results obtained from the processing of corpora introduced in third Section. Certain future developments of the system are also included in this Section. PB European Language Resources Association (ELRA) YR 2000 FD 2000 LK https://hdl.handle.net/10347/38320 UL https://hdl.handle.net/10347/38320 LA eng NO Rojo, Guillermo, Concepción Álvarez, Pilar Alvariño, Adelaida Gil, María Paula Santalla, Susana Sotelo. (2000). An Architecture for Document Routing in Spanish: Two Language Components, PreProcessor and Parser. En: M. Gavrilidou, G. Carayannis, S. Markantonatou, S. Piperidis, G. Stainhauer, "Proceedings of the Second International Conference on Language Resources and Evaluation (LREC-2000, 31 de mayo--2 de junio de 2000, Atenas)", (Volumen 2: pp. 675-682). European Language Resources Association (ELRA). NO European Commission Directorate General III (DGIII), within the Fourth Framework Programme (1994-1998) of European Union DS Minerva RD 27 abr 2026