RT Journal Article T1 LinguaKit: a Big Data-based multilingual tool for linguistic analysis and information extraction A1 Gamallo Otero, Pablo A1 García González, Marcos A1 Piñeiro Pomar, César Alfredo A1 Martínez-Castaño, Rodrigo A1 Pichel Campos, Juan Carlos K1 Bilingual K1 Information Extraction K1 Big Data K1 Sentiment Analysis K1 Postage K1 Relation Extraction K1 Syntactic Analysis K1 Multi-word K1 Basis Of Analysis K1 Fault-tolerant K1 Analysis Module K1 Disambiguation K1 State Machine K1 Tokenized K1 Related Entities K1 Input Text K1 List Of Pairs K1 Basic Module K1 Big Data Technology K1 Proper Nouns K1 Phonetic Transcription K1 Keyword Extraction K1 Semantic Annotation K1 Lemmatization K1 Apache Spark K1 Language Identification AB This paper presents LinguaKit, a multilingual suite of tools for analysis, extraction, annotation and linguistic correction, as well as its integration into a Big Data infrastructure. LinguaKit allows the user to perform different tasks such as PoS-tagging, syntactic parsing, coreference resolution (among others), including applications for relation extraction, sentiment analysis, summarization, extraction of multiword expressions, or entity linking to DBpedia. Most modules work in four languages: Portuguese, Spanish, English, and Galician. The system is programmed in Perl and is freely available under a GPLv3 license. PB IEEE YR 2018 FD 2018-12-02 LK https://hdl.handle.net/10347/38902 UL https://hdl.handle.net/10347/38902 LA eng NO P. Gamallo, M. Garcia, C. Piñeiro, R. Martinez-Castaño and J. C. Pichel, "LinguaKit: A Big Data-Based Multilingual Tool for Linguistic Analysis and Information Extraction," 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS), Valencia, Spain, 2018, pp. 239-244, doi: 10.1109/SNAMS.2018.8554689. NO This work has been supported by MINECO (TIN2014-54565-JIN, FFI2014- 51978-C2-1-R), MICINN (IJCI-2016-29598), Xunta de Galicia (ED431G/08), European Regional Development Fund (ERDF), and by two BBVA Foundation Grants for Researchers and Cultural Creators (2016 and 2017). DS Minerva RD 22 abr 2026