Gamallo Otero, PabloGarcía González, MarcosPiñeiro Pomar, César AlfredoMartínez-Castaño, RodrigoPichel Campos, Juan Carlos2025-01-222025-01-222018-12-02P. Gamallo, M. Garcia, C. Piñeiro, R. Martinez-Castaño and J. C. Pichel, "LinguaKit: A Big Data-Based Multilingual Tool for Linguistic Analysis and Information Extraction," 2018 Fifth International Conference on Social Networks Analysis, Management and Security (SNAMS), Valencia, Spain, 2018, pp. 239-244, doi: 10.1109/SNAMS.2018.8554689.https://hdl.handle.net/10347/38902This paper presents LinguaKit, a multilingual suite of tools for analysis, extraction, annotation and linguistic correction, as well as its integration into a Big Data infrastructure. LinguaKit allows the user to perform different tasks such as PoS-tagging, syntactic parsing, coreference resolution (among others), including applications for relation extraction, sentiment analysis, summarization, extraction of multiword expressions, or entity linking to DBpedia. Most modules work in four languages: Portuguese, Spanish, English, and Galician. The system is programmed in Perl and is freely available under a GPLv3 license.eng© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.BilingualInformation ExtractionBig DataSentiment AnalysisPostageRelation ExtractionSyntactic AnalysisMulti-wordBasis Of AnalysisFault-tolerantAnalysis ModuleDisambiguationState MachineTokenizedRelated EntitiesInput TextList Of PairsBasic ModuleBig Data TechnologyProper NounsPhonetic TranscriptionKeyword ExtractionSemantic AnnotationLemmatizationApache SparkLanguage IdentificationLinguaKit: a Big Data-based multilingual tool for linguistic analysis and information extractionjournal article10.1109/SNAMS.2018.8554689open access