Clever domain adaptation strategies for BERT in the task of hostile-language detection

Villa Cueva, Emilio; Aragón Saenzpardo, Mario Ezra; López Monroy, Adrián; Sánchez Vega, Fernando

doi:10.1007/s11042-026-21521-1

dc.contributor.affiliation	Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)
dc.contributor.author	Villa Cueva, Emilio
dc.contributor.author	Aragón Saenzpardo, Mario Ezra
dc.contributor.author	López Monroy, Adrián
dc.contributor.author	Sánchez Vega, Fernando
dc.date.accessioned	2026-04-22T07:25:15Z
dc.date.available	2026-04-22T07:25:15Z
dc.date.issued	2026-03-31
dc.description.abstract	Cyberbullying has surged in recent years, mainly due to the widespread adoption of social media platforms. It manifests in multiple ways, with hostile language being one of the most common, which underscores the urgent need for robust detection methods. To address this problem, we propose a novel pipeline to enhance hostile-language detection in social media. Our approach combines two ideas. First, we propose a Domain Adaptation procedure that specializes a pre-trained BERT in the domain of social media. For this adaptation, we modify the traditional random Masked Language Modeling technique and propose three novel strategies for cleverly selecting the subset of tokens to mask. Second, we tailor an Adversarial Regularizer for fine-tuning the adapted BERT on specific hostile-language datasets. We evaluate our method on the detection of hate speech, aggressiveness, offensiveness, and sexism. Our results show that the Domain Adaptation procedure significantly outperforms vanilla BERT, and that the Adversarial Regularizer can lead to more robust fine-tuning, thereby enhancing performance. Moreover, we demonstrate that these methods can be used together to achieve an even larger performance boost.
dc.description.peerreviewed	SI
dc.description.sponsorship	Villa-Cueva (CVU 1019520) thanks CONAHCYT for the support through the master’s degree scholarship at CIMAT. This work was supported by the project CBF-2025-I-4384, “Grandes Modelos de Lenguaje Especializados para Detectar Ciberacoso y Violencia Digital”, approved under the Ciencia Básica y de Frontera 2025 call of SECIHTI, Mexico. Mario Ezra Aragón thanks the support obtained from MICIU/AEI/10.13039/501100011033 (PID2022-137061OB-C22, supported by ERDF), Xunta de Galicia-Consellería de Cultura, Educación, Formación Profesional e Universidades (ED431G 2023/04, ED431C 2022/19, supported by ERDF), and the support obtained from the Juan de la Cierva Grant (JDC2023-052296-I), funded by MCIN/AEI/10.13039/501100011033 and by the FSE+. Sanchez-Vega acknowledges CONAHCYT / SECIHTI’s support through the program “Investigadoras e Investigadores por México” (Project ID 11989, No. 1311).
dc.description.sponsorship	Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature. This research was funded by Xunta de Galicia, Ministerio de Ciencia e Innovación (Spain), and CONAHCYT (Mexico).
dc.identifier.citation	Villa-Cueva, E., Aragón, M.E., López-Monroy, A.P., & Sánchez-Vega, F. (2026) Clever domain adaptation strategies for BERT in the task of hostile-language detection. Multimedia Tools and Applications, 85(323). https://doi.org/10.1007/s11042-026-21521-1
dc.identifier.doi	10.1007/s11042-026-21521-1
dc.identifier.essn	1573-7721
dc.identifier.uri	https://hdl.handle.net/10347/46880
dc.issue.number	323
dc.journal.title	Multimedia Tools and Applications
dc.language.iso	eng
dc.page.final	28
dc.page.initial	1
dc.publisher	Springer
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-137061OB-C22/ES/BUSQUEDA, SELECCION Y ORGANIZACION DE CONTENIDOS PARA NECESIDADES DE INFORMACION RELACIONADAS CON LA SALUD: BUSQUEDA Y DETECCION DE DESINFORMACIION
dc.relation.publisherversion	https://doi.org/10.1007/s11042-026-21521-1
dc.rights	This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
dc.rights	Attribution 4.0 International	en
dc.rights.accessRights	open access
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/
dc.subject	Hostile language
dc.subject	Domain adaptation
dc.subject	Social media
dc.subject	Text classification
dc.title	Clever domain adaptation strategies for BERT in the task of hostile-language detection
dc.type	journal article
dc.type.hasVersion	VoR
dc.volume.number	85
dspace.entity.type	Publication

Files

Original bundle

Name: 2026_multimedia_aragon_clever.pdf
Size: 2.52 MB
Format: Adobe Portable Document Format

Collections

Electrónica e Computación
Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS)