Semi-Supervised Learning in the Field of Conversational Agents and Motivational Interviewing
| dc.contributor.affiliation | Universidade de Santiago de Compostela. Centro de Investigación en Tecnoloxías Intelixentes da USC (CiTIUS) | |
| dc.contributor.author | Rosenova Tsakova, Gergana | |
| dc.contributor.author | Fernández Pichel, Marcos | |
| dc.contributor.author | Meyer, Selina | |
| dc.contributor.author | Losada Carril, David Enrique | |
| dc.date.accessioned | 2025-12-16T12:17:31Z | |
| dc.date.available | 2025-12-16T12:17:31Z | |
| dc.date.issued | 2024-09-01 | |
| dc.description.abstract | The exploitation of Motivational Interviewing concepts for text analysis contributes to gaining valuable insights into individuals’ perspectives and attitudes towards behaviour change. The scarcity of labelled user data poses a persistent challenge and impedes technical advances in research under non-English language scenarios. To address the limitations of manual data labelling, we propose a semi-supervised learning method as a means to augment an existing training corpus. Our approach leverages machine-translated user-generated data sourced from social media communities and employs self-training techniques for annotation. To that end, we consider various source contexts and conduct an evaluation of multiple classifiers trained on various augmented datasets. The results indicate that this weak labelling approach does not yield improvements in the overall classification capabilities of the models. However, notable enhancements were observed for the minority classes. We conclude that several factors, including the quality of machine translation, can potentially bias the pseudo-labelling models and that the imbalanced nature of the data and the impact of a strict pre-filtering threshold need to be taken into account as inhibiting factors. | |
| dc.description.peerreviewed | SI | |
| dc.description.sponsorship | This work was supported by project PLEC2021- 007662 (MCIN/AEI/10.13039/501100011033, Plan de Recuperación, Transformación y Resiliencia, Next Generation EU). The authors also thank the financial support supplied by the Xunta de Galicia-Consellería de Cultura, Educación, Formación Profesional e Universidade (ED431G 2023/04, ED431C 2022/19) and the ERDF, which acknowledges the CiTIUS- Research Center in Intelligent Technologies of the USC as a Research Center of the Galician University System. David E. Losada thanks the financial support obtained from project SUBV23/00002 (Ministerio de Consumo, Subdirección General de Regulación del Juego) and project PID2022-137061OB-C22 (Ministerio de Ciencia e Innovación, AEI, Proyectos de Generación de Conocimiento; supported by the ERDF). | |
| dc.identifier.doi | 10.26342/2024-73-4 | |
| dc.identifier.issn | 1135-5948 | |
| dc.identifier.uri | https://hdl.handle.net/10347/44523 | |
| dc.journal.title | Procesamiento del Lenguaje Natural | |
| dc.language.iso | eng | |
| dc.publisher | Sociedad Española para el Procesamiento del Lenguaje Natural | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PLEC2021-007662/ES/Big-eRisk: Predicción temprana de riesgos personales en conjuntos de datos masivos | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-137061OB-C22/ES/BUSQUEDA, SELECCION Y ORGANIZACION DE CONTENIDOS PARA NECESIDADES DE INFORMACION RELACIONADAS CON LA SALUD: BUSQUEDA Y DETECCION DE DESINFORMACIION | |
| dc.relation.publisherversion | https://doi.org/10.26342/2024-73-4 | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.accessRights | open access | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Semi-supervised learning | |
| dc.subject | Motivational Interviewing | |
| dc.subject | Conversational Agents | |
| dc.title | Semi-Supervised Learning in the Field of Conversational Agents and Motivational Interviewing | |
| dc.title.alternative | Aprendizaje Semisupervisado en el ´Ambito de los Agentes Conversacionales y la Entrevista Motivacional | |
| dc.type | journal article | |
| dc.type.hasVersion | VoR | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | ad1c87f4-64b2-44aa-ab80-4709cef31dfe | |
| relation.isAuthorOfPublication | 7ddb36fe-bf39-4c79-85bc-540ce4d9a23b | |
| relation.isAuthorOfPublication.latestForDiscovery | ad1c87f4-64b2-44aa-ab80-4709cef31dfe |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 2024_pln_rosenova_semisupervised.pdf
- Size:
- 309.5 KB
- Format:
- Adobe Portable Document Format