RT Journal Article T1 Human pose estimation solutions: a low cost tool for increasing natural interaction in virtual television sets A1 Arenas, Rubén A1 Méndez Fernández, Roi A1 Pedraza, Luis A1 Castelló Mayo, Enrique A1 Flores González, Julián Carlos K1 Human pose estimation K1 Virtual television sets K1 Human computer interaction K1 Benchmarking AB Virtual television sets (VTS) have experienced a growth spurt in recent years, paralleling the development of virtual reality tools and the metaverse. Nowadays, this technology is used in multiple broadcasts from major television companies to smaller tv stations ranging from sports programs to election nights, news, etc. Despite significant advances in recent years, such as improvements in real and virtual content composition or real-time realistic rendering, interaction between presenters and virtual content remains a challenge. This is primarily due to the high cost and complexity of the equipment required, as well as the limitations imposed by live production technology. In this context, we propose testing Human Pose Estimation (HPE) tools over the studio camera stream as a potential solution that does not require additional hardware integration into the system. We evaluated 14 HPE solutions using a three-step process. First, we assessed the robustness and viability of reliable real-time execution for each solution, with five solutions passing this initial phase. Secondly, we analyzed frames per second (FPS), RAM consumption, and CPU usage for each alternative in both local and global scenarios, considering both the ‘printmetrics’ and ‘no view’ options. BlazePose OpenVINO demonstrated the best performance in these tests and was selected for further testing in real-world scenarios. These tests have confirmed that HPE is a viable alternative for enhancing human-computer interaction in VTS. However, certain limitations remain, such as the lack of reliable depth data and the need for further analysis in detecting complex dynamic gestures. The proposed software-based VTSs promotes universal accessibility by eliminating the need for external control devices, reducing economic barriers and allowing users to customise natural, adaptive interactions that fit their individual capabilities and contextual needs. PB Springer SN 1615-5297 YR 2025 FD 2025-08-09 LK https://hdl.handle.net/10347/43310 UL https://hdl.handle.net/10347/43310 LA eng NO Arenas, R., Méndez, R., Pedraza, L. et al. Human pose estimation solutions: a low cost tool for increasing natural interaction in virtual television sets. Univ Access Inf Soc (2025). https://doi.org/10.1007/s10209-025-01255-x DS Minerva RD 28 abr 2026