Human pose estimation solutions: a low cost tool for increasing natural interaction in virtual television sets

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Ciencias da Comunicación
dc.contributor.authorArenas, Rubén
dc.contributor.authorMéndez Fernández, Roi
dc.contributor.authorPedraza, Luis
dc.contributor.authorCastelló Mayo, Enrique
dc.contributor.authorFlores González, Julián Carlos
dc.date.accessioned2025-10-21T09:03:45Z
dc.date.available2025-10-21T09:03:45Z
dc.date.issued2025-08-09
dc.description.abstractVirtual television sets (VTS) have experienced a growth spurt in recent years, paralleling the development of virtual reality tools and the metaverse. Nowadays, this technology is used in multiple broadcasts from major television companies to smaller tv stations ranging from sports programs to election nights, news, etc. Despite significant advances in recent years, such as improvements in real and virtual content composition or real-time realistic rendering, interaction between presenters and virtual content remains a challenge. This is primarily due to the high cost and complexity of the equipment required, as well as the limitations imposed by live production technology. In this context, we propose testing Human Pose Estimation (HPE) tools over the studio camera stream as a potential solution that does not require additional hardware integration into the system. We evaluated 14 HPE solutions using a three-step process. First, we assessed the robustness and viability of reliable real-time execution for each solution, with five solutions passing this initial phase. Secondly, we analyzed frames per second (FPS), RAM consumption, and CPU usage for each alternative in both local and global scenarios, considering both the ‘printmetrics’ and ‘no view’ options. BlazePose OpenVINO demonstrated the best performance in these tests and was selected for further testing in real-world scenarios. These tests have confirmed that HPE is a viable alternative for enhancing human-computer interaction in VTS. However, certain limitations remain, such as the lack of reliable depth data and the need for further analysis in detecting complex dynamic gestures. The proposed software-based VTSs promotes universal accessibility by eliminating the need for external control devices, reducing economic barriers and allowing users to customise natural, adaptive interactions that fit their individual capabilities and contextual needs.
dc.description.peerreviewedSI
dc.identifier.citationArenas, R., Méndez, R., Pedraza, L. et al. Human pose estimation solutions: a low cost tool for increasing natural interaction in virtual television sets. Univ Access Inf Soc (2025). https://doi.org/10.1007/s10209-025-01255-x
dc.identifier.doi10.1007/s10209-025-01255-x
dc.identifier.issn1615-5297
dc.identifier.urihttps://hdl.handle.net/10347/43310
dc.journal.titleUniversal Access in the Information Society
dc.language.isoeng
dc.publisherSpringer
dc.relation.publisherversionhttps://doi.org/10.1007/s10209-025-01255-x
dc.rights© The Author(s) 2025. Attribution 4.0 International
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectHuman pose estimation
dc.subjectVirtual television sets
dc.subjectHuman computer interaction
dc.subjectBenchmarking
dc.titleHuman pose estimation solutions: a low cost tool for increasing natural interaction in virtual television sets
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication9e17dc87-ad21-4b29-a28a-35b3a47c2216
relation.isAuthorOfPublication6aa8c734-4e5e-4ee3-8743-d5c745e5097a
relation.isAuthorOfPublication12b92448-8b3c-499c-9e5c-6b8e82695771
relation.isAuthorOfPublication.latestForDiscovery9e17dc87-ad21-4b29-a28a-35b3a47c2216

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2025_uais_flores_human.pdf
Size:
1.2 MB
Format:
Adobe Portable Document Format