MetrikaBox: An open framework for experimenting with audio classification

dc.contributor.affiliationUniversidade de Santiago de Compostela. Departamento de Electrónica e Computación
dc.contributor.authorPerianez-Pascual, Jorge
dc.contributor.authorGutiérrez, Juan D.
dc.contributor.authorDelgado, Emilio
dc.contributor.authorSánchez-Figueroa, Fernando
dc.contributor.authorRodriguez-Echeverria, Roberto
dc.date.accessioned2025-09-01T12:33:12Z
dc.date.available2025-09-01T12:33:12Z
dc.date.issued2025-08-28
dc.description.abstractThis paper presents MetrikaBox, a general-purpose, open-source, and extensible audio classification package designed to facilitate the development of Deep Learning (DL) models for a wide range of audio processing tasks. The software manages all necessary preprocessing steps to build classification models capable of distinguishing between user-defined classes using advanced Artificial Intelligence (AI) techniques. MetrikaBox is well suited for tasks such as musical genre classification, voice-versus-music discrimination, and other audio classification or segmentation applications. Users can either employ the package as provided or extend it by integrating their own datasets, classification models, data loading systems, augmentation techniques, and more. The package has been tested in both commercial and academic settings, where it has produced models for industrial audio processing and served as a platform for proof-of-concept applications. Comprehensive documentation and practical examples included in the repository support users in integrating the system into their audio analysis projects. MetrikaBox is openly available and provides a user interface for convenient testing.
dc.description.peerreviewedSI
dc.description.sponsorshipThis work was supported by Grant CPP2021-008491 funded by MICIU/AEI/10.13039/50100011033 and by the European Union NextGeneration EU/PRTR.
dc.identifier.citationSoftwareX Volume 31, September 2025, 102306
dc.identifier.doi10.1016/j.softx.2025.102306
dc.identifier.issn2352-7110
dc.identifier.urihttps://hdl.handle.net/10347/42739
dc.journal.titleSoftwareX
dc.language.isoeng
dc.publisherElsevier
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/CPP2021-008491/ES/MUSICGENIA: Una Plataforma en la Nube para de Generación de Música bajo Demanda por medio de Inteligencia Artificial/
dc.relation.publisherversionhttps://doi.org/10.1016/j.softx.2025.102306
dc.rights© 2025 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC license. Attribution-NonCommercial 4.0 International
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.subjectArtificial Intelligence
dc.subjectDeep Learning
dc.subjectAudio classification
dc.subjectNeural networks
dc.subjectDigital signal processing
dc.subjectSoftware engineering
dc.titleMetrikaBox: An open framework for experimenting with audio classification
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number31
dspace.entity.typePublication
relation.isAuthorOfPublication34f83200-7a0f-4455-a120-b9c6daf3bcd4
relation.isAuthorOfPublication.latestForDiscovery34f83200-7a0f-4455-a120-b9c6daf3bcd4

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1-s2.0-S2352711025002729-main.pdf
Size:
1.83 MB
Format:
Adobe Portable Document Format