dc.contributor.author | Tejedor Noguerales, Javier | |
dc.contributor.author | Toledano, Doroteo T. | |
dc.contributor.author | López Otero, Paula | |
dc.contributor.author | Docío Fernández, Laura | |
dc.contributor.author | Peñagarikano, Mikel | |
dc.contributor.author | Rodríguez Fuentes, Luis Javier | |
dc.contributor.author | Moreno Sandoval, Antonio | |
dc.date.accessioned | 2022-03-10T11:56:55Z | |
dc.date.available | 2022-03-10T11:56:55Z | |
dc.date.issued | 2019-07-19 | |
dc.identifier.citation | EURASIP Journal on Audio Speech and Music Processing, 2019, 13 (2019) | spa |
dc.identifier.issn | 16874722 | |
dc.identifier.uri | http://hdl.handle.net/11093/3227 | |
dc.description.abstract | The huge amount of information stored in audio and video repositories makes search on speech (SoS) a priority areanowadays. Within SoS, Query-by-Example Spoken Term Detection (QbE STD) aims to retrieve data from a speechrepository given a spoken query. Research on this area is continuously fostered with the organization of QbE STDevaluations. This paper presents a multi-domain internationally open evaluation for QbE STD in Spanish. Theevaluation aims at retrieving the speech files that contain the queries, providing their start and end times, and a scorethat reflects the confidence given to the detection. Three different Spanish speech databases that encompassdifferent domains have been employed in the evaluation: MAVIR database, which comprises a set of talks fromworkshops; RTVE database, which includes broadcast television (TV) shows; and COREMAH database, which contains2-people spontaneous speech conversations about different topics. The evaluation has been designed carefully sothat several analyses of the main results can be carried out. We present the evaluation itself, the three databases, theevaluation metrics, the systems submitted to the evaluation, the results, and the detailed post-evaluation analysesbased on some query properties (within-vocabulary/out-of-vocabulary queries, single-word/multi-word queries, andnative/foreign queries). Fusion results of the primary systems submitted to the evaluation are also presented. Threedifferent teams took part in the evaluation, and ten different systems were submitted. The results suggest that theQbE STD task is still in progress, and the performance of these systems is highly sensitive to changes in the datadomain. Nevertheless, QbE STD strategies are able to outperform text-based STD in unseen data domains. | en |
dc.description.sponsorship | Xunta de Galicia | Ref. ED431G/01 | spa |
dc.description.sponsorship | Xunta de Galicia | Ref. ED431G/04 | spa |
dc.description.sponsorship | Ministerio de Economía y Competitividad | Ref. TEC2015-68172-C2-1-P | spa |
dc.description.sponsorship | Agencia Estatal de Investigación | Ref. RTI2018-098091-B-I00 | spa |
dc.language.iso | eng | en |
dc.publisher | EURASIP Journal on Audio Speech and Music Processing | spa |
dc.relation | info:eu-repo/grantAgreement/MINECO//TEC2015-68172-C2-1-P/ES/REDES PROFUNDAS Y MODELOS DE SUBESPACIOS PARA DETECCION Y SEGUIMIENTO DE LOCUTOR, IDIOMA Y ENFERMEDADES DEGENERATIVAS A PARTIR DE LA VOZ | |
dc.relation | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-098091-B-I00/ES/APRENDIZAJE PROFUNDO EN VOZ PARA APLICACIONES FORENSES Y DE SEGURIDAD | |
dc.rights | Attribution 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.title | Search on speech from spoken queries: the multi-domain International ALBAYZIN 2018 query-by-example spoken term detection evaluation | en |
dc.type | article | spa |
dc.rights.accessRights | openAccess | spa |
dc.identifier.doi | 10.1186/s13636-019-0156-x | |
dc.identifier.editor | https://asmp-eurasipjournals.springeropen.com/articles/10.1186/s13636-019-0156-x | spa |
dc.publisher.departamento | Teoría do sinal e comunicacións | spa |
dc.publisher.grupoinvestigacion | Grupo de Tecnoloxías Multimedia | spa |
dc.subject.unesco | 1203.04 Inteligencia Artificial | spa |
dc.subject.unesco | 2405 Biometría | |
dc.subject.unesco | 5701.09 Traducción Automática | |
dc.date.updated | 2022-03-09T12:55:48Z | |
dc.computerCitation | pub_title=EURASIP Journal on Audio Speech and Music Processing|volume=2019|journal_number=|start_pag=13|end_pag= | spa |