Resource2Vec: Linked Data distributed representations for term discovery in automatic speech recognition

Indexado en

Licencia y uso

Citaciones

Cited 6 times in Scopus logo

Cited 6 times in Web of Science logo

Cited 8 times in Google Scholar logo

Altmetrics

Grant support

The work leading to these results has been developed as part of ASLP-MULAN (TIN2014-54288-C4-1-R) project, funded by Ministerio de Economia, Industria y Competitividad. We would also like to acknowledge all the other members of the Speech Technology Group at Universidad Politecnica de Madrid for the continuous and fruitful discussion on these topics.

Análisis de autorías institucional

Coucheiro-Limeres, AlejandroAutor (correspondencia)Ferreiros-Lopez, JavierAutor o CoautorSan-Segundo, RubenAutor o CoautorCordoba, RicardoAutor o Coautor

9 de junio de 2019

Publicaciones

Artículo

Resource2Vec: Linked Data distributed representations for term discovery in automatic speech recognition

Publicado en:Expert Systems With Applications. 112 301-320 - 2018-12-01 112(), DOI: 10.1016/j.eswa.2018.06.039

Autores: Coucheiro-Limeres, A; Ferreiros-López, J; San-Segundo, R; Córdoba, R

Afiliaciones

Univ Politecn Madrid, ETSI Telecomunicac, Speech Technol Grp, Av Complutense 30, Madrid, Spain - Autor o Coautor

Resumen

In this work we present a neural network embedding we call Resource2Vec, which is able to represent the resources that make up some Linked Data (LD) corpora. A vector representation of these resources allows more advantageous processing (in computational terms) as is the case with known word or document embeddings. We give a quantitative analysis for their study. Furthermore, we employ them in an Automatic Speech Recognition (ASR) task to demonstrate their functionality by designing a strategy for term discovery. This strategy permits out-of-vocabulary (OOV) terms in a Large Vocabulary Continuous Speech Recognition (LVCSR) system to be discovered and then put into the final transcription. First, we detect where a potential OOV term may have been uttered in the LVCSR output speech segments. Second, we carry out a candidate OOV search in some LD corpora. This search is oriented by distance measurements between the transcription context around the potential-OOV speech segment and the resources of the LD corpora in Resource2Vec format, obtaining a set of candidates. To rank them, we mainly depend on the phone transcription of that segment. Finally, we decide whether or not to incorporate a candidate into the final transcription. The results show we are able to improve the transcription in Word Error Rate (WER) terms significantly, after our strategy is used on speech in Spanish. (C) 2018 Elsevier Ltd. All rights reserved.

Palabras clave

Automatic speech recognitionLinked dataOut-of-vocabulary wordsResource2vec embeddingTerm discovery

Indicios de calidad

Impacto bibliométrico. Análisis de la aportación y canal de difusión

El trabajo ha sido publicado en la revista Expert Systems With Applications debido a la progresión y el buen impacto que ha alcanzado en los últimos años, según la agencia WoS (JCR), se ha convertido en una referencia en su campo. En el año de publicación del trabajo, 2018, se encontraba en la posición 7/84, consiguiendo con ello situarse como revista Q1 (Primer Cuartil), en la categoría Operations Research & Management Science. Destacable, igualmente, el hecho de que la Revista está posicionada por encima del Percentil 90.

Desde una perspectiva relativa, y atendiendo al indicador del impacto normalizado calculado a partir del Field Citation Ratio (FCR) de la fuente Dimensions, arroja un valor de: 1.14, lo que indica que, de manera comparada con trabajos en la misma disciplina y en el mismo año de publicación, lo ubica como trabajo citado por encima de la media. (fuente consultada: Dimensions Aug 2025)

De manera concreta y atendiendo a las diferentes agencias de indexación, el trabajo ha acumulado, hasta la fecha 2025-08-13, el siguiente número de citas:

WoS: 6
Scopus: 6
Google Scholar: 8

Impacto y visibilidad social

Análisis de liderazgo de los autores institucionales

Existe un liderazgo significativo ya que algunos de los autores pertenecientes a la institución aparecen como primer o último firmante, se puede apreciar en el detalle: Primer Autor (COUCHEIRO LIMERES, ALEJANDRO) y Último Autor (CORDOBA HERRALDE, RICARDO DE).

el autor responsable de establecer las labores de correspondencia ha sido COUCHEIRO LIMERES, ALEJANDRO.

Indexado en

Licencia y uso

Citaciones

Altmetrics

Grant support

Análisis de autorías institucional

Compartir

Resource2Vec: Linked Data distributed representations for term discovery in automatic speech recognition

Afiliaciones

Resumen

Palabras clave

Indicios de calidad

Impacto bibliométrico. Análisis de la aportación y canal de difusión

Impacto y visibilidad social

Análisis de liderazgo de los autores institucionales