Analysis of institutional authors

Camara, Mateo (Corresponding Author); Blanco-Murillo, Jose Luis (Author); Blanco, Jose Luis (Author)

January 21, 2026

Neural Audio Synthesis for Sound Effects: A Scope Review

Published in: IEEE Transactions on Audio Speech and Language Processing, 34, 427-445 (2026-01-01). DOI: 10.1109/TASLPRO.2025.3646080

Authors:

Camara, Mateo; Marcos, Fernando; Bargum, Anders R; Erkut, Cumhur; Reiss, Joshua; Blanco, Jose Luis

Affiliations

Aalborg Univ, Multisensory Experience Lab, DK-2450 Copenhagen, Denmark - Author
Queen Mary Univ London, Ctr Digital Mus, London E1 4NS, England - Author
Univ Politecn Madrid, Informat Proc & Telecommun Ctr, Madrid 28040, Spain - Author
Univ Politecn Madrid, Signal Proc Applicat Grp, Madrid 28040, Spain - Author

Abstract

Neural Audio Synthesis is dedicated to generating sound through generative neural networks. Sound effects are defined as auditory elements that complement a specific scene (in cinema, fiction, or videogames), support a storyline, enhance a fictional environment, or improve the perceived plausibility and presence (including Virtual Reality) without being music or dialog. This manuscript presents a quantitative review of the literature at the intersection of these two domains: the neural generation of sound effects. By leveraging large language models, we performed an extensive and systematic survey of the major scientific repositories, filtering the most relevant articles to ensure a thorough analysis. Our study examines various generation paradigms employed in sound synthesis, the specific types of sound effects created, the datasets used, and the evaluation metrics considered. Furthermore, we provide a forward-looking discussion on the evolution of this field towards multimodal approaches, where sound generation might integrate with other sensory modalities. All supporting materials and code are available online.

Keywords

Acoustic generators; Audio acoustics; Audio signal; Audio signal processing; Audio synthesis; Deep learning; Foley effect; Foley effects; Generation; Generative synthesis; Interactive computer graphics; Measurement; Media; Model; Neural audio synthesis; Neural networks; Neural-networks; Pipelines; Real-time systems; Reviews; Sfx; Signal-processing; Sound effects; Speech processing; Taxonomy; Training; Video-games; Virtual reality

Quality index

Impact and social visibility

From the perspective of influence or social adoption, and based on metrics associated with mentions and interactions provided by agencies specializing in calculating the so-called "Alternative or Social Metrics," we can highlight as of 2026-04-05:

  • The use of this contribution in bookmarks, code forks, additions to favorite lists for recurrent reading, and general views indicates that someone is using the publication as a basis for their current work. This may be a notable indicator of future, more formal academic citations. This claim is supported by the "Capture" indicator, which yields a total of 4 (PlumX).

Leadership analysis of institutional authors

This work has been carried out with international collaboration, specifically with researchers from: Denmark; United Kingdom.

There is a significant leadership presence, as some of the institution’s authors appear as first or last author: First Author (CAMARA LARGO, MATEO JOSE) and Last Author (YAGÜE BLANCO, JOSE LUIS).

The author responsible for correspondence has been CAMARA LARGO, MATEO JOSE.


Awards linked to the item

This work was supported in part by the European Union's Horizon 2020 Research and Innovation Programme under Grant 101003750 and in part by the Ministry of Economy and Competitiveness of Spain under Grant PID2021-128469OB-I00.