Publications
>
Review

From vision to text: A comprehensive review of natural image captioning in medical and

Publicated to:Medical Image Analysis. 97 103264- - 2024-10-01 97(), DOI: 10.1016/j.media.2024.103264

Authors: Reale-Nosei, G; Amador-Domínguez, E; Serrano, E

Affiliations

Abstract

Natural Image Captioning (NIC) is an interdisciplinary research area that lies within the intersection of Computer Vision (CV) and Natural Language Processing (NLP). Several works have been presented on the subject, ranging from the early template-based approaches to the more recent deep learning-based methods. This paper conducts a survey in the area of NIC, especially focusing on its applications for Medical Image Captioning (MIC) and Diagnostic Captioning (DC) in the field of radiology. A review of the state-of-the-art is conducted summarizing key research works in NIC and DC to provide a wide overview on the subject. These works include existing NIC and MIC models, datasets, evaluation metrics, and previous reviews in the specialized literature. The revised work is thoroughly analyzed and discussed, highlighting the limitations of existing approaches and their potential implications in real clinical practice. Similarly, future potential research lines are outlined on the basis of the detected limitations.

Keywords

BenchmarkingClinical practiceComputer visionDeep learningDiagnosisDiagnostic captioningGenerationHumanImage captioningInterdisciplinary researchMedical image captioningMedical imagingMicrowave integrated circuitsModelsNatural image captioningNatural imagesNatural language processingNatural language processing systemsRadiologyRadiology report generationRadiology reportsReport generationReviewState-of-the art reviewsState-of-the-art reviewSurveyVision

Quality index

Bibliometric impact. Analysis of the contribution and dissemination channel

The work has been published in the journal Medical Image Analysis due to its progression and the good impact it has achieved in recent years, according to the agency WoS (JCR), it has become a reference in its field. In the year of publication of the work, 2024 there are still no calculated indicators, but in 2023, it was in position 6/123, thus managing to position itself as a Q1 (Primer Cuartil), in the category Computer Science, Artificial Intelligence. Notably, the journal is positioned above the 90th percentile.

Independientemente del impacto esperado determinado por el canal de difusión, es importante destacar el impacto real observado de la propia aportación.

Según las diferentes agencias de indexación, el número de citas acumuladas por esta publicación hasta la fecha 2025-05-31:

  • WoS: 4
  • Scopus: 8

Impact and social visibility

From the perspective of influence or social adoption, and based on metrics associated with mentions and interactions provided by agencies specializing in calculating the so-called "Alternative or Social Metrics," we can highlight as of 2025-05-31:

  • The use of this contribution in bookmarks, code forks, additions to favorite lists for recurrent reading, as well as general views, indicates that someone is using the publication as a basis for their current work. This may be a notable indicator of future more formal and academic citations. This claim is supported by the result of the "Capture" indicator, which yields a total of: 60 (PlumX).

It is essential to present evidence supporting full alignment with institutional principles and guidelines on Open Science and the Conservation and Dissemination of Intellectual Heritage. A clear example of this is:

  • Assignment of a Handle/URN as an identifier within the deposit in the Institutional Repository: https://oa.upm.es/84575/

As a result of the publication of the work in the institutional repository, statistical usage data has been obtained that reflects its impact. In terms of dissemination, we can state that, as of

  • Views: 67
  • Downloads: 80

Leadership analysis of institutional authors

There is a significant leadership presence as some of the institution’s authors appear as the first or last signer, detailed as follows: First Author (Reale-Nosei, G) and Last Author (SERRANO FERNANDEZ, EMILIO).

the author responsible for correspondence tasks has been AMADOR DOMINGUEZ, ELVIRA.