{rfName}
Mu

Altmetrics

Analysis of institutional authors

Navas-Loro MAuthorGarijo DAuthorCorcho OAuthor

Share

January 25, 2023
Publications
>
Article
No

Multi-label Text Classification for Public Procurement in Spanish [Clasificación multi-etiqueta de textos de licitaciones públicas en español]

Publicated to: Procesamiento de Lenguaje Natural. 69 (69): 73-82 - 2022-09-01 69(69), DOI: 10.26342/2022-69-6

Authors:

Navas-Loro, Maria; Garijo, Daniel; Corcho, Oscar
[+]

Affiliations

Ontology Engineering Group, AI.nnovation Space, Universidad Politécnica de Madrid, Spain - Author
Univ Politecn Madrid, Ontol Engn Grp, AInnovat Space, Madrid, Spain - Author

Abstract

Public procurement accounts for a 14% of the annual budget of the different governments of the European Union. In Europe, contracting processes are classified using Common Procurement Vocabulary codes (CPVs), a taxonomy designed to facilitate statistical reporting, search and the creation of alerts that can be used by potential bidders. CPVs are commonly assigned manually by public employees in charge of contracting processes. However, CPV classification is not a trivial task, as there are more than 9,000 different CPV categories, which are often assigned following heterogeneous criteria. In this paper we have created a CPV classifier that uses as an input the textual description of the contracting process, and assigns CPVs from the 45 top-level CPV categories. We work only with texts in Spanish, although our approach may be easily extended to other languages. Our results improve the state of the art (10% F1-score improvement) and are available online. © 2022 Sociedad Española para el Procesamiento del Lenguaje Natural.
[+]

Keywords

hierarchical classificationmulti-label classificationpublic procurementCpvHierarchical classificationMulti-label classificationPublic procurement

Quality index

Bibliometric impact. Analysis of the contribution and dissemination channel

The work has been published in the journal Procesamiento de Lenguaje Natural due to its progression and the good impact it has achieved in recent years, according to the agency Scopus (SJR), it has become a reference in its field. In the year of publication of the work, 2022, it was in position , thus managing to position itself as a Q1 (Primer Cuartil), in the category Linguistics and Language.

Independientemente del impacto esperado determinado por el canal de difusión, es importante destacar el impacto real observado de la propia aportación.

Según las diferentes agencias de indexación, el número de citas acumuladas por esta publicación hasta la fecha 2026-04-09:

  • Google Scholar: 7
  • Scopus: 5
[+]

Impact and social visibility

From the perspective of influence or social adoption, and based on metrics associated with mentions and interactions provided by agencies specializing in calculating the so-called "Alternative or Social Metrics," we can highlight as of 2026-04-09:

  • The use of this contribution in bookmarks, code forks, additions to favorite lists for recurrent reading, as well as general views, indicates that someone is using the publication as a basis for their current work. This may be a notable indicator of future more formal and academic citations. This claim is supported by the result of the "Capture" indicator, which yields a total of: 7 (PlumX).

It is essential to present evidence supporting full alignment with institutional principles and guidelines on Open Science and the Conservation and Dissemination of Intellectual Heritage. A clear example of this is:

  • Assignment of a Handle/URN as an identifier within the deposit in the Institutional Repository: https://oa.upm.es/93618/

As a result of the publication of the work in the institutional repository, statistical usage data has been obtained that reflects its impact. In terms of dissemination, we can state that, as of

  • Views: 27
  • Downloads: 29
[+]

Leadership analysis of institutional authors

There is a significant leadership presence as some of the institution’s authors appear as the first or last signer, detailed as follows: First Author (NAVAS LORO, MARIA) and Last Author (CORCHO GARCIA, OSCAR).

[+]

Project objectives

La aportación persigue los siguientes objetivos: analizar la problemática de la clasificación manual de los códigos CPV en los procesos de contratación pública, caracterizar la complejidad derivada de la existencia de más de 9,000 categorías CPV y sus criterios heterogéneos de asignación, desarrollar un clasificador automático que utilice descripciones textuales en español para asignar códigos CPV en 45 categorías principales, evaluar el rendimiento del clasificador mediante métricas como la mejora del 10% en la puntuación F1 respecto al estado del arte, y facilitar la extensión del método a otros idiomas para mejorar la eficiencia y precisión en la clasificación de licitaciones públicas.
[+]

Most relevant results

El estudio presenta avances significativos en la clasificación multi-etiqueta de textos de licitaciones públicas en español. Los resultados más relevantes son: se desarrolló un clasificador de CPV que asigna categorías a partir de descripciones textuales, trabajando con los 45 códigos CPV de primer nivel; se enfocó exclusivamente en textos en español, con posibilidad de extensión a otros idiomas; se logró una mejora del 10% en la puntuación F1 respecto al estado del arte; y el sistema está disponible en línea para su uso y evaluación. Estos hallazgos contribuyen a optimizar la clasificación automática en procesos de contratación pública.
[+]