Benchmarking Automatic Tools for Neologisms Extraction: Issues and Challenges
Conference proceedings paper
Publication date:
2025
Abstract:
Human language is constantly evolving, driven by societal, technological, and cultural shifts, which lead to the creation of new terms and expressions. The rise of digital platforms, including social media and academic publications, has accelerated the introduction and spread of these neologisms. This paper explores current advancements and challenges in benchmarking automated and semi-automated tools for extracting neologisms. In particular, we will discuss challenges in dataset creation and evaluation procedures, such as defining neologisms, ensuring diverse text sources, managing annotation variability, and evaluating these tools.
CRIS type:
04.01 - Conference proceedings paper
Keywords:
dataset creation; evaluation methodology; neologisms extraction
Authors:
Di Nunzio, G. M.
Book title:
CEUR Workshop Proceedings