UNI-FIND

FONTI 4.0: Evaluating speech-to-text automatic transcription of digitized historical oral sources

Contribution in conference proceedings
Publication date:
2021
Abstract:
Conducting “manual” transcriptions and analyses is unsustainable for most historical oral archives because they require a remarkable amount of funds and time. The FONTI 4.0 project aims at exploring the suitability of automatic transcription and information extraction technologies for making historical oral sources available. In this work, we conducted an experiment to test the performance of two commercial speech-to-text services (Google Cloud Speech-to-Text and Amazon Transcribe) on digitized oral sources. We created an eight-hour corpus made of manually transcribed and annotated historical speech recordings in TEI format. The results clearly show how audio quality and disturbing elements (e.g., overlaps, foreign words) affect automatic transcription, and indicate what needs to be improved to implement an unsupervised transcription chain.
CRIS type:
04.01 - Contribution in conference proceedings
Author list:
Luzietti, R. B.; Pretto, N.; Kaplan, F.; Dufaux, A.; Canazza, S.
University authors:
CANAZZA TARGON SERGIO
Link to the full record:
https://www.research.unipd.it/handle/11577/3418254
Link to the full text:
https://www.research.unipd.it//retrieve/handle/11577/3418254/549413/2021_Luzietti_FONTI40_published.pdf
Book title:
CEUR Workshop Proceedings
Published in:
CEUR WORKSHOP PROCEEDINGS (Journal, Series)
General data

URL

http://ceur-ws.org/Vol-3033/paper45.pdf; doi.org/10.5281/zenodo.5645827