Skip to Main Content (Press Enter)

Logo UNIPD
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Competenze

UNI-FIND
Logo UNIPD

|

UNI-FIND

unipd.it
  • ×
  • Home
  • Persone
  • Pubblicazioni
  • Strutture
  • Terza Missione
  • Competenze
  1. Strutture

Exploiting Curated Databases to Train Relation Extraction Models for Gene-Disease Associations

Contributo in Atti di convegno
Data di Pubblicazione:
2022
Abstract:
Databases are pivotal to advancing biomedical science. Nevertheless, most of them are populated and
updated by human experts with a great deal of effort. Biomedical Relation Extraction (BioRE) aims to
shift these expensive and time-consuming processes to machines. Among its different applications, the
discovery of Gene-Disease Associations (GDAs) is one of the most pressing challenges. Despite this,
few resources have been devoted to training – and evaluating – models for GDA extraction. Besides,
such resources are limited in size, preventing models from scaling effectively to large amounts of data.
To overcome this limitation, we have exploited the DisGeNET database to build a large-scale, semi-
automatically annotated dataset for GDA extraction: TBGA. TBGA is generated from more than 700K
publications and consists of over 200K instances and 100K gene-disease pairs. We have evaluated state-
of-the-art models for GDA extraction on TBGA, showing that it is a challenging dataset for the task. The
dataset and models are publicly available to foster the development of state-of-the-art BioRE models for
GDA extraction.
Tipologia CRIS:
04.01 - Contributo in atti di convegno
Elenco autori:
Marchesin, S.; Silvello, G.
Autori di Ateneo:
MARCHESIN STEFANO
SILVELLO GIANMARIA
Link alla scheda completa:
https://www.research.unipd.it/handle/11577/3508118
Link al Full Text:
https://www.research.unipd.it//retrieve/handle/11577/3508118/786083/paper16.pdf
Titolo del libro:
Proceedings of the 30th Italian Symposium on Advanced Database Systems (SEBD 2022), June 19–22, 2022, Pisa, Italy
Pubblicato in:
CEUR WORKSHOP PROCEEDINGS
Journal
CEUR WORKSHOP PROCEEDINGS
Series
  • Utilizzo dei cookie

Realizzato con VIVO | Designed by Cineca | 26.5.1.0