A docker-based replicability study of a neural information retrieval model
Conference proceedings contribution
Publication date:
2019
Abstract:
In this work, we propose a Docker image architecture for the replicability of Neural IR (NeuIR) models. We also share two self-contained Docker images to run the Neural Vector Space Model (NVSM) [22], an unsupervised NeuIR model. The first image we share (nvsm-cpu) can run on most machines and relies only on the CPU to perform the required computations. The second image we share (nvsm-gpu) relies instead on the Graphics Processing Unit (GPU) of the host machine, when available, to perform computationally intensive tasks, such as the training of the NVSM model. Furthermore, we discuss some insights on the engineering challenges we encountered in obtaining deterministic and consistent results from NeuIR models that rely on TensorFlow within Docker. We also provide an in-depth evaluation of the differences between the runs obtained with the shared images. These differences are due to the use within Docker of the TensorFlow and CUDA libraries, whose inherent randomness alters, under certain circumstances, the relative order of documents in rankings.
CRIS type:
04.01 - Conference proceedings contribution
Keywords:
Docker; Neural information retrieval; Replicability; Reproducibility
Authors:
Ferro, N.; Marchesin, S.; Purpura, A.; Silvello, G.
Book title:
Proc. of the Open-Source IR Replicability Challenge (OSIRRC 2019)
Published in: