
RAFFAELE GIANCARLO

MapReduce in Computational Biology Via Hadoop and Spark

  • Authors: Cattaneo, Giuseppe; Giancarlo, Raffaele; Petrillo, Umberto Ferraro; Roscigno, Gianluca
  • Publication year: 2017
  • Type: Book chapter or essay
  • Keywords: Algorithm engineering; Bioinformatics; Distributed computing; Hadoop; MapReduce; Scalability; Spark
  • OA Link: http://hdl.handle.net/10447/291372

Abstract

Bioinformatics has a long history of software solutions developed on multi-core computing systems for solving computationally intensive problems. This approach suffers from some issues that can be solved by shifting to Distributed Systems. In particular, the MapReduce computing paradigm and its implementations, Hadoop and Spark, are becoming increasingly popular in the Bioinformatics field because they allow for virtually unlimited horizontal scalability while being easy to use. Here we provide a qualitative evaluation of some of the most significant MapReduce bioinformatics applications. We also focus on one of these applications to show the importance of correctly engineering an application to fully exploit the potential of Distributed Systems.