MARAGAP: A modular approach to reference assisted genome assembly pipeline

Bilal Wajid*, Erchin Serpedin, Mohamed Nounou, Hazem Nounou

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

4 Citations (Scopus)

Abstract

This paper presents MARAGAP, a modular pipeline for reference-assisted genome assembly. MARAGAP uses the Minimum Description Length principle to determine the optimal reference sequence for the assembly. The optimal reference sequence is then used as a template to infer inversions, insertions, deletions and SNPs in the target genome. MARAGAP uses an algorithmic approach to detect and correct inversions and deletions, a De Bruijn graph-based approach to infer the insertions, an affine-match affine-gap local alignment tool to estimate the locations of insertions, and a Bayesian estimation framework to detect SNPs.

Original language: English
Pages (from-to): 226-250
Number of pages: 25
Journal: International Journal of Computational Biology and Drug Design
Volume: 8
Issue number: 3
Publication status: Published - 2015
Externally published: Yes

Keywords

  • Bayesian statistics
  • De-Bruijn graph
  • Genome assembly
  • Graph theory
  • Local alignment
  • Minimum description length principle
  • Mutations
  • Next generation sequencing
  • Reference assisted assembly
  • SNPs
  • Single nucleotide polymorphisms
