Hadoop - PruneDIRECT
We are developing PruneDIRECT, an efficient algorithm for multiple quantitative trait loci (QTL) scans in experimental populations. In a case study that used permutation testing to detect two and three QTL, PruneDIRECT needed 11012 thousand and 201514 thousand function evaluations on average, respectively. Our main goal is to perform permutation testing with PruneDIRECT for more than three QTL.
Much previous work has addressed algorithm optimization and efficient serial and parallel implementations. The aim now is to explore the Map-Reduce programming model for the QTL application. Map-Reduce is well suited to applications that consist of many independent tasks, and the current implementation of PruneDIRECT fits this model well. Based on preliminary studies, we expect the Map-Reduce model to help with searches for more than three QTL. The R statistical software is widely used in the biology community. To provide a familiar environment for biologists, we have chosen the R-Hadoop framework, which allows transparent use of Hadoop from within the R programming environment.
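To illustrate why permutation testing maps naturally onto Map-Reduce, the following is a minimal sketch in plain Python rather than R-Hadoop. Each map task handles one permutation independently and emits the maximum test statistic it found. A single reduce step collects those maxima and reads off an empirical significance threshold. The function names, the simulated data, and the statistic (a single-locus difference in group means scanned exhaustively) are all assumptions made for illustration. They are not the PruneDIRECT search or the actual QTL model.

```python
import random


def toy_statistic(genotypes, phenotypes, locus):
    """Hypothetical stand-in for the QTL model fit: absolute difference in
    mean phenotype between the two genotype classes at one locus."""
    groups = {0: [], 1: []}
    for individual, value in zip(genotypes, phenotypes):
        groups[individual[locus]].append(value)
    if not groups[0] or not groups[1]:
        return 0.0
    mean0 = sum(groups[0]) / len(groups[0])
    mean1 = sum(groups[1]) / len(groups[1])
    return abs(mean0 - mean1)


def map_permutation(seed, genotypes, phenotypes):
    """Map task: shuffle the phenotypes once and emit the best statistic.
    An exhaustive scan over loci is used here in place of PruneDIRECT."""
    rng = random.Random(seed)
    shuffled = list(phenotypes)
    rng.shuffle(shuffled)
    n_loci = len(genotypes[0])
    best = max(toy_statistic(genotypes, shuffled, j) for j in range(n_loci))
    return ("max_stat", best)


def reduce_maxima(pairs):
    """Reduce task: gather the per-permutation maxima into a sorted list."""
    return sorted(value for _key, value in pairs)


def threshold(sorted_maxima, alpha=0.05):
    """Empirical (1 - alpha) quantile of the permutation maxima."""
    index = min(len(sorted_maxima) - 1, int((1 - alpha) * len(sorted_maxima)))
    return sorted_maxima[index]


def run_demo(n_perm=20, seed=0, n_individuals=50, n_loci=10):
    """Simulate a small data set and run the map and reduce steps in-process."""
    rng = random.Random(seed)
    genotypes = [[rng.randint(0, 1) for _ in range(n_loci)]
                 for _ in range(n_individuals)]
    phenotypes = [rng.gauss(0.0, 1.0) for _ in range(n_individuals)]
    # Each call below is independent of the others, so on a cluster it
    # could run as a separate map task.
    pairs = [map_permutation(s, genotypes, phenotypes) for s in range(n_perm)]
    maxima = reduce_maxima(pairs)
    return maxima, threshold(maxima)


if __name__ == "__main__":
    maxima, thr = run_demo()
    print("permutations:", len(maxima), "threshold:", thr)
```

Because the map tasks share no state beyond the read-only input data, the number of permutations can be scaled by adding workers. Only the small list of maxima needs to reach the reducer.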
- Fast and accurate detection of multiple quantitative trait loci. In Journal of Computational Biology, volume 20, pp. 687-702, 2013.
- Using peak shape to improve efficient and effective detection of multiple QTL in known crossings; Carl Nettelblad, Behrang Mahjani*, Sverker Holmgren; Poster presented at 4th International Conference of Quantitative Genetics 2012
- Fast and Accurate Detection of Multiple QTL in Known Crossings; Behrang Mahjani; 4th Swedish Meeting on Mathematics in Biology 2012
- Detection of Multiple QTL in Known Crossings; Behrang Mahjani; ReiDok13 Symposium on Computational PhD Projects 2013