Integrated genome sizing (IGS) approach for the parallelization of whole genome analysis, BMC Bioinf…
페이지 정보작성자 관리자 작성일2018-12-06 조회15,694회
Peter Sona, Jong Hui Hong, Sunho Lee, Byong Joon Kim, Woon-Young Hong, Jongcheol Jung, Han-Na Kim, Hyung-Lae Kim, David Christopher, Laurent Herviou, Young Hwan Im, Kwee-Yum Lee, Tae Soon Kim and Jongsun Jung
       Genome Data Integration Centre, Syntekabio Inc., 187 Techno 2-ro, B512, Yuseong-gu, Daejeon, 34025, South Korea.
  Gwanghwamun Medical Study Centre, Syntekabio Inc., 92 Saemunan-ro, #1708, Jongno-gu, Seoul, 03186, South Korea.
  Department of Biochemistry, School of Medicine, Ewha Woman's University, Seoul 07985, Republic of Korea
The use of whole genome sequence has increased recently with rapid progression of next-generation sequencing (NGS) technologies. However, storing raw sequence reads to perform large-scale genome analysis pose hardware challenges. Despite advancement in genome analytic platforms, efficient approaches remain relevant especially as applied to the human genome. In this study, an Integrated Genome Sizing (IGS) approach is adopted to speed up multiple whole genome analysis in high-performance computing (HPC) environment. The approach splits a genome (GRCh37) into 630 chunks (fragments) wherein multiple chunks can simultaneously be parallelized for sequence analyses across cohorts.