A Bayesian Segmentation Approach to Ascertain Copy Number Variations at the Population Level
Document Type
Article
Publication Date
6-29-2009
Description
Motivation: Efficient and accurate ascertainment of copy number variations (CNVs) at the population level is essential for understanding evolutionary processes and population genetics, and for applying CNVs in population-based genome-wide association studies of complex human diseases. We propose a novel Bayesian segmentation approach to identify CNVs in a defined population of any size. It is computationally efficient and provides statistical evidence for the detected CNVs through the Bayes factor. This approach has the unique feature of carrying out segmentation and assigning copy number status simultaneously, a desirable property that current segmentation methods do not share.
Results: In comparisons with popular two-step segmentation methods for a single individual using benchmark simulation studies, we find that the new approach performs competitively with respect to false discovery rate and sensitivity in breakpoint detection. In a simulation study of multiple samples with recurrent copy numbers, the new approach outperforms two leading single-sample methods. We further demonstrate the effectiveness of our approach in a population-level analysis of previously published HapMap data. We also apply our approach to studying the population genetics of CNVs.
Citation Information
Wu, Long Y.; Chipman, Hugh A.; Bull, Shelley B.; Briollais, Laurent; and Wang, Kesheng. 2009. A Bayesian Segmentation Approach to Ascertain Copy Number Variations at the Population Level. Bioinformatics. Vol. 25(13), 1669-1679. https://doi.org/10.1093/bioinformatics/btp270 PMID: 19389735. ISSN: 1367-4803.