Go Back   CORTEX Forums > Best Practices > Subject Matter Expertise > Analytics > Analytic News Feeds
Register Blogs FAQ Members List Calendar Search Today's Posts Mark Forums Read

cn.MOPS: mixture of Poissons for discovering copy number variations in next-generatio

This is a discussion on cn.MOPS: mixture of Poissons for discovering copy number variations in next-generatio within the Analytic News Feeds forums, part of the Analytics category; cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate. Nucleic Acids Res. 2012 Feb 1; Authors: Klambauer G, Schwarzbauer K, ...


Reply
 
LinkBack Thread Tools Search this Thread Display Modes
Old 4th February 2012, 11:02 PM   #1
Administrator
 
Join Date: Oct 2007
Posts: 15,896
Blog Entries: 7
admin has disabled reputation
Default cn.MOPS: mixture of Poissons for discovering copy number variations in next-generatio

cn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate.

Nucleic Acids Res. 2012 Feb 1;

Authors: Klambauer G, Schwarzbauer K, Mayr A, Clevert DA, Mitterecker A, Bodenhofer U, Hochreiter S

Abstract
Quantitative analyses of next-generation sequencing (NGS) data, such as the detection of copy number variations (CNVs), remain challenging. Current methods detect CNVs as changes in the depth of coverage along chromosomes. Technological or genomic variations in the depth of coverage thus lead to a high false discovery rate (FDR), even upon correction for GC content. In the context of association studies between CNVs and disease, a high FDR means many false CNVs, thereby decreasing the discovery power of the study after correction for multiple testing. We propose 'Copy Number estimation by a Mixture Of PoissonS' (cn.MOPS), a data processing pipeline for CNV detection in NGS data. In contrast to previous approaches, cn.MOPS incorporates modeling of depths of coverage across samples at each genomic position. Therefore, cn.MOPS is not affected by read count variations along chromosomes. Using a Bayesian approach, cn.MOPS decomposes variations in the depth of coverage across samples into integer copy numbers and noise by means of its mixture components and Poisson distributions, respectively. The noise estimate allows for reducing the FDR by filtering out detections having high noise that are likely to be false detections. We compared cn.MOPS with the five most popular methods for CNV detection in NGS data using four benchmark datasets: (i) simulated data, (ii) NGS data from a male HapMap individual with implanted CNVs from the X chromosome, (iii) data from HapMap individuals with known CNVs, (iv) high coverage data from the 1000 Genomes Project. cn.MOPS outperformed its five competitors in terms of precision (1-FDR) and recall for both gains and losses in all benchmark data sets. The software cn.MOPS is publicly available as an R package at http://www.bioinf.jku.at/software/cnmops/ and at Bioconductor.


PMID: 22302147 [PubMed - as supplied by publisher]



PubMed comprises more than 19 million citations for biomedical articles from MEDLINE and life science journals. This RSS feed searches for mentions of Bioconductor - the open source and open development software project for the analysis and comprehension of genomic data.
More from PubMed...
admin is offline  
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiTweet this Post!
Reply With Quote
Reply

Bookmarks

Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is On
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Finding recurrent copy number alterations preserving within-sample homogeneity. admin Analytic News Feeds 0 30th August 2011 10:07 PM
PREDA: an R-package to identify regional variations in genomic data. admin Analytic News Feeds 0 12th July 2011 09:47 PM
Bendigo & Adelaide mops up Rural Bank Latest News Headlines Latest News 0 27th October 2010 08:46 AM
A multilevel model to address batch effects in copy number estimation using SNP array admin Analytic News Feeds 0 15th July 2010 03:05 AM
Merging mixture components for cell population identification in flow cytometry. admin Analytic News Feeds 0 6th January 2010 10:01 PM


All times are GMT +11. The time now is 11:48 AM.

© The Business Intelligence Group

Search Engine Optimization by vBSEO