Title: | Calling Copy Number Alterations in Amplicon Sequencing Data |
---|---|
Description: | Method for the calculation of copy numbers and calling of copy number alterations. The algorithm uses coverage data from amplicon sequencing of a sample cohort as input. The method includes significance assessment, correction for multiple testing and does not depend on normal DNA controls. Budczies (2016 Mar 15) <doi:10.18632/oncotarget.7451>. |
Authors: | Jan Budczies, Eva Romanovsky |
Maintainer: | Jan Budczies <[email protected]> |
License: | GPL-3 |
Version: | 2.2.2 |
Built: | 2025-03-02 03:33:35 UTC |
Source: | https://github.com/cran/ioncopy |
Method for the calculation of copy numbers and calling of copy number alterations. The algorithm uses coverage data from amplicon sequencing of a sample cohort as input. The method includes significance assessment, correction for multiple testing and does not depend on normal DNA controls.
Package: | ioncopy |
Type: | Package |
Version: | 2.2.2 |
Date: | 2020-08-07 |
License: | GPL-3 |
Maintainer: Jan Budczies <[email protected]>
Eva Romanovsky <[email protected]>
Jan Budczies, Nicole Pfarr, Albrecht Stenzinger, Denise Treue, Volker Endris, Fakher Ismaeel, Nikola Bangemann, Jens-Uwe Blohmer, Manfred Dietel, Sibylle Loibl, Wilko Weichert, Carsten Denkert: Ioncopy: a novel method for calling copy number alterations in amplicon sequencing data including significance assessment. Oncotarget 7(11):13236-47, 2016, doi: 10.18632/oncotarget.7451.
Jan Budczies, Nicole Pfarr, Eva Romanovsky, Volker Endris, Albrecht Stenzinger, and Carsten Denkert: Ioncopy: an R Shiny app to call copy number alterations in targeted NGS data. Submitted.
Starting from a matrix of coverages, the corresponding matrix of copy numbers is calculated. A null model for significance assessment of copy number alterations is fitted. Each amplicon in each sample is assessed for significance. Summarized copy numbers and p-values for genes are calculated as descibed below.
assess.CNA(coverage.target, coverage.source=NULL, method.pooled="amplicon", thres.cov=100)
assess.CNA(coverage.target, coverage.source=NULL, method.pooled="amplicon", thres.cov=100)
coverage.target |
A numeric matrix containing the target coverages of each amplicon (rows) in each sample (columns). The target data are investigated for copy number alterations. |
coverage.source |
A numeric matrix containing the source coverages of each amplicon (rows) in each sample (columns).
The source data are used to fit a null model. If |
method.pooled |
Method used for the estimation of the null model. Either one common null model for all amplicons
( |
thres.cov |
Theshold for the minimal mean coverage of an amplicon to be included in the analysis. |
List containing the following elements: Matrix of copy numbers with the estimated null model ("model"
), estimates of copy numbers ("CN.a"
and "CN.g"
) for amplicons and genes as well as p-values of copy number alterations ("P.a"
and "P.g"
) for amplicons and genes. Copy numbers for genes are calculated as average of the copy numbers of all amplicons interrogating the gene, p-values for genes are calculated using Fisher's method.
## Not run: data(coverage) CNA <- assess.CNA(coverage) ## End(Not run)
## Not run: data(coverage) CNA <- assess.CNA(coverage) ## End(Not run)
Starting from a matrix of coverages, the corresponding matrix of copy numbers is calculated. The caluculation includes a sample normalization step and a amplicon normalization step.
calculate.CN(coverage, scale.amplicon=NULL)
calculate.CN(coverage, scale.amplicon=NULL)
coverage |
A numeric matrix containing the coverages of each amplicon (rows) in each sample (columns). |
scale.amplicon |
Method for amplicon normalization. If |
Matrix of copy numbers with the amplicon scaling factors ("scale.amplicon"
) as attribute.
## Not run: data(coverage) CN <- calculate.CN(coverage) ## End(Not run)
## Not run: data(coverage) CN <- calculate.CN(coverage) ## End(Not run)
Copy number alteration calling after (possible) multiple testing correction for samples and/or amplicons/genes. P-values are controlled either for the number of samples, the number of amplicons/genes or both. Different methods of multiple testing can be chosen to control either FWER or FDR.
call.CNA(CNA, analysis.mode="gene-wise", method.p="samples_genes/amplicons", method.mt="bonferroni", thres.p=0.05, sig.call=0, sig.per=0)
call.CNA(CNA, analysis.mode="gene-wise", method.p="samples_genes/amplicons", method.mt="bonferroni", thres.p=0.05, sig.call=0, sig.per=0)
CNA |
List of CNA assessments generated by |
analysis.mode |
The Mode of the analysis: ( |
method.p |
The multiple testing method used for detection:
Usage of uncorrected p-values ( |
method.mt |
Method for multiple testing correction: must be equal to |
thres.p |
Significance level for calling of copy number alterations. |
sig.call |
An integer >= 0. Only used if |
sig.per |
An integer >= 0. Only used if |
Table containing the status (GAIN, LOSS or NORMAL) of each amplicon or gene in each sample ("tab"
). Matrix of copy number estimates in each sample and each amplicon or gene ("CN"
). Indicator matrix of detected gains in each sample and each amplicon or gene ("gain"
) and the same for losses ("loss"
).
## Not run: data(coverage) CN <- calculate.CN(coverage) CNA <- assess(CN) calls <- call.CNA(CNA) ## End(Not run)
## Not run: data(coverage) CN <- calculate.CN(coverage) CNA <- assess(CN) calls <- call.CNA(CNA) ## End(Not run)
A matrix of sequencing coverages from semiconductor sequencing
data(coverage)
data(coverage)
Matrix containing the sequencing coverages of 154 samples (breast carcinomas) and 184 variables (amplicons).
data(coverage)
data(coverage)
Heatmap visualization including optional hierarchical clustering of amplicons/genes and samples.
heatmap.CNA(CNA, thres.percent=1, cluster.genes=TRUE, cluster.samples=TRUE, type="CNA calls", method.dist="manhattan", method.link="average", mar=3, cex=0.50)
heatmap.CNA(CNA, thres.percent=1, cluster.genes=TRUE, cluster.samples=TRUE, type="CNA calls", method.dist="manhattan", method.link="average", mar=3, cex=0.50)
CNA |
Indicator matrix of CNAs generated by |
thres.percent |
Number between 0 and 100. Only genes with a minimum percentage of amplified and/or deleted samples are included into the heatmap. |
cluster.genes |
Logical value. If |
cluster.samples |
Logical value. If |
type |
Heatmap of copy numbers |
method.dist |
Character. Method for calculation of the distance between genes/amplicons and between samples. |
method.link |
Character. Linkage method to calculate the distance between clusters. |
mar |
Numeric value. Margins for row names and column names. |
cex |
Numerical value. Shrinkage factor for row names and column names. |
Depending on "type"
heatmap of CNAs (color coding: LOSS=green, NORMAL=black, GAIN=red) or
heatmap of copy numbers (color coding CN<1: green, CN<2: darkgreen, CN>3: darkred, CN>4: red, CN>5: orange, CN>10: yellow).
Coverages are read from tab separated files and stored in a coverage matrix. All coverage files need to refer to the same panel of targets.
read.coverages(chip.names, file.names, anno.col="Target")
read.coverages(chip.names, file.names, anno.col="Target")
chip.names |
A character vector comprizing the chip names. |
file.names |
A character vector compizing the names of tab separated files. Each file should contain the coverages of the same sequencing panel (rows) in a cohort of samples (columns). |
anno.col |
The column in input files that compizes the names of the targets. The targets must belong to the same sequencing panel for all input files. |
Matrix of coverages of each target sequence (rows) in each sample (colums).
Running Ioncopy as web application.
runIoncopy()
runIoncopy()
CNA calls are summarized to tables of all amplifications in each sample and to all amplifications in each amplicon/gene.
summarize.CNA(calls)
summarize.CNA(calls)
calls |
List of CNA calls generated by |
List of summary matrices for samples ("samples"
) and amplicons ("amplicon"
) or genes "gene"
.