CADD is a tool for scoring the deleteriousness of single nucleotide variants as well as insertion/deletions variants in the human genome.
While many variant annotation and scoring tools are around, most annotations tend to exploit a single information type (e.g. conservation) and/or are restricted in scope (e.g. to missense changes). Thus, a broadly applicable metric that objectively weights and integrates diverse information is needed. Combined Annotation Dependent Depletion (CADD) is a framework that integrates multiple annotations into one metric by contrasting variants that survived natural selection with simulated mutations.
C-scores strongly correlate with allelic diversity, pathogenicity of both coding and non-coding variants, and experimentally measured regulatory effects, and also highly rank causal variants within individual genome sequences. Finally, C-scores of complex trait-associated variants from genome-wide association studies (GWAS) are significantly higher than matched controls and correlate with study sample size, likely reflecting the increased accuracy of larger GWAS.
CADD can quantitatively prioritize functional, deleterious, and disease causal variants across a wide range of functional categories, effect sizes and genetic architectures and can be used prioritize causal variation in both research and clinical settings.
For more information:
- Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. (29 October 2018), CADD: predicting the deleteriousness of variants throughout the human genome, Nucleic Acids Research, 47, D886–D894
- Kircher M, Witten DM, Jain P, O'Roak BJ, Cooper GM, Shendure J (2014 Feb 2), A general framework for estimating the relative pathogenicity of human genetic variants, Nature Genetics, 46, pages 310–315
swap_vertical_circlecloud_downloadDownloads (0)Files marked with an asterix (*) can only be downloaded by users that have the appropriate product license. The license must be active and you must be logged into your account.