The Library
Discovering genetic associations with high-dimensional neuroimaging phenotypes: a sparse reduced-rank regression approach
Tools
Alzheimers Disease Neuroimaging Initiative (Including: Vounou, Maria, Nichols, Thomas E. and Montana, Giovanni). (2010) Discovering genetic associations with high-dimensional neuroimaging phenotypes: a sparse reduced-rank regression approach. NeuroImage, Vol.53 (No.3). pp. 1147-1159. ISSN 1053-8119
Full text not available from this repository.
Official URL: http://dx.doi.org/10.1016/j.neuroimage.2010.07.002
Abstract
There is growing interest in performing genome-wide searches for associations between genetic variants and brain imaging phenotypes. While much work has focused on single scalar valued summaries of brain phenotype, accounting for the richness of imaging data requires a brain-wide, genome-wide search. In particular, the standard approach based on mass-univariate linear modelling (MULM) does not account for the structured patterns of correlations present in each domain. In this work, we propose sparse reduced rank regression (sRRR), a strategy for multivariate modelling of high-dimensional imaging responses (measurements taken over regions of interest or individual voxels) and genetic covariates (single nucleotide polymorphisms or copy number variations), which enforces sparsity in the regression coefficients. Such sparsity constraints ensure that the model performs simultaneous genotype and phenotype selection. Using simulation procedures that accurately reflect realistic human genetic variation and imaging correlations, we present detailed evaluations of the sRRR method in comparison with the more traditional MULM approach. In all settings considered, sRRR has better power to detect deleterious genetic variants compared to MULM. Important issues concerning model selection and connections to existing latent variable models are also discussed. This work shows that sRRR offers a promising alternative for detecting brain-wide, genome-wide associations. Crown Copyright (c) 2010 Published by Elsevier Inc. All rights reserved.
| Item Type: | Journal Article |
|---|---|
| Subjects: | Q Science > QA Mathematics Q Science > QH Natural history > QH426 Genetics R Medicine > RC Internal medicine > RC0321 Neuroscience. Biological psychiatry. Neuropsychiatry |
| Divisions: | Faculty of Science > Statistics Faculty of Science > WMG (Formerly the Warwick Manufacturing Group) |
| Library of Congress Subject Headings (LCSH): | Brain -- Imaging, Regression analysis, Phenotype |
| Journal or Publication Title: | NeuroImage |
| Publisher: | Elsevier |
| ISSN: | 1053-8119 |
| Date: | 15 November 2010 |
| Volume: | Vol.53 |
| Number: | No.3 |
| Number of Pages: | 13 |
| Page Range: | pp. 1147-1159 |
| Identification Number: | 10.1016/j.neuroimage.2010.07.002 |
| Status: | Peer Reviewed |
| Publication Status: | Published |
| Access rights to Published version: | Open Access |
| Funder: | Engineering and Physical Sciences Research Council (EPSRC), GlaxoSmithKline Clinical Imaging Centre, National Institutes of Health (U.S.) (NIH), National Institute on Aging (NIA), National Institute of Biomedical Imaging and Bioengineering (U.S.) , Dana Foundation |
| Grant number: | U01 AG024904 (NIH), P30 AG010129 (NIH), K01 AG030514 (NIH) |
| References: | Cantor, R.M., Lange, K., Sinsheimer, J.S., 2010. Prioritizing GWAS results: a review of statistical methods and recommendations for their application. Journal of Human Genetics 6–22. Chun, H., Keles, S., 2007. Sparse partial least squares regression with an application to genome scale transcription factor analysis. Department of Statistics, University of Wisconsin, Madison, USA, Technical report. Croiseau, P., Cordell, H.J., 2009. Analysis of North American Rheumatoid Arthritis Consortium data using a penalized logistic regression approach. BMC Proceedings 5, 1–5. Dawy, Z., Sarkis, M., Hagenauer, J., Mueller, J., 2005. A novel gene mapping algorithm based on independent component analysis. IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005. Proceedings (ICASSP'05), 5 de Bakker, P., Yelensky, R., Pe'er, I., Gabriel, S., Daly, M., Altshuler, D., 2005. Efficiency and power in genetic association studies. Nature genetics 37 (11), 1217–1223. Donnelly, P., 2008. Progress and challenges in genome-wide association studies in humans. Nature 456 (7223), 728–731. Fawcett, T., 2004. ROC graphs: notes and practical considerations for researchers. Machine Learning, p. 31. Ferreira, M., Purcell, S., 2009. A multivariate test of association. Bioinformatics 25 (1), 132. Filippini, N., Rao, A., Wetten, S., Gibson, R.A., Borrie, M., Guzman, D., Kertesz, A., Loy- English, I., Williams, J., Nichols, T., Whitcher, B., Matthews, P.M., 2009. Anatomically- distinct genetic associations of APOE epsilon4 allele load with regional cortical atrophy in Alzheimer's disease. NeuroImage 44 (3), 724–728. Friedman, J., Hastie, T., Höfling, H., Tibshirani, R., 2007. Pathwise coordinate optimization. Annals of Applied Statistics 1 (2), 302–332. Friman, O., Cedefamn, J., Lundberg, P., Borga, M., Knutsson, H., 2001. Detection of neural activity in functional MRI using canonical correlation analysis. Magnetic Resonance in Medicine 45 (2), 323–330. Good, C., Johnsrude, I., Ashburner, J., Henson, R., Friston, K., Frackowiak, R., 2001. A voxel-based morphometric study of ageing in 465 Normal adult human brains. Neuroimage 14 (1), 21–36. Hardoon, D.R., Ettinger, U., Mourão Miranda, J., Antonova, E., Collier, D., Kumari, V., Williams, S.C.R., Brammer, M., 2009. Correlation-based multivariate analysis of genetic influence on brain volume. Neuroscience letters 450 (3), 281–286. Hastie, T., Tibshirani, R., Friedman, J., 2001. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York. Hoerl, A., Kennard, R., 1970. Ridge regression: applications to nonorthogonal problems. Technometrics 12 (1), 69–82. Hoggart, C.,Whittaker, J., DeIorio,M., Balding,D., 2008. Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies. PLoS Genet 4 (7), e1000130. Hoggart, C.J., Chadeau-Hyam, M., Clark, T.G., Lampariello, R., Whittaker, J.C., De Iorio, M., Balding, D.J., 2007. Sequence-level population simulations over large genomic regions. Genetics 177 (3), 1725–1731. Izenman, A., 2008. Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning. Springer. Joyner, A.H., Roddey, J.C., Bloss, C.S., Bakken, T.E., Rimol, L.M., Melle, I., Agartz, I., Djurovic, S., Topol, E.J., Schork, N.J., Andreassen, O.A., Dale, A.M., 2009. A common MECP2 haplotype associates with reduced cortical surface area in humans in two independent populations. PNAS 106 (36), 15475–15480. Kustra, R., 2006. Reduced-rank regularized multivariate model for high-dimensional data. Journal of Computational and Graphical Statistics 15 (2), 312–318. Kwee, L., Liu, D., Lin, X., Ghosh, D., Epstein, M., 2008. A powerful and flexible multilocus association test for quantitative traits. The American Journal of Human Genetics 82 (2), 386 Äì397. Laudadio, T., Pels, P., De Lathauwer, L., Van Hecke, P., Van Huffel, S., 2005. Tissue segmentation and classification of MRSI data using canonical correlation analysis. Magnetic resonance inmedicine: official journal of the Society ofMagnetic Resonance in Medicine/Society of Magnetic Resonance in Medicine 54 (6), 1519–1529. Le Cao, K., Rossouw, D., Robert-Granie, C., Besse, P., 2008. A sparse PLS for variable selection when integrating omics data. Statistical Applications in Genetics and Molecular Biology 7 (1), 35. Lin, Z., Altman, R., 2004. Finding haplotype tagging SNPs by use of principal components analysis. The American Journal of Human Genetics 75 (5), 850–861. Liu, J., Demirci, O., Calhoun, V., 2008. A parallel independent component analysis approach to investigate genomic influence on brain function. IEEE Signal Processing Letters 15, 413–416. Liu, T., Shen, D., Davatzikos, C., 2004. Predictive modeling of anatomic structures using canonical correlation analysis. IEEE International Symposium on Biomedical Imaging: Nano to Macro, 2004, pp. 1279–1282. Lykou, A., Whittaker, J., 2009. Sparse CCA using a Lasso with positivity constraints. Computational Statistics and Data Analysis. McVean, G., 2009. A genealogical interpretation of principal components analysis. PLoS genetics 5 (10), e1000686. Nandy, R., Cordes, D., 2003. Novel nonparametric approach to canonical correlation analysis with applications to low CNR functional MRI data. Magnetic Resonance in Medicine 50 (2). Parkhomenko, E., Tritchler, D., Beyene, J., 2009. Sparse canonical correlation analysis with application to genomic data integration. Statistical Applications in Genetics and Molecular Biology 8 (1), 1. Potkin, S., Turner, J., Guffanti, G., Lakatos, A., Fallon, J., Nguyen, D., Mathalon, D., Ford, J., Lauriello, J., Macciardi, F., et al., 2008. A genome-wide association study of schizophrenia using brain activation as a quantitative phenotype. Schizophrenia Bulletin. Rao, A., Babalola, K., Rueckert, D., 2006. Canonical correlation analysis of sub-cortical brain structures using non-rigid registration. Lecture Notes in Computer Science 4057, 66–74. Reich, D., Price, A.L., Patterson, N., 2008. Principal component analysis of genetic data. Nature Genetics 40 (5), 491–492. Reinsel, G., Velu, R., 1998. Multivariate reduced-rank regression. Springer, New York. Sarkis, M., Diepold, K., Westad, F., 2006. A new algorithm for gene mapping: Application of partial least squares regression with cross model validation. Genomic Signal Processing and Statistics, 2006. GENSIPS'06. IEEE International Workshop on, pp. 89–90. Stein, J.L., Hua, X., Lee, S., Ho, A.J., Leow, A.D., Toga, A.W., Saykin, A.J., Shen, L., Foroud, T., Pankratz, N., Huentelman, M.J., Craig, D.W., Gerber, J.D., Allen, A.N., Corneveaux, J.J., Dechairo, B.M., Potkin, S.G., Weiner, M.W., Thompson, P., 2010. Voxelwise genomewide association study (vGWAS). NeuroImage. Tibshirani, R., 1996. Regression shrinkage and selection via the Lasso. Journal of the Royal Statistical Society, Series B 58 (1), 267–288. Tziortzi, A., Searle, G., Tzimopoulou, S., Salinas, C., Beaver, J., Jenkinson, M., Rabiner, E., and Gunn, R. (2010). Imaging dopamine receptors in humans with [11C]-(+)- phno: dissection of d3 signal and anatomy. NeuroImage (in submission). Waaijenborg, S., de Witt Hamer, V., Philip, C., Zwinderman, A., 2008. Quantifying the association between gene expressions and DNA-markers by penalized canonical correlation analysis. Statistical Applications in Genetics and Molecular Biology 7 (1), 3. Wang, K., Abbott, D., 2008. A principal components regression approach to multilocus genetic association studies. Genetic Epidemiology 32 (2), 108–118. Wang, W., Barratt, B., Clayton, D., Todd, J., 2005. Genome-wide association studies: theoretical and practical concerns. Nature Reviews Genetics 6 (2), 109–118. Witten, D., Tibshirani, R., Hastie, T., 2009. A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics 10 (3), 515. Wu, T.T., Chen, Y.F.,Hastie, T., Sobel, E., Lange, K., 2009. Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics (Oxford, England) 25 (6), 714–721. |
| URI: | http://wrap.warwick.ac.uk/id/eprint/5172 |
Data sourced from Thomson Reuters' Web of Knowledge
Actions (login required)
![]() |
View Item |
Tools
Tools

