Alexander Wilson, Ph.D.
Scientist Emeritus
Computational and Statistical Genomics Branch
NHGRI
Research Topics
The overarching goal of Dr. Wilson's research program has been the identification of genetic variants responsible for the variation in quantitative traits. The specific aims of this research program are: 1) to use statistical genetic analysis to identify genetic effects underlying quantitative traits and to identify statistical challenges that need to be addressed, 2) to develop new methods of genetic analysis for quantitative traits that address these challenges, 3) to use computer simulation to investigate the statistical properties of these methods, and 4) to apply insights gained from these simulations to ongoing collaborative studies. This work has spanned more than three decades, first at the Department of Biometry and Genetics at the Louisiana State University Medical Center and then the Genometrics Section of the Computational and Statistical Genomics Branch of the NHGRI Division of Intramural Research, NIH. This research has included applications projects and methods development in linkage analysis and tests of association with red-cell antigens, protein polymorphisms, STRPs, SNPs and next-generation sequence variants (SVs), in both family- and population-based samples.
Major substantive results include 1) the identification of genes for Mendelian syndromes (e.g., congenital cataracts, Cranio-Lenticulo-Sutural-Dysplasia), 2) the identification of polymorphisms responsible for variation in quantitative traits (e.g., dopamine-beta-hydroxylase activity, citalopram response in depressed individuals, and platelet aggregation), and 3) the identification of candidate regions with linkage and association in complex disorders (e.g., traits related to hypertension and cardiovascular disease, depression and alcoholism, familial idiopathic scoliosis and kyphoscoliosis, and craniosynostosis).
Methodological work includes: 1) advances in non-parametric linkage analysis, 2) stepwise regression of identified variants in quantitative results, 3) regional inference with moving averages of p-values, 4) the use of derived composite biallelic loci, 5) testing associations in parent-offspring trios with a regression of offspring on mid-parent (ROMP) based approach, and 6) the use of hot-spot based delimiters to divide the genome into independent segments in a linear regression format for family- and population-based tests of association for sequence variants (tiled regression). In addition, the large and small sample statistical properties of these tests have been investigated with computer simulation studies to ensure that the tests are statistically valid and have reasonable power and type I error rates. Software packages developed include 1) the Genometric Simulation Analysis Package (GASP), 2) the implementation of the Regression of Offspring on Mid-Parent (ROMP, ROOP and ROMPrev) and 3) the implementation of the tiled regression approach, the Tiled Regression Analysis Package (TRAP). These packages are available on the NHGRI website.
During the last several years, the density of genetic variants has increased dramatically and the section's research has become particularly focused on approaches that are robust with respect to the non-independence between markers and correlations between family members within samples, and on the identification of non-coding regulatory elements. Recent methodological work has focused on the development of two new methods for quantitative traits: the regression of offspring on mid-parent (ROMP), and tiled regression. Both are tests of association in a linear regression framework that can be applied to family data and have been designed for data with very large numbers of genetic markers (millions), e.g. high density SNP panels, and/or large-scale sequencing producing large numbers of rare sequence variants. ROMP is designed to minimize the amount of genotyping and/or sequencing required for a test of association in a parent-offspring trio or nuclear family, by requiring phenotyping data on the parents and offspring, but requiring genotyping or sequencing on only the offspring. Tiled regression is a method that determines the set of independent sequence variants across the entire genome that best predict a given phenotype, against the background of all the variants in the genome.
Regression methods are used to identify independent variants (both coding and non-coding) in predefined independent tiles, that are defined by hotspot blocks, or other positional or functional regions. Higher level regression is then used to determine independent variants over chromosomes and the entire genome. At the Genetic Analysis Workshop 17 (GAW 17), in 2010, it was discovered that there was a substantial inflation of type I error when GWAS methods were used to analyze rare sequence variants, most likely due to the presence of gametic disequilibrium (or inter-LD). The tiled regression method was one of the only methods that allowed for both intra- and inter-LD correlations and it did not exhibit inflation of type I error rates that were present in virtually all of the other methods considered. Before this workshop, it was generally assumed that adjustments only had to be made for correlations within LD blocks (intra-LD correlations); but based on the GAW 17 findings, this is clearly not the case.
Future plans include extensions of TRAP to qualitative traits, and the inclusion of tiled regression into the regression of offspring on mid-parent (ROMP) regression framework. The tiled regression approach, using both hotspot and functional criteria to define tiles, is currently being used in all of our ongoing and future collaborative studies. These projects include 1) analysis of all ClinSeq traits and sequence data (Les Biesecker et al.), 2) analysis of the Trinity Irish metabolite data (Lawrence Brody), 3) functional studies in zebrafish based on our kyphoscoliosis IRX results (Nancy Miller), 4) whole exome sequencing and analysis of a large family with metopic craniosynostosis (Simeon Boyadjiev), and 5) targeted or whole exome sequencing and analysis of about 70 families with familial idiopathic scoliosis (Nancy Miller).
Biography
Dr. Alexander F. Wilson graduated from McDaniel College, magna cum laude, with a B.A. in biology in 1975, and received the H.P. Studivant Award as the Outstanding Biology Major. He received his Ph.D. in medical genetics from Indiana University under the direction of P. Michael Conneally, Ph.D., (1980) and did his postdoctoral training in statistical genetics with Robert C. Elston, Ph.D., in the Department of Biometry, Louisiana State University Medical Center (1980-1982). He remained at Louisiana State University, rising to the rank of tenured Full Professor in 1993. He was recruited to the National Human Genome Research Institute in 1995.
He is a scientist emeritus in the Computational and Statistical Genomics Branch, at the National Human Genome Research Institute, NIH, and an adjunct professor, Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health. Dr. Wilson is an active member of the American Society of Human Genetics (ASHG) and the International Genetic Epidemiology Society (IGES). He has served on the ASHG Program Committee (2010-2013), on the IGES Board of Directors, and is currently the President of IGES.
He has been a member of or has directed the dissertation committees of 16 students and has trained over a dozen post-doctoral students and visiting faculty. He has received numerous awards, including the Indiana University School of Medicine Department of Medical Genetics Distinguished Alumnus Award, the Western Maryland College Trustee Alumni Award, the NIH Director's Award and induction into Phi Beta Kappa as an alumni member.
His research interests focus on the identification of genetic effects that may be responsible for phenotypic variation in quantitative traits (e.g., traits related to cardiovascular disease and scoliosis), the coding and non-coding elements that may be responsible for their expression, and the investigation of the statistical properties of newly developed methods of genetic analysis for quantitative traits.
Selected Publications
- Biesecker LG, Mullikin JC, Facio FM, Turner C, Cherukuri PF, Blakesley RW, Bouffard GG, Chines PS, Cruz P, Hansen NF, Teer JK, Maskeri B, Young AC, NISC Comparative Sequencing Program., Manolio TA, Wilson AF, Finkel T, Hwang P, Arai A, Remaley AT, Sachdev V, Shamburek R, Cannon RO, Green ED. The ClinSeq Project: piloting large-scale genome sequencing for research in genomic medicine. Genome Res. 2009;19(9):1665-74.
- Justice CM, Bishop K, Carrington B, Mullikin JC, Swindle K, Marosy B, Sood R, Miller NH, Wilson AF. Evaluation of IRX Genes and Conserved Noncoding Elements in a Region on 5p13.3 Linked to Families with Familial Idiopathic Scoliosis and Kyphosis. G3 (Bethesda). 2016;6(6):1707-12.
- Molloy AM, Pangilinan F, Mills JL, Shane B, O'Neill MB, McGaughey DM, Velkova A, Abaan HO, Ueland PM, McNulty H, Ward M, Strain JJ, Cunningham C, Casey M, Cropp CD, Kim Y, Bailey-Wilson JE, Wilson AF, Brody LC. A Common Polymorphism in HIBCH Influences Methylmalonic Acid Concentrations in Blood Independently of Cobalamin. Am J Hum Genet. 2016;98(5):869-882.
- Justice CM, Yagnik G, Kim Y, Peter I, Jabs EW, Erazo M, Ye X, Ainehsazan E, Shi L, Cunningham ML, Kimonis V, Roscioli T, Wall SA, Wilkie AO, Stoler J, Richtsmeier JT, Heuzé Y, Sanchez-Lara PA, Buckley MF, Druschel CM, Mills JL, Caggana M, Romitti PA, Kay DM, Senders C, Taub PJ, Klein OD, Boggan J, Zwienenberg-Lee M, Naydenov C, Kim J, Wilson AF, Boyadjiev SA. A genome-wide association study identifies susceptibility loci for nonsyndromic sagittal craniosynostosis near BMP2 and within BBS9. Nat Genet. 2012;44(12):1360-4.
- Sabourin JA, Cropp CD, Sung H, Brody LC, Bailey-Wilson JE, Wilson AF. ComPaSS-GWAS: A method to reduce type I error in genome-wide association studies when replication data are not available. Genet Epidemiol. 2019;43(1):102-111.
Related Scientific Focus Areas
This page was last updated on Wednesday, December 1, 2021