Joan E. Bailey-Wilson, Ph.D.

Scientist Emeritus

Computational and Statistical Genomics Branch


333 Cassell Drive
Suite 1200
Baltimore, MD 21224


Research Topics

Dr. Bailey-Wilson, Head of the Statistical Genetics Section (SGS), is actively studying a range of diseases, including lung cancer, prostate cancer, myopia and other eye diseases, autism and cleft lip and palate. Trained in statistical genetics, she is interested in understanding the genetics of complex diseases and developing novel methodologies to disentangle the roles that genes and environment play in disease causation.

She has been particularly interested in lung cancer since the early 1980s, when very few scientists believed there might be a genetic link to the condition. Today, significantly more data support the idea that there are susceptibility alleles for one or more unknown genes that dramatically increase certain smokers' risk of developing lung cancer. In a collaboration called the Genetic Epidemiology of Lung Cancer Consortium (GELCC), Dr. Bailey-Wilson and others recently narrowed down the location of a potential lung-cancer gene to a region of chromosome 6, and showed that RGS17 is a tumor suppressor gene in this region that shows association with lung cancer risk in highly aggregated lung cancer families. With her collaborators, she is using dense genotyping panels and next-generation DNA sequencing in the GELCC's set of highly-aggregated lung cancer families, their family-history-positives cases, and age-gender-smoking matched controls to search for causal variants in additional lung cancer susceptibility loci.

Dr. Bailey-Wilson has used similar approaches to locate other cancer-related genes. For example, she and her collaborators published evidence that genes involved in prostate cancer reside on specific regions of chromosomes 1, 8, 17 and X. These findings have been replicated, and three candidate genes with rare variants that appear to increase prostate cancer risk have been cloned: RNASEL (HPC1), which encodes ribonuclease L, MSRI, which encodes the macrophage scavenger receptor 1 and HOXB13, which encodes the homeobox B13 protein. Dr. Bailey-Wilson is focusing on identifying additional susceptibility genes for these and other cancers in ongoing studies. At present, she is collaborating with the International Consortium for Prostate Cancer Genetics on a whole exome sequencing study of families with strong family history of prostate cancer and is the lead statistician on the study of the ICPCG's African-American families.

Dr. Bailey-Wilson is also applying these next-generation sequencing tools to her studies of highly aggregated non-syndromic oral cleft families from the Syrian Arab Republic (2 to 17 affected individuals per family) and to a set of multiplex autism spectrum disease families. In the autism study, a subset of families in which at least one child with autism also has abnormal cholesterol levels are of particular interest.

To keep pace with the analysis of the exponentially increasing number of genetic markers, Dr. Bailey-Wilson also develops and tests novel computational methods. Until relatively recently, fewer than 100 of these "signposts" along the genome had been identified. Now, there are millions of known markers and genome sequencing has drastically increased the number of variants to be analyzed in a single study. Her group is especially interested in using machine learning methods to detect causal variants that have limited or no marginal effects on risk of a disease but which do show strong interaction effects (with either other genetic variants or with environmental risk factors). She is also working to address the effects of linkage disequilibrium, or the nonrandom association of closely spaced loci, on genetic interaction tests and machine learning methods. Linkage disequilibrium can be caused by a low frequency of recombinations between two loci when they are very close together on a chromosome. The closer two loci are, the more likely they are to exhibit linkage disequilibrium. Thus, markers that are only 100 kb apart display significantly greater linkage disequilibrium than markers that are 100-5,000 kb apart. Because standard linkage analysis methods typically assume no linkage disequilibrium exists between loci, Dr. Bailey-Wilson's group has developed approaches to streamline these methods to study sets of dense genetic markers. She is using association methods that take advantage of linkage disequilibrium data, HapMap data, and the sequence of the human genome to determine the location of genetic loci that increase risk for various diseases. She has used these and other analytical methods to determine, for example, whether alleles at specific marker loci are transmitted along with a disease through generations in families with several affected members.

Dr. Bailey-Wilson has also used statistical methods to determine the marker alleles that people with a specific disease carry more frequently - and disease-free people carry less frequently - than can be explained by chance. This work has helped to greatly reduce the number of target regions that investigators need to search for potential disease-related genes. Her group is also developing approaches to mitigate the increased false-positive evidence of epistatic interaction that can be observed when strong LD exists between variants within a single genetic locus that all have a marginal effect on the trait.


Joan E. Bailey-Wilson, Ph.D. is a senior investigator and co-chief of the Computational and Statistical Genomics Branch at the National Human Genome Research Institute, National Institutes of Health, United States, where she has been a faculty member since 1995. Her research program focuses on understanding the genetic factors that increase risk for various complex diseases and their interactions with environmental risk factors. Dr. Bailey-Wilson specializes in statistical genetics and genetic epidemiology and is especially interested in risk factors for lung cancer, prostate cancer, eye disorders, autism and oral clefts.

Dr. Bailey-Wilson received her B.A. magna cum laude in Biology from Western Maryland College (now McDaniel College) followed by her Ph.D. in medical genetics with a minor in biomathematics from Indiana University in 1981 under the direction of Joe C. Christian, M.D., Ph.D. She then completed post-doctoral training with Dr. Robert Elston in the Department of Biometry and Genetics, Louisiana State University Medical Center, and went on to achieve the rank of professor there, before leaving for NIH. She was appointed co-branch chief of the Inherited Disease Research Branch in 2006 and became co-chief of the Computational and Statistical Genomics Branch in 2014.

Dr. Bailey-Wilson is a diplomat of the American Board of Medical Genetics and a founding fellow of the American College of Medical Genetics. She has served on many scientific advisory boards including the Cancer Family Registry CFRCCS Advisory Board, the World Trade Center Kinship and Data Analysis Panel for the National Institute of Justice and the Genetic Analysis Workshop Advisory Board. She also served as a member of the International Genetic Epidemiology Society (IGES) Board of Directors (1999-2001), as IGES president-elect, president and past president (2006-2008) and as chair of the IGES Ethical, Legal and Social Issues committee.

Dr. Bailey-Wilson has received a number of awards and honors, including the Distinguished Alumnus Award from the Department of Medical Genetics, Indiana University School of Medicine (1995), the Trustee's Alumni Award from Western Maryland College (1998), the Leadership Award from the International Genetic Epidemiology Society (2006), induction as an alumni member into Phi Beta Kappa (2010) and the NHGRI Outstanding Mentor Award (2011).

Selected Publications

  1. Bailey-Wilson JE, Amos CI, Pinney SM, Petersen GM, de Andrade M, Wiest JS, Fain P, Schwartz AG, You M, Franklin W, Klein C, Gazdar A, Rothschild H, Mandal D, Coons T, Slusser J, Lee J, Gaba C, Kupert E, Perez A, Zhou X, Zeng D, Liu Q, Zhang Q, Seminara D, Minna J, Anderson MW. A major lung cancer susceptibility locus maps to chromosome 6q23-25. Am J Hum Genet. 2004;75(3):460-74.
  2. Sei Y, Zhao X, Forbes J, Szymczak S, Li Q, Trivedi A, Voellinger M, Joy G, Feng J, Whatley M, Jones MS, Harper UL, Marx SJ, Venkatesan AM, Chandrasekharappa SC, Raffeld M, Quezado MM, Louie A, Chen CC, Lim RM, Agarwala R, Schäffer AA, Hughes MS, Bailey-Wilson JE, Wank SA. A Hereditary Form of Small Intestinal Carcinoid Associated With a Germline Mutation in Inositol Polyphosphate Multikinase. Gastroenterology. 2015;149(1):67-78.
  3. Holzinger ER, Szymczak S, Dasgupta A, Malley J, Li Q, Bailey-Wilson JE. Variable selection method for the identification of epistatic models. Pac Symp Biocomput. 2015:195-206.
  4. Musolf AM, Simpson CL, Moiz BA, Pikielny CW, Middlebrooks CD, Mandal D, de Andrade M, Cole MD, Gaba C, Yang P, You M, Li Y, Kupert EY, Anderson MW, Schwartz AG, Pinney SM, Amos CI, Bailey-Wilson JE. Genetic Variation and Recurrent Haplotypes on Chromosome 6q23-25 Risk Locus in Familial Lung Cancer. Cancer Res. 2021;81(12):3162-3173.
  5. Musolf AM, Ho WSC, Long KA, Zhuang Z, Argersinger DP, Sun H, Moiz BA, Simpson CL, Mendelevich EG, Bogdanov EI, Bailey-Wilson JE, Heiss JD. Small posterior fossa in Chiari I malformation affected families is significantly linked to 1q43-44 and 12q23-24.11 using whole exome sequencing. Eur J Hum Genet. 2019;27(10):1599-1610.

Related Scientific Focus Areas

This page was last updated on Tuesday, October 4, 2022