Skip to main content

Discovering rare variants associated with substance use in 65,000 deep whole genome sequences in the Trans-Omics for Precision Medicine (TOPMed) Program

Authors
H. Young, M. Liu, G. Datta, D. Beame, D. Becker, M. Bowers, B. Cade, F. Chen, S. David, L. Emery, M. Foreman, E. Fox, S. Gharib, D. Glahn, M. Hall, J. He, J. Hokanson, S. Hwang, A. Justice, R. Kaplan, C. Laurie, D. Levy, D. Liu, L. Martin, D. McGuire, M. Ragland, R. Reed, A. Shadyab, T. Wang, R. Wedow, K. Wehr, W. White, L. Yanek, K. Young, W. Zhao, G. Abecasis, K. North, S. Vrieze, GWAS and Sequencing Consortium of Alcohol & Nicotine, Trans-Omics for Precision Medicine
Name and Date of Professional Meeting
Behavior Genetics Association (Boston, June 20-23, 2018)
Associated paper proposal(s)
Working Group(s)
Abstract Text
Smoking is a moderately heritable behavior that is one of the leading preventable causes of death in the United States. Genome wide association studies have discovered hundreds of common variants associated with tobacco use in individuals of European ancestry, but similar associations have not been studied in non-European populations. Low frequency variants are expected to affect smoking risk, but any such rare variants are yet to be identified, likely due in part to low statistical power of existing studies.

As a part of the Trans-Omics for Precision Medicine Whole Genome Sequencing Program (TOPMed), we seek to identify novel associations between rare variants and four measures of smoking behavior in a multi-ethnic sample. Two of these phenotypes relate to the initiation of smoking, capturing whether an individual has ever smoked regularly in their life and the age at which they began smoking regularly. We also measured smoking cessation through a comparison of current and former smokers. Finally, the heaviness of tobacco use among smokers is measured with the number of cigarettes smoked per day.

At the time of writing, the TOPMed project has called genotypes from 65,000 individuals with 30x whole genome sequences. A subset of 45,000 individuals from this data freeze have one or more smoking phenotypes. We will present results for all four of our smoking phenotypes, including single variant tests on up to 582,000,000 variants and gene based tests on rare nonsynonymous and loss of function variants. To date, we have conducted single variant association tests for cigarettes smoked per day with 154,176,919 variants with minor allele count > 10 using whole genome sequences from 10,444 individuals. This preliminary meta-analysis showed no significant associations between any variants and cigarettes smoked per day, suggesting that additional samples will be required to make discoveries for this and other complex substance use phenotypes. We will present updated results from single variant tests and gene based tests on much larger samples, up to the full sample of 45,000 with one or more smoking phenotypes. Our power to detect effects will increase as the remainder of our sample is analyzed and as the data available to us through TOPMed continues to grow with the release of new data freezes.
Back to top