SBIR-STTR Award

A New Reference Panel to Boost African American Genotype Imputation
Award last edited on: 1/24/2018

Sponsored Program
SBIR
Awarding Agency
NIH : NHGRI
Total Award Amount
$1,758,558
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
Adam Auton

Company Information

23andMe Inc

349 Oyster Point Boulevard
South San Francisco, CA 94080
   (650) 938-6300
   bd@23andme.com
   www.23andme.com
Location: Single
Congr. District: 15
County: San Mateo

Phase I

Contract Number: N/A
Start Date: 9/28/2016    Completed: 8/31/2017
Phase I year
2016
Phase I Amount
$1
Direct to Phase II

Phase II

Contract Number: 1R44HG009460-01
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
2016
Phase II Amount
$1,758,557
Modern genetic studies have been conducted predominantly in cohorts of individuals of European ancestry. By 2010, there were approximately ten times as many published genome wide association studies (GWAS) in people of European ancestry than studies in people of all other ancestries combined. This research disparity has led to an uneven understanding of the genetic basis underlying disease in Europeans and non-Europeans. 23andMe's web-based, large scale research model is ideal for scaling genetics research within non-European populations and thereby bringing more parity to genetics research. Our database is composed of genotypes and phenotypes of over 1,000,000 consenting customers, including over 200,000 individuals with non-European ancestry. The data derived from non-European individuals represent a particularly valuable resource for genetic discovery of novel variants that may not be found in the European population. However, research studies in non-European populations are weakened by the lack of availability of large-scale reference datasets and, in particular, genotype imputation panels. Genotype imputation is a statistical methodology that uses observations of genotypes in a large reference panel to infer unobserved genotypes in a target dataset. This methodology is widely used within GWAS, and allows novel genetic associations to be identified and refined. Due to this utility, very large reference panels have been constructed, containing thousands or tens of thousands of whole genome sequences. Unfortunately, the largest imputation panels are composed of predominantly European genomes, reflecting the modern bias towards European studies in GWAS. This proposal aims to address this imbalance by constructing an imputation panel specifically for the African American population. In doing so, we will expand 23andMe’s ability to perform genetic discovery in non-European populations, and improve the understanding of global genetic variation underlying diseases and traits. Key commercial outcomes of the research include the identification of novel genetic targets for internal and external therapeutic development. The long-term aim is to improve understanding of disease in minority populations, which we hope may eventually lead to improved treatments of disease in these historically medically understudied groups.    

Public Health Relevance Statement:
Project Narrative   Genome wide association studies have yielded many discoveries of genes associated with disease, but the vast majority of such studies have been conducted in European populations. This proposal aims to empower genetic research in non-European populations by developing a reference dataset from targeted DNA sequencing of the diversity contained within the 23andMe research participant database. This study will enable 23andMe and other researchers to better discover disease variants in individuals with non-European ancestry, with a particular focus on African American populations.      

Project Terms:
Address; African American; base; Biological; Biological Assay; cohort; Communities; Consent; Data; Data Set; Databases; Disease; disease phenotype; DNA; DNA Sequence; drug development; empowered; European; Funding; Future; gene discovery; Genetic; genetic association; Genetic Polymorphism; Genetic Research; genetic resource; Genetic study; Genome; genome sequencing; genome wide association study; Genomics; Genotype; Haplotypes; Human; improved; Individual; Large-Scale Sequencing; Latino; Lead; Methodology; Minority; Modeling; novel; Online Systems; Outcomes Research; parity; Participant; Performance; phenome; Phenotype; Population; Publishing; Reading; Research; Research Personnel; research study; Resources; response; Sampling; Site; Surveys; Technology; Testing; therapeutic development; Time; trait; Variant; Variation (Genetics); whole genome