SBIR-STTR Award

Computational System to Predict Novel Genetic Disease Associations
Award last edited on: 8/27/18

Sponsored Program
SBIR
Awarding Agency
NIH : NCHGR
Total Award Amount
$950,537
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
Martin G Reese

Company Information

Fabric Genomics (AKA: Omica Inc )

1611 Telegraph Avenue Suite 500
Oakland, CA 94612
   (510) 595-0800
   info@fabricgenomics.com
   www.fabricgenomics.com
Location: Single
Congr. District: 13
County: Alameda

Phase I

Contract Number: 1R43HG003667-01
Start Date: 5/3/05    Completed: 6/30/11
Phase I year
2005
Phase I Amount
$99,757
Rapid advances in biotechnology and clinical studies have produced an information bottleneck in data processing that has hindered application of such critical scientific data for use in medicine and disease diagnosis. Omicia will develop a novel genetics based informatics infrastructure to overcome this bottleneck that can predict the functional outcome of gene mutations and their relationship to human disease by integrating diverse knowledge bases in a consistent and structured manner. Implementation of the technology will reduce the cost and enhance the effectiveness of genetic association studies, population profiling analyses and pharmacogenomic applications, as well as potentially identify novel gene targets for disease. Our technology will mine classification systems for large number of genes and create statistical associations as well as integrate ontologies to allow for reliable inferences about functions of genes and their relation to disease phenotypes. Importantly, the predictive capabilities of the technology will assign functions to orphan genes and genetic markers for which very little functional information is available. The IT system will consist of a classification of genes into families using shared attributes in a number of well-structured domains to create a gene inference system (GIS) by comparing gene ontologies in GO with disease ontologies in MeSH. It will also use the well-annotated HGMD database to infer novel disease-related function to unannotated polymorphisms such as those in dbSNP. These studies are designed to demonstrate the feasibility of the technology so that it can be validated through future genetic association studies in a Phase II SBIR

Phase II

Contract Number: 2R44HG003667-02A1
Start Date: 5/3/05    Completed: 6/30/11
Phase II year
2009
(last award dollars: 2010)
Phase II Amount
$850,780

High-throughput genotyping, expression, and sequencing technologies and the development of increasingly sophisticated methods for predicting gene-disease interactions have given the new field of genomic-based personalized medicine a wealth of data: more, in fact, than can easily be processed and interpreted. Omicia is in the business of developing computational tools and diagnostics in the field of personalized medicine for cardiovascular disease (CVD). As part of that effort we are developing a software infrastructure that uses these abundant data to identify and prioritize candidate genes and their sequence variations for clinical evaluation. The research proposed in this application has three aims. In Phase II, we will enhance and optimize Omicia's Gene Inference System (GIS), prototyped during the SBIR Phase I project. In Phase II we will add additional capabilities to its candidate gene identification methods. This will build upon the Omicia Disease Genes (ODG) ontology built in Phase I, and add pathway and protein-interaction data and include ""top"" candidates from external publicaly-available clinical studies. In Aim 2, we will enhance the ability of GIS to prioritize sequence variations by improving our novel paralogous-gene variation identification algorithm (iDIP) and by integrating existing amino-acid substitution (AAS) methods. By running these algorithms over all known human genes via dbSNP, we expect to identify a list of candidate variations that will be evaluated in Aim 3 including an in vitro experiment. The goal of Aim 3 is to test the gene- and variant-predictive power of GIS and to compare it to other selection methods. The clinical study will be a single-stage case control design to test the ""top"" variant candidates from Aims 1 & 2 and compare the potential association to well-established genetic markers for the risk of myocardial infarction (MI). With approximately 700 cases and 700 matched controls, our association study will be well powered to test our predicted functional markers for MI. The GIS infrastructure is an integral part of the commercial workflow of Omicia, and will form the basis of the product pipeline. As such, it will serve as licensable commercial technology for the company by helping other biotechnology companies to develop their genetic biomarkers for diagnostic and therapeutic developments (theranostics). In addition, any novel variants drawn from this Phase II study will be licensable and exploitable intellectual property, useful both as the basis for future products in our internal pipeline, as well as potentially valuable additions to our patent portfolio. The Phase II goals of enhancing and clinically validating GIS serve three purposes: proving our methodology as applied to CVD and opening the door to applications in other disease areas; showcasing GIS as a key technology for managing complexity in the post-genomic era and providing clinically-relevant insights; and finally, potentially identifying valuable IP in the form of novel genetic markers for MI.

Public Health Relevance:
The outcome of this project will be an evaluated gene inference system (GIS) for identifying gene-disease interactions, with a focus in the area of cardiovascular disease (CVD). This system will be used as part of the Omicia product pipeline, and can also be licensed to third parties. In addition, any novel genetic markers identified as part of the validation study will themselves be valuable additions to the Omicia product and IP portfolio. Omicia's goal is to provide content and analysis tools for molecular diagnostic tests for cardiovascular conditions, with the promise of identifying patients at high risk to enable them to begin preventive care before symptoms appear. Given the prevalence of CVD in the developed world, these products are potentially a great boon to public health, as well as being significant commercial opportunities.

Thesaurus Terms:
Address; Agreement; Algorithms; Alleles; Allelomorphs; Amino Acid Sequence; Amino Acid Substitution; Area; Base Sequence; Bio-Informatics; Bioinformatics; Biological; Biological Function; Biological Process; Biotechnology; Businesses; Candidate Disease Gene; Candidate Gene; Cardiac Infarction; Cardiovascular; Cardiovascular Body System; Cardiovascular Diseases; Cardiovascular System; Cardiovascular System (All Sites); Caring; Causality; Clinical; Clinical Evaluation; Clinical Research; Clinical Study; Clinical Testing; Clinical Trials, Phase Ii; Code; Coding System; Collection; Complement; Complement Proteins; Computer Programs; Computer Simulation; Computer Software; Computerized Models; Cross-Product Ratio; Custom; Dna; Dna Sequencing, Human Gene Mapping; Data; Data Banks; Data Bases; Databank, Electronic; Databanks; Database, Electronic; Databases; Deoxyribonucleic Acid; Diagnostic; Disease; Disease Association; Disorder; Effectiveness; Etiology; Evaluation; Exons; Future; Gene Mapping, Human; Gene Variant; Genehomolog; Genes; Genetic; Genetic Condition; Genetic Diseases; Genetic Diversity; Genetic Markers; Genetic Polymorphism; Genetic Research; Genetic Variation; Genome; Genomics; Genotype; Goals; Hereditary Disease; Homolog; Homologous Gene; Homologue; Human; Human Gene Mapping; Human Genetic Mapping; Human, General; In Vitro; Individual; Informatics; Infrastructure; Intellectual Property; Investigators; Knowledge; Lead; Legal Patent; Licensing; Link; Msh; Man (Taxonomy); Man, Modern; Mathematical Model Simulation; Mathematical Models And Simulations; Mesh; Mesh Headings; Medical; Medical Subject Headings; Medicine; Method Loinc Axis 6; Methodology; Methods; Methods And Techniques; Methods, Other; Mining; Minings; Models, Computer; Molecular Biology, Protein Sequencing; Molecular Diagnostic Testing; Molecular Disease; Myocardial Infarct; Myocardial Infarction; Natural Language Processing; Neighborhoods; Nucleotide Sequence; Omim; Odds Ratio; Online Mendelian Inheritance In Man; Ontology; Organ System, Cardiovascular; Organism; Outcome; Paper; Patents; Pathway Interactions; Patients; Pb Element; Peptide Sequence Determination; Performance; Phase; Phase 2 Clinical Trials; Phase Ii Clinical Trials; Phenotype; Play; Polymorphism (Genetics); Polymorphism, Genetic; Position; Positioning Attribute; Prevalence; Preventive; Procedures; Process; Processings, Natural Language; Protein Sequencing; Protein Structure, Primary; Proteins; Public Health; Relative Odds; Research; Research Infrastructure; Research Personnel; Researchers; Risk; Risk Ratio; Role; Running; Sbir; Sbirs (R43/44); Science; Science Of Medicine; Sequence Determinations, Amino Acid; Sequence Determinations, Protein; Sequence Homology; Shapes; Simulation, Computer Based; Slice; Small Business Innovation Research; Small Business Innovation Research Grant; Software; Staging; Symptoms; System; System, Loinc Axis 4; Txt; Techniques; Technology; Testing; Testing, Molecular Diagnostic; Text; Validation; Variant; Variation; Variation (Genetics); Vascular, Heart; Work; Abstracting; Allelic Variant; Association Test; Base; Biomarker; Cardiac Infarct; Cardiovascular Disorder; Case Control; Circulatory System; Clinical Data Repository; Clinical Data Warehouse; Clinical Relevance; Clinical Test; Clinically Relevant; Computational Modeling; Computational Models; Computational Simulation; Computational Tools; Computer Based Models; Computer Program/Software; Computerized Modeling; Computerized Simulation; Computerized Tools; Coronary Attack; Coronary Infarct; Coronary Infarction; Data Mining; Data Repository; Database Of Genotypes And Phenotypes; Datamining; Design; Designing; Disease Causation; Disease Etiology; Disease/Disorder; Disease/Disorder Etiology; Disorder Etiology; Expectation; Experiment; Experimental Research; Experimental Study; Gene Product; Genetic Association; Genetic Disorder; Genome-Wide; Heart Attack; Heart Infarct; Heart Infarction; Heavy Metal Pb; Heavy Metal Lead; Hereditary Disorder; High Risk; Homology (Molecular); Human Disease; Improved; In Silico; Information Processing; Insight; Knowledge Base; Living System; Member; Natural Language Understanding; Novel; Novel Marker; Nucleic Acid Sequence; Paralog; Paralogous Gene; Pathway; Phase 2 Study; Phase 2 Trial; Phase Ii Trial; Polymorphism; Protein Sequence; Protocol, Phase Ii; Prototype; Public Health Medicine (Field); Public Health Relevance; Relational Database; Research Clinical Testing; Research Study; Social Role; Study, Phase Ii; Technology Development; Therapeutic Development; Tool; Validation Studies; Virtual Simulation