SBIR-STTR Award

Cluster Comparison Methods & the NCI Expression Dataset
Award last edited on: 10/8/02

Sponsored Program
SBIR
Awarding Agency
NIH : NCI
Total Award Amount
$98,438
Award Phase
1
Solicitation Topic Code
-----

Principal Investigator
Michael J McManus

Company Information

Anvil Informatics Inc

25 Corporate Drive
Burlington, MA 01803
   (781) 272-1600
   info@anvilinfo.com
   www.anvilinformatics.com
Location: Single
Congr. District: 06
County: Middlesex

Phase I

Contract Number: 1R43CA096179-01
Start Date: 00/00/00    Completed: 00/00/00
Phase I year
2002
Phase I Amount
$98,438
There is a significant commercial and academic need for new tools that provide quantitative cluster comparison metrics. It is important for pharmaceutical and biotechnology companies to be able to critically evaluate the utility of using different clustering techniques on large high dimensional datasets, in order to make the most informed decisions based upon the clustering results. We propose to evaluate and build bluster comparison metrics, integrating them with high dimensional visualization techniques, so that not only an overall scope, but the cluster distributions can be compared in an intuitive visual fashion. In carrying out our analysis, we will focus on the NCI (approximately 1,400) compound, subset, 118 known mechanism of action compound gene expression dataset analyzed by Scherf, et.al (2000). IN A FOLLOW ON Phase II SBIR Proposal, we will create a robust software package for commercial release where cluster comparison metrics are integrated with the most valuable visualization tools we identify in the Phase I research. PROPOSED COMMERCIAL APPLICATIONS: The Specific Aims of this Phase I proposal will allow us to create new tools where cluster comparison metrics are integrated with high dimensional visualization techniques, so that not only an overall score, but the cluster distributions can be compared in an intuitive visual fashion. We will use the publicly available NCI DIS compound subset, gene expression dataset of Scherf, e.g. al. (2000) to carry out these aims, as ell as data mine this dataset for new discoveries.

Thesaurus Terms:
artificial intelligence, cancer information system, computer data analysis, computer program /software, computer system design /evaluation, data collection methodology /evaluation, mathematics informatics, information retrieval

Phase II

Contract Number: ----------
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
----
Phase II Amount
----