SBIR-STTR Award

Analytic Methods for Heterogeneous Multilevel Data
Award last edited on: 6/30/08

Sponsored Program
SBIR
Awarding Agency
NIH : NIGMS
Total Award Amount
$718,268
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
Edward C Chao

Company Information

Data Numerica Institute Inc (AKA: Data Numerica)

6120 149th Avenue SE
Bellevue, WA 98006
   (425) 591-7944
   echao@datanumerica.com
   www.datanumerica.com
Location: Single
Congr. District: 09
County: King

Phase I

Contract Number: 1R44GM076846-01A1
Start Date: 00/00/00    Completed: 00/00/00
Phase I year
2006
Phase I Amount
$100,490
Multilevel data are very common in sociological, behavioral and biomedical research. The data may come from longitudinal community surveys, genetic family studies or spatial-temporal studies that investigate health outcomes. Typically, interest focuses on the impact of a treatment intervention. Such data can be very complex when the data structure has multiple levels: factors such as community, family, patient and repeated measures over time may be nested within or crossed with one another. For continuous responses, hierarchical models such as linear mixed-effects models and latent variable models have been studied and applied. In the analysis, the major interest is to study the impact of a specific causal pathway on a health outcome. Because the records in each cluster are often correlated, the investigator has to adjust for heterogeneity within a cluster of observations or between clusters. Overdispersion is also very common in such data. The major aim of this project is to investigate analytic methods for continuous and discrete outcomes of this nature. In this area, researchers typically apply generalized linear mixed-effects models (GLMMs), marginal models or transition models to non-continuous data. The difficulty with models such as GLMMs is that estimation methods often have trouble achieving unbiasedness, consistency and efficiency. We are interested in developing more robust methods that achieve these goals for continuous and discrete multilevel data of arbitrary dimension. The final result will be a software library with flexible multilevel modeling approaches for the analysis of complex multilevel data. The software will be useful to biomedical researchers working on sociological, behavioral and biomedical studies with complex data structures. Manuscripts and course packs will be developed to assist practitioners in applying appropriate methods and the software tool to their studies.
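
As an illustration of the within-cluster correlation the abstract refers to, the following minimal sketch simulates two-level data from a random-intercept model and estimates the intraclass correlation with the classical one-way ANOVA estimator. It is not the project's method or software: the function names (simulate_clusters, anova_icc), the balanced design and the normal distributions are all assumptions chosen for the example.

```python
import random
from statistics import mean


def simulate_clusters(n_clusters, cluster_size, sd_between, sd_within, seed=0):
    """Simulate y_ij = u_i + e_ij with a shared random intercept u_i per cluster.

    Hypothetical helper for illustration: balanced clusters, normal effects.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n_clusters):
        u = rng.gauss(0.0, sd_between)  # cluster-level effect shared by all members
        data.append([u + rng.gauss(0.0, sd_within) for _ in range(cluster_size)])
    return data


def anova_icc(data):
    """One-way ANOVA estimate of the intraclass correlation (balanced clusters)."""
    k = len(data)      # number of clusters
    n = len(data[0])   # observations per cluster
    grand = mean(y for cluster in data for y in cluster)
    cluster_means = [mean(cluster) for cluster in data]
    # Between-cluster and within-cluster mean squares.
    msb = n * sum((m - grand) ** 2 for m in cluster_means) / (k - 1)
    msw = sum(
        (y - m) ** 2 for cluster, m in zip(data, cluster_means) for y in cluster
    ) / (k * (n - 1))
    # Method-of-moments variance component, truncated at zero.
    var_between = max((msb - msw) / n, 0.0)
    return var_between / (var_between + msw)


if __name__ == "__main__":
    clustered = simulate_clusters(400, 10, sd_between=1.0, sd_within=1.0, seed=1)
    print("estimated ICC:", round(anova_icc(clustered), 3))
```

With equal between-cluster and within-cluster standard deviations the true intraclass correlation is 0.5, so records in the same cluster are far from independent. An analysis that ignores this clustering would typically understate standard errors, which is the adjustment problem that mixed-effects and marginal models address.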

Thesaurus Terms:
computer program/software, computer simulation, computer system design/evaluation, mathematical model, method development, model design/development, statistics/biometry, data collection methodology/evaluation, data quality/integrity, health behavior, outcomes research, handbook, information dissemination

Phase II

Contract Number: 4R44GM076846-02
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
2007
(last award dollars: 2008)
Phase II Amount
$617,778
