SBIR-STTR Award

Smoothing Methods to Investigate Non-linear Effect in Correlated Data Studies
Award last edited on: 9/20/13

Sponsored Program
SBIR
Awarding Agency
NIH : NIGMS
Total Award Amount
$809,260
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
Edward C Chao

Company Information

Data Numerica Institute Inc (AKA: Data Numerica)

6120 149th Avenue SE
Bellevue, WA 98006
   (425) 591-7944
   echao@datanumerica.com
   www.datanumerica.com
Location: Single
Congr. District: 09
County: King

Phase I

Contract Number: 1R44GM077755-01
Start Date: 00/00/00    Completed: 00/00/00
Phase I year
2006
Phase I Amount
$99,583
Correlated data are very common in health studies. Such data could come from longitudinal studies, community panel surveys, genetic family studies or spatial studies. Typically, linear mixed-effect models are used for modeling continuous response, and generalized linear mixed models are applied to non-Gaussian data. In addition to such likelihood approaches, quasi-likelihood methods based on generalized estimating equations GEE are often used when the distributional assumption is not realistic and not easy to specify. We propose to extend these methods to handle the situations when the covariate effect is non-linear or is not easy to be modeled parametrically. This is similar to generalized additive models, where a smooth curve is used to predict the impact of a covariate on a univariate outcome. The goal of this study is to develop statistical software for correlated data in two areas. The first is the spline smoothing methods for generalized additive mixed models, which combine the semiparametric methods in generalized additive models using smoothing methods and mixed-effect modeling for correlated data. The second is the semiparametric GEE methods, which extend the GEE methods for correlated data with kernel smoothing to model the non-linear impact on health outcome. The research includes statistical methods, algorithm development and application to real health problems. The study requires analytic development on innovative semiparametric statistical methods and algorithm development on computational intensive methods. Currently, there is no software for these areas. The aim is to overcome this deficiency and extend the benefits of using smoothing methods to model non-linear covariate effect. The result is a software package, SmoothEffect, for handling correlated data. A comprehensive case study guidebook using problems from longitudinal studies and others will come with the software. Technical reports and simulation studies will also be developed. This study is to develop flexible statistical smoothing methods and software for analyzing correlated data or clustered data such as longitudinal data, panel surveys or spatial data. The focus of interest is to analyze such clustered data where records from the same experimental unit are related and the impact from some predictor on health outcome shows a non-linear smoothing curvature, which is no easy to be parameterized

Phase II

Contract Number: 4R44GM077755-02
Start Date: 4/15/06    Completed: 1/31/09
Phase II year
2007
(last award dollars: 2008)
Phase II Amount
$709,677

Correlated data are very common in health studies. Such data could come from longitudinal studies, community panel surveys, genetic family studies or spatial studies. Typically, linear mixed-effect models are used for modeling continuous response, and generalized linear mixed models are applied to non-Gaussian data. In addition to such likelihood approaches, quasi-likelihood methods based on generalized estimating equations GEE are often used when the distributional assumption is not realistic and not easy to specify. We propose to extend these methods to handle the situations when the covariate effect is non-linear or is not easy to be modeled parametrically. This is similar to generalized additive models, where a smooth curve is used to predict the impact of a covariate on a univariate outcome. The goal of this study is to develop statistical software for correlated data in two areas. The first is the spline smoothing methods for generalized additive mixed models, which combine the semiparametric methods in generalized additive models using smoothing methods and mixed-effect modeling for correlated data. The second is the semiparametric GEE methods, which extend the GEE methods for correlated data with kernel smoothing to model the non-linear impact on health outcome. The research includes statistical methods, algorithm development and application to real health problems. The study requires analytic development on innovative semiparametric statistical methods and algorithm development on computational intensive methods. Currently, there is no software for these areas. The aim is to overcome this deficiency and extend the benefits of using smoothing methods to model non-linear covariate effect. The result is a software package, SmoothEffect, for handling correlated data. A comprehensive case study guidebook using problems from longitudinal studies and others will come with the software. Technical reports and simulation studies will also be developed. This study is to develop flexible statistical smoothing methods and softwarefor analyzing correlated data or clustered data such as longitudinal data, panel surveys or spatial data. The focus of interest is to analyze such clustered data where records from the same experimental unit are related and the impact from some predictor on health outcome shows a non-linear smoothing curvature, which is no easy to be parameterized.

Project Terms:
Accounting; Algorithms; Area; Awareness; base; Case Study; Communities; Computer software; Data; Data Analyses; Dependence; Development; discrete data; Documentation; Equation; Family Study; Female; Fertility; Genetic; Goals; guidebooks; Hand; Health; Heterogeneity; innovation; interest; Libraries; Longitudinal Studies; Menstrual cycle; Methodology; Methods; Modeling; Non-linear Models; Outcome; Pattern; Principal Investigator; Progesterone; public health medicine (field); Records; reproductive hormone; Research; Research Personnel; response; simulation; Statistical Methods; Surveys; technical report; Time; To specify; Urine; Variant