SBIR-STTR Award

Synthesis of HRS-SSA linked data
Award last edited on: 9/25/07

Sponsored Program
STTR
Awarding Agency
NIH : NIA
Total Award Amount
$99,986
Award Phase
1
Solicitation Topic Code
-----

Principal Investigator
Lars Vilhuber

Company Information

Aces-Research LLC

38 Beckett Way
Ithaca, NY 14850
   (607) 257-4673
   john@aces-research.com
   www.aces-research.com

Research Institution

Cornell University

Phase I

Contract Number: 1R41AG029756-01
Start Date: 00/00/00    Completed: 00/00/00
Phase I year
2007
Phase I Amount
$99,986
The Health and Retirement Study is one of the world's most important data resources for the study of aging. The basic longitudinal survey instrument has been supplemented with data from a variety of other sources including Social Security Administration records containing the detailed earnings history of the respondent. Under current HRS protocols, the use of the SSA data is restricted. Investigators must make special security arrangements to obtain these ?les, which they can not redistribute once they complete their analyses. These protocols are necessary to preserve the confidentiality of the underlying HRS and SSA micro data, which is essential to the continued willingness of respondents to participate in the study, but they severely limit the usefulness of the SSA data. New statistical disclosure limitation methods have been developed that promise to provide much of the information in confidential micro data in a manner that permits much wider dissemination and use of the protected data. This project is a Phase I feasibility study of applying these new methods, called synthetic data, to a subset of the variables in the SSA records that link to the general-use RAND-HRS data. The project has three main components: (1) port a general data synthesizer that was developed at the U.S. Census Bureau for use with SSA data linked to the Survey of Income and Program Participation for adaptation to the HRS/SSA link; (2) synthesize a few variables from the HRS/SSA link and test their usefulness in statistical modeling; (3) perform studies of the statistical disclosure risk associated with linking synthetic SSA data to the RAND-HRS general-release ?le. If the confidentiality-protected data prove scientifically useful and if the statistical disclosure risk can be controlled, then Phase II of the research would synthesize the entire HRS/SSA data link. The project scientists will work with the HRS Data Release Protocol Committee and SSA to develop appropriate certifications of the statistical disclosure avoidance provided by the methods. 1 7 Project Narrative Many users of the Health and Retirement Study general-release data ?les would benefit from some access to the restricted-release ?les without the requirement for special security arrangements. Critical variables on the restricted- release data from the Social Security Administration earnings histories will be confidentiality-protected using powerful new methods that preserve privacy while allowing many important statistical analyses to be performed combining the protected SSA and general-release HRS data. If the demonstration is successful, the methods will be extended to the entire HRS/SSA linked data in a future project.

Phase II

Contract Number: ----------
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
----
Phase II Amount
----