Registries offer an important way to collect uniform data with a view toward understanding the etiology of diseases, and to improving their treatment. A major constraint in establishing registries is the cost of data acquisition, especially when not available from existing sources. The CDC SB EMR is a tool for collecting data on SB patients when they first visit a clinic and in each subsequent year. An ever-increasing number of hospitals and clinics are installing an electronic medical record (EMR) system. Non-integration of these separate systems poses a significant challenge for SB clinics: the requirement of double-enter data. One solution to this problem is to develop software that can extract data from EMR and legacy medical record software, and transport it to the CDC SB EMR for transmission to CDC for analysis. Such extraction software may facilitate additional SB clinics participation in the NBSPR. Further, given our approach of first developing a framework for extraction software, the same technology can be applied to build other registries. The existence of a generic EMR/legacy system extraction tool would permit improving the efficiency and effectiveness of other public health reporting systems, such as those of state and local health departments, by reducing the need for data entry and improving the accuracy of reported data.