SBIR-STTR Award

De-Identification Software Tools for Cancer Imaging Research
Award last edited on: 8/11/2021

Sponsored Program
SBIR
Awarding Agency
NIH : NCI
Total Award Amount
$386,526
Award Phase
1
Solicitation Topic Code
411
Principal Investigator
Paul Bunting

Company Information

BioData Consortium LLC

1274 Pasadena Avenue NE
Atlanta, GA 30306
   (404) 314-9653
   N/A
   www.biodataconsortium.com
Location: Single
Congr. District: 05
County: Fulton

Phase I

Contract Number: 75N91020C00023
Start Date: 00/00/00    Completed: 00/00/00
Phase I year
2020
Phase I Amount
$386,526
Developing artificial intelligence technology for medical imaging applications requires training models on large and diverse datasets. Currently, aggregation of large data repositories, including radiology and pathology images, is limited by concerns around patient privacy. To share medical images successfully, an institution must be able to quickly and accurately de-identify large numbers of images in batches. This process is currently manual and time-consuming. We propose a pipeline to remove protected health information (PHI) from both radiology DICOM images and pathology whole slide images by leveraging machine learning, natural language processing, and compartmentalized workflow techniques to significantly reduce the human intervention needed to anonymize medical images. In addition to examining header data in the images, we will use optical character recognition and computer vision algorithms to detect text in any location or orientation in the image, then automatically record and subsequently purge these regions. These techniques will be configured to work on a variety of image types (CT, MRI, radiograph, etc.) and cover multiple OEM vendors for both radiology and pathology images. This Phase I statement of work will construct the software tools, methods, and datasets necessary to facilitate a Phase II in which the complex algorithms needed for autonomous de-identification will be developed. This Phase II processing will be referred to throughout this document as the workflow.
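
The two steps named in the abstract, examining header data and then recording and purging detected text regions, can be illustrated with a minimal sketch. It is not the awardee's implementation. It assumes the header is already parsed into a plain dictionary, the image is a grid of pixel values, and the text regions come from an OCR or text-detection stage that is not shown. The names PHI_FIELDS, Region, AuditLog, scrub_header, purge_regions, and deidentify are hypothetical, and the three listed header fields are examples only.

```python
from dataclasses import dataclass, field

# Hypothetical, deliberately short list of header fields treated as PHI.
# A real pipeline would follow a full de-identification profile instead.
PHI_FIELDS = {"PatientName", "PatientID", "PatientBirthDate"}


@dataclass
class Region:
    """Axis-aligned box of burned-in text: rows [top, bottom), cols [left, right)."""
    top: int
    left: int
    bottom: int
    right: int


@dataclass
class AuditLog:
    """Record of what was removed, kept so the process can be reviewed."""
    removed_fields: list = field(default_factory=list)
    purged_regions: list = field(default_factory=list)


def scrub_header(header, log):
    """Return a copy of the header without PHI fields, logging each dropped key."""
    clean = {}
    for key, value in header.items():
        if key in PHI_FIELDS:
            log.removed_fields.append(key)
        else:
            clean[key] = value
    return clean


def purge_regions(pixels, regions, log, fill=0):
    """Return a copy of the pixel grid with each region overwritten by `fill`."""
    out = [row[:] for row in pixels]
    height = len(out)
    width = len(out[0]) if out else 0
    for r in regions:
        # Clamp the box to the image bounds so oversized detections are safe.
        top, bottom = max(0, r.top), min(height, r.bottom)
        left, right = max(0, r.left), min(width, r.right)
        for y in range(top, bottom):
            for x in range(left, right):
                out[y][x] = fill
        log.purged_regions.append(r)
    return out


def deidentify(header, pixels, regions):
    """Run both steps and return the clean header, clean pixels, and audit log."""
    log = AuditLog()
    clean_header = scrub_header(header, log)
    clean_pixels = purge_regions(pixels, regions, log)
    return clean_header, clean_pixels, log
```

Both functions return copies rather than modifying their inputs, so the original image stays available for manual review if a detection turns out to be wrong. The audit log corresponds to the "record" step in the abstract: each purged region is logged alongside the header fields that were dropped.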

Phase II

Contract Number: ----------
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
----
Phase II Amount
----