SBIR-STTR Award

AI-DLCS: Artificial Intelligence for Data Labeling and Curation at Scale
Award last edited on: 10/8/2024

Sponsored Program
SBIR
Awarding Agency
DHS
Total Award Amount
$171,433
Award Phase
1
Solicitation Topic Code
DHS241-002
Principal Investigator
Amit Juneja

Company Information

Agile Data Decisions LLC (AKA: Agile Data Decisions Inc)

1407 Pinewood Ct
Pearland, TX 77581
   (617) 792-5347
   N/A
   www.agiledd.com
Location: Single
Congr. District: 22
County: Brazoria

Phase I

Contract Number: 70RSAT24C00000033
Start Date: 5/7/2024    Completed: 10/6/2024
Phase I year
2024
Phase I Amount
$171,433
The Department of Homeland Security (DHS) grapples with vast and diverse datasets collected daily, ranging from personal property scans to Stream of Commerce (SoC) data. To analyze and improve algorithms for detecting explosives and prohibited items, efficient curation and labeling are essential. However, DHS faces challenges, including data processing inefficiencies, dependency on human labeling, limited scalability, predictive analytics and threat detection obstacles, and inter-agency collaboration barriers. In response, Agile Data Decisions, Inc. (AgileDD) proposes an innovative solution called AI for Data Labeling and Curation at Scale (AI-DLCS). Leveraging their iQC human-in-the-loop AI platform and the CargoSeer AI platform, the project aims to address DHS's challenges. CargoSeer AI, developed by CargoSeer LTD, specializes in consignment inspection, utilizing a Large Foundation Model to automatically inspect scanned cargo for fraud. AgileDD plans to enhance these platforms with new algorithms for (1) labeling at scale from a known single image with few-shot learning, and (2) multi-class/multi-label image classification and object detection with weakly supervised learning. The technical objectives of the proposed Phase I research include developing a data ingestion and pre-processing pipeline for diverse image and document formats, establishing standardized metrics for auto-labeling, implementing large-scale auto-labeling with few-shot learning, conducting multi-label and multi-class auto-labeling on a large dataset, and demonstrating a proof-of-concept workflow on the ImageNet dataset. The goal is to enhance efficiency, reduce human dependency, and improve scalability for DHS in handling complex and extensive datasets crucial for security and defense decision-making. The proposed solution showcases promise in revolutionizing data handling processes for security applications.

Phase II

Contract Number: ----------
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
----
Phase II Amount
----