SBIR-STTR Award

Labeling of Text to Train Machine Learning Models to Improve the E in the USAFs Processing, Exploitation, and Dissemination Systems
Award last edited on: 1/14/2020

Sponsored Program
SBIR
Awarding Agency
DOD : AF
Total Award Amount
$50,000
Award Phase
1
Solicitation Topic Code
AF183-002
Principal Investigator
Shannon Hynds

Company Information

Thresher Ventures LLC

841 Elm Street Suite 333
Mclean, VA 22101
   (703) 623-5590
   info@thresher.io
   www.thresher.io
Location: Single
Congr. District: 08
County: Fairfax

Phase I

Contract Number: 140D04-19-C-0029
Start Date: 2/1/2019    Completed: 2/1/2020
Phase I year
2019
Phase I Amount
$50,000
The USAF relies on automated text processing systems powered by machine learning to support its Processing, Exploitation, Dissemination mission. The effectiveness of these systems are dependent on domain specific, relevant, and current labeled training data. Yet often, due to to a variety of human, data, and machine challenges, training data sets do not keep pace with the semantic evolution and expansion that is inevitable in a dynamic PED environment. Thresher proposes a feasibility study in conjunction with the 480th ISR Wing to examine the extent to which QuickCode--commercially available software for quickly and transparently labeling text for training data--can help analysts get better results from the machine learning systems they use to process text. The feasibility study will use publicly available data in at least two languages to demonstrate the speed, transparency and accuracy. In addition, the study will use structured approaches for mapping user needs with a particular focus on requirements for integrating QuickCode into an analysts workflow. QuickCode makes use of Threshers patent-pending multilingual label recommendation method built on an open source, machine learning stack, originally developed at Harvard to help researchers find code words in Chinese social media and augmented with funding from DARPA. automated text analysis, Natural Language Processing, artificial intelligence, machine learning, Human Computer Interaction, training data, Social Science, PED

Phase II

Contract Number: ----------
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
----
Phase II Amount
----