SBIR-STTR Award

Superior Speech Acquisition and Robust Automatic Speech Recognition for Integrated Spacesuit Audio Systems
Award last edited on: 4/18/2019

Sponsored Program
SBIR
Awarding Agency
NASA : GRC
Total Award Amount
$699,937
Award Phase
2
Solicitation Topic Code
O3.02
Principal Investigator
Yiteng (Arden) Huang

Company Information

WeVoice Inc

1065 Route 22 West Suite 2e
Bridgewater, NJ 08807
   (908) 393-6101
   info@wevoiceusa.com
   www.wevoiceusa.com
Location: Single
Congr. District: 07
County: Somerset

Phase I

Contract Number: ----------
Start Date: ----    Completed: ----
Phase I year
2009
Phase I Amount
$99,940
Astronauts suffer from poor dexterity of their hands due to the clumsy spacesuit gloves during Extravehicular Activity (EVA) operations and NASA has had a widely recognized but unmet need for novel human machine interface technologies to facilitate data entry, communications, and robots or intelligent systems control. In this proposed Phase I research, WeVoice, Inc., plans to design, to begin the implementation of, and to evaluate a speech human interface system. Loud noise and strong reverberation inside spacesuits make automatic speech recognition (ASR) for such an interface a very challenging problem. WeVoice proprietary microphone array signal processing algorithms for speech acquisition will be taken advantage of. Pros and cons of beamforming vs. multichannel noise reduction for ASR will be assessed and recommendations for the best front-end technique will be established. Using two ASR programs (one based on HTK and the other in C/C++) that were previously developed at WeVoice, Inc., a number of robust methods (ranging from feature transformation and normalization to environmental adaptation) will be validated. In addition, the feasibility of using throat vibration microphones will be explored. The Phase I research is also concerned with the compromise of ASR accuracy and system complexity. A comparative study will be undertaken between two system implementation structures, namely wearable and distributed systems. This effort will form a foundation for prototype design to be conducted in Phase II.

Phase II

Contract Number: ----------
Start Date: ----    Completed: ----
Phase II year
2010
Phase II Amount
$599,997
Astronauts suffer from poor dexterity of their hands due to the clumsy spacesuit gloves during Extravehicular Activity (EVA) operations and NASA has had a widely recognized but unmet need for novel human machine interface technologies to facilitate data entry, communications, and robots or intelligent systems control. The objective of this research project is to develop a speech human interface that can offer both crewmember usability and system operational efficiency. But loud noise and strong reverberation inside spacesuits make automatic speech recognition (ASR) for such an interface a very challenging problem. In Phase I, the feasibility of using WeVoice proprietary microphone array signal processing and robust ASR technologies was validated. In particular, it was found that novel multichannel noise reduction produces larger gain in SNR than conventional beamforming but the latter is more preferable as far as ASR is concerned. In addition, it was confirmed that the model adaptation algorithm can make an ASR system more robust inside spacesuits. An arithmetic complexity model for ASR was developed. It can direct the decision as to whether a specified speech interface is sufficiently efficient to be possibly implemented with a wearable system. Phase II will analyze and minimize the scientific and engineering uncertainties identified during Phase I. Furthermore, a voice command interface for future generations of a suit's processing system is proposed to be developed on DSP chips. The system should be ready for testing and use by NASA suited crewmembers at the end of Phase II.