NSF 2005 Tools for Information Retrieval and Document Classification Using Fast Phonetic Word-Spotting Technology

Tools for Information Retrieval and Document Classification Using Fast Phonetic Word-Spotting Technology
Award last edited on: 3/25/2024

Awarding Agency

NSF

Total Award Amount

$99,999

Award Phase

Solicitation Topic Code

-----

Principal Investigator

Robert Morris

Nexidia Inc (AKA: Fast-Talk Communications Inc)

3565 Piedmont Road Building 2 Suite 400
Atlanta, GA 30305

(404) 495-7220

barnold@nexidia.com

www.nexidia.com

Location: Multiple
Congr. District: 05
County: Fulton

Phase I

Contract Number: 2005
Start Date: ---- Completed: 1/1/2005

Phase I year

2005

Phase I Amount

$99,999

This Small Business Innovation Research Phase I research project will perform the research and development necessary to greatly enhance the information retrieval capability of a fast phonetic word-spotter. The completed research will lead to new methods for spoken document retrieval and classification on low quality telephony audio or multimedia digital sources. Spoken document retrieval has been a well-researched problem in the domain of broadcast news. However, many applications exist where users must retrieve and classify documents with lower quality audio. The most commonly applied method involves converting an audio stream or file into a hypothesized sequence of words (Speech-to-Text or STT), and subsequently using text- based information retrieval. Although this has been shown to be effective for broadcast news document retrieval, this has drawbacks. For example, STT's explicit use of language models limits the hypothesized word sequences to those within its lexicon. On the other hand, phonetic matching is capable of identifying likely instances of keywords, such as names, which are not in a lexicon. One advantage of the STT approach is the applicability of text-based information retrieval methods, which work well on high quality audio where the error rates are fairly small. However, better solutions are necessary over a high volume telephony channel where the computational burden and low accuracy make STT impractical. The goal of the proposed project is to research and develop phonetic-based document retrieval and classification algorithms. The applicability of retrieval systems based on phonetic searches will be compared on large existing corpora. The key innovation of the proposed research is to adapt search techniques to function in environments where audio exists, but text does not. Scientifically, algorithms must be made to work in a probabilistic framework, since phonetic word spotting is always based on confidence measures. Commercially, existing multimedia or audio archives will be available for data mining. In addition, decisions of document type (e.g., was the phone call to the call center a complaint?) open commercial applications in market intelligence, security analysis, quality analysis, and any call segregation application.

Phase II

Contract Number: 0441492
Start Date: 6/30/2005 Completed: 00/00/00

Phase II year

----

Phase II Amount

----

SBIR-STTR Award

Tools for Information Retrieval and Document Classification Using Fast Phonetic Word-Spotting Technology
Award last edited on: 3/25/2024

Sponsored Program

Awarding Agency

Total Award Amount

Award Phase

Solicitation Topic Code

Principal Investigator

Company Information

Nexidia Inc (AKA: Fast-Talk Communications Inc)

Phase I

Phase I year

Phase I Amount

Phase II

Phase II year

Phase II Amount

New To Inknowvation.com?

SBIR-STTR Award

Tools for Information Retrieval and Document Classification Using Fast Phonetic Word-Spotting TechnologyAward last edited on: 3/25/2024

Sponsored Program

Awarding Agency

Total Award Amount

Award Phase

Solicitation Topic Code

Principal Investigator

Company Information

Nexidia Inc (AKA: Fast-Talk Communications Inc)

Phase I

Phase I year

Phase I Amount

Phase II

Phase II year

Phase II Amount

Tools for Information Retrieval and Document Classification Using Fast Phonetic Word-Spotting Technology
Award last edited on: 3/25/2024