Babel

Summary

The Babel Program is developing agile and robust speech recognition technology that can be rapidly applied to any human language in order to provide effective search capability for analysts to efficiently process massive amounts of real-world recorded speech. Today’s transcription systems are built on technology that was originally developed for English, with markedly lower performance on non-English languages. These systems have often taken years to develop and cover only a small subset of the languages of the world. Babel intends to demonstrate the ability to generate a speech transcription system for any new language within one week to support keyword search performance for effective triage of massive amounts of speech recorded in challenging real-world situations.

The goal of the Babel Program is to develop methods to build speech recognition technology for a much larger set of languages than has hitherto been addressed. The Program requires innovations in how to rapidly model a novel language with significantly less training data that are also much noisier and more heterogeneous than what has been used in the current state-of-the-art. Babel's technical measures of success are focused on how well the generated model works to support effective word-based search of noisy channel speech in the languages to be investigated. The new methods are being systematized so that they can be applied rapidly to a novel underserved language.

Related Publications

To access Babel program-related publications, please visit Google Scholar.

To access the OpenKWS website, click here.

Contact Information

Program Manager

Main Office

dni-iarpa-info@iarpa.gov

301-243-1995

Related Program(s)

MATERIAL

Solicitation Status

BROAD AGENCY ANNOUNCEMENT (BAA)
CLOSED

Link(s)

IARPA-BAA-11-02

Proposers' Day Date

January 20, 2011

Release Date

April 7, 2011

Question Period

April 7, 2011

Proposal Due Date

June 7, 2011

Prime Performers

Carnegie Mellon University
IBM - T.J. Watson Research Center
Raytheon BBN Technologies
University of California, Berkeley - International Computer Science Institute

Additional Information

IARPA-BAA-11-02 Q&A

Babel IARPA Day Poster