Massive numbers of video clips are generated daily on many types of consumer electronics and uploaded to the internet. In contrast to videos that are produced for broadcast or from planned surveillance, the "unconstrained" video clips produced by anyone who has a digital camera present a significant challenge for manual as well as automated analysis.
The Aladdin Video Program seeks to combine the state-of-the-art in video extraction, audio extraction, knowledge representation, and search technologies in a revolutionary way to create a fast, accurate, robust, and extensible technology that supports the multimedia analytic needs of the future.
Performers (Prime Contractors)
Carnegie Mellon University; Charles Stark Draper Laboratory, Inc.; IBM - T.J. Watson Research Center; Kitware, Inc.; Raytheon BBN Technologies; Sarnoff Corporation ; SRI International
- Computer vision
- Natural language processing
- Image processing
- Big data
- Video analytics
- Machine learning
- Speech processing
To access Aladdin Video program-related publications, please enter the following into a Google Scholar search query: "D11PC20065 OR D11PC20066 OR D11PC20067 OR D11PC20068 OR D11PC20069 OR D11PC20070 OR D11PC20071"