Knowledge Discovery and Dissemination (KDD)

Program Manager

For information contact:

Program Information

IARPA Day Poster

The objective of KDD is to enable analysts to quickly produce actionable intelligence from multiple, disparate data sources, including new, unanticipated data sets that become available to analysts. To meet this objective, KDD has developed multiple solutions to the challenges of large-scale data alignment efforts and advanced analytics capabilities.

The KDD program has three thrusts:

  • Research in data alignment to quickly align the terminology and organization of new data sources to the analytic data model.
  • Research in analytics to develop flexible algorithms that work across heterogeneous data sets.
  • Engineer a prototype that contains the research products so that they can be assessed in a realistic IC environment.

The KDD research development approach is designed to ensure that efforts are relevant to the Intelligence Community (IC) and perform well on real data. Each year KDD teams deliver their research as software algorithms integrated into prototype systems. The prototypes are tested by IC analysts using new real data sets and realistic analytic challenge problems not previously seen by the teams. This ensures that the technology is quickly adaptable to new data and new problems. To meet flexibility objectives, research components are loosely coupled in the prototype systems to provide a modular approach for evaluating individual technologies.

The KDD program began in October 2010 with research teams composed of academic and commercial organizations. A base year evaluation in 2011 was used to exercise the prototype systems, rehearse the evaluation process and give researchers insight into IC analysis. Formal evaluations were conducted in 2012 and 2013 with results showing significant progress. The evaluation is designed to mature development of analytic tools and make them suitable for transition to operational environments.

In 2014, the program is focused on transition of technologies to IC partners and evaluation against partner data. KDD will work with transition partners to identify KDD candidate research components to fill gaps in partner analytic capabilities.

Performers (Prime Contractors)

BAE Systems Advanced Information Technologies; CUBRC; Leidos, Inc.; SRI International; Telcordia Technologies, Inc.

Research Area(s)

  • Ontologies
  • Alignment
  • Information extraction
  • Clustering
  • Natural language processing
  • Social network analysis
  • Summarization
  • Query expansion
  • Document similarity
  • Machine learning

Related Publications

To access KDD program-related publications, please visit Google Scholar.

Related Article(s)