Defense Advanced Research Projects AgencyTagged Content List

Analytics for Data at Massive Scales

Extracting information from large data sets

Showing 9 results for Analytics + Language RSS
09/19/2013
Bonnie Dorr (left), program manager in DARPA’s Information Innovation Office (I2O), shakes hands with Henry Kautz, past president of the Association for the Advancement of Artificial Intelligence (AAAI), upon her recent induction as an AAAI Fellow. Each year, AAAI bestows the lifetime honor of Fellow on only a handful of researchers for their exceptional leadership, research and service contributions to the field of artificial intelligence.
04/06/2017
The U.S. government has always had an interest in developing and maintaining a strategic understanding of events, situations, and trends around the world. In recent years, however, information complexity has exceeded the capacity of analysts to glean meaningful or actionable insights as data pours in from disparate sources, across a variety of genres, and a mixture of structured and unstructured forms, from military intelligence to social media to accurate and inaccurate news.
The United States Government has an interest in developing and maintaining a strategic understanding of events, situations, and trends around the world, in a variety of domains. The information used in developing this understanding comes from many disparate sources, in a variety of genres, and data types, and as a mixture of structured and unstructured data. Unstructured data can include text or speech in English and a variety of other languages, as well as images, videos, and other sensor information.
Expanded global access to diverse means of communication is resulting in more information being produced in more languages more quickly than ever before. The volume of information encountered by DoD, the speed at which it arrives, and the diversity of languages and media through which it is communicated make identifying and acting on relevant information a serious challenge. At the same time, there is a need to communicate with non-English-speaking local populations of foreign countries, but it is at present costly and difficult for DoD to do so.
Department of Defense (DoD) operators and analysts collect and process copious amounts of data from a wide range of sources to create and assess plans and execute missions. However, depending on context, much of the information that could support DoD missions may be implicit rather than explicitly expressed. Having the capability to automatically extract operationally relevant information that is only referenced indirectly would greatly assist analysts in efficiently processing data.