Defense Advanced Research Projects AgencyTagged Content List

Data Analysis at Massive Scales

Extracting information and insights from massive datasets; "big data"; "data mining"

Showing 5 results for Data + Imagery RSS
10/11/2017
The rapid pace of new commercial satellite constellation launches has led to a significant increase in the amount and availability of geospatial imagery. Unfortunately, no straightforward way currently exists for analysts to access and analyze all of that imagery. The current ad hoc, time-intensive approach requires gathering and curating data from a large number of available sources, downloading it to specific locations, and running it through separate suites of analytics tools.
The United States Government has an interest in developing and maintaining a strategic understanding of events, situations, and trends around the world, in a variety of domains. The information used in developing this understanding comes from many disparate sources, in a variety of genres, and data types, and as a mixture of structured and unstructured data. Unstructured data can include text or speech in English and a variety of other languages, as well as images, videos, and other sensor information.
The Geospatial Cloud Analytics (GCA) program is developing technology to rapidly access the most up-to-date commercial and open-source satellite imagery, as well as automated machine learning tools to analyze this data. Current approaches to geospatial analysis are ad hoc and time intensive, as they require gathering and curating data from a large number of available sources, downloading the data to specific locations, and running it through separate suites of analytics tools.
The U.S. Government operates globally and frequently encounters so-called “low-resource” languages for which no automated human language technology capability exists. Historically, development of technology for automated exploitation of foreign language materials has required protracted effort and a large data investment. Current methods can require multiple years and tens of millions of dollars per language—mostly to construct translated or transcribed corpora.
Adversaries often take photos and videos to claim responsibility for events or to illustrate capabilities. This media is sometimes confiscated by the DoD from a variety of devices, including laptops, cellphone cameras and memory cards. The volume of this visual media is quickly outpacing our ability to review, let alone analyze the contents of every image.
| Data | Imagery | ISR |