Paper
15 May 2015 A scalable architecture for extracting, aligning, linking, and visualizing multi-Int data
Craig A. Knoblock, Pedro Szekely
Author Affiliations +
Abstract
An analyst today has a tremendous amount of data available, but each of the various data sources typically exists in their own silos, so an analyst has limited ability to see an integrated view of the data and has little or no access to contextual information that could help in understanding the data. We have developed the Domain-Insight Graph (DIG) system, an innovative architecture for extracting, aligning, linking, and visualizing massive amounts of domain-specific content from unstructured sources. Under the DARPA Memex program we have already successfully applied this architecture to multiple application domains, including the enormous international problem of human trafficking, where we extracted, aligned and linked data from 50 million online Web pages. DIG builds on our Karma data integration toolkit, which makes it easy to rapidly integrate structured data from a variety of sources, including databases, spreadsheets, XML, JSON, and Web services. The ability to integrate Web services allows Karma to pull in live data from the various social media sites, such as Twitter, Instagram, and OpenStreetMaps. DIG then indexes the integrated data and provides an easy to use interface for query, visualization, and analysis.
© (2015) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Craig A. Knoblock and Pedro Szekely "A scalable architecture for extracting, aligning, linking, and visualizing multi-Int data", Proc. SPIE 9499, Next-Generation Analyst III, 949907 (15 May 2015); https://doi.org/10.1117/12.2177119
Lens.org Logo
CITATIONS
Cited by 4 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Data integration

Visualization

Data modeling

Databases

Analytical research

Human-machine interfaces

Associative arrays

RELATED CONTENT


Back to Top