SBIR-STTR Award

Large Data Handling Architecture, Designing Large Data Handling Architectures
Award last edited on: 6/5/2023

Sponsored Program
SBIR
Awarding Agency
DOD : OSD
Total Award Amount
$3,091,311
Award Phase
2
Solicitation Topic Code
OSD09-SP4
Principal Investigator
Marc Abrams

Company Information

Harmonia Holdings Group LLC (AKA: Harmonia Inc)

2020 Kraft Drive Suite 1000
Blacksburg, VA 24060
   (540) 951-5900
   information@harmonia.com
   www.harmonia.com
Location: Multiple
Congr. District: 09
County: Montgomery

Phase I

Contract Number: N00014-10-M-0080
Start Date: 1/5/2010    Completed: 7/20/2010
Phase I year
2010
Phase I Amount
$95,112
We focus on a solution for ultra-scale data handling, into the terabyte range and beyond. Our solution leverages the latest advances in storage technology, and can be used in a cloud, to exploit cloud-native features. We address the following functional requirements

Keywords:
Service Oriented Architecture, Ultra-Scale Data, Sensor Networks, Net-Centric, Gig, Data Sharing, Meta-Data, Ontology

Phase II

Contract Number: N00014-11-C-0523
Start Date: 9/13/2011    Completed: 9/13/2013
Phase II year
2011
(last award dollars: 2021)
Phase II Amount
$2,996,199

Harmonia proposes to continue development from Phase I of a Large Data Handling Architecture (LDHA). Given sufficient amounts of commodity hardware, its goal is to scale up to ingest a terabyte or more of data per hour from each open source or sensor; store tens of thousands and more of terabyte files; and support operations on databases that use complex structures as table cells, sparse tables, and billions or more rows and a million or more columns. Our implementation uses the Ubuntu server Linux distribution, Hadoop core, HBase database and Chukwa data collector. We implement ingesters as Java Servlets to collect continuously and automatically pass for insertion into Hadoop a variety of open sources including, but not limited to structured and unstructured text, sensor data, images, and streaming video, experimenting with real continuous feeds available on the Internet. We develop end user programming tools that empower analysts to define analysis tasks for distributed execution even though they do not have a programming background in MapReduce. Our architecture is service oriented, high performance, extensible, and permits integration with the DoD Global Information Grid. We support disadvantaged users through services that allow downloading of data. We do extensive performance assessment of bottlenecks.

Keywords:
Large-Scale Data, Sensor Networks, Net-Centric, Gig, Data Sharing, Meta-Data, Ontology, Hadoop