SBIR-STTR Award

Advancing Hyperspectral Data Cube Query Capabilities through Apache Spark DataFrame Abstraction
Award last edited on: 3/27/2023

Sponsored Program
SBIR
Awarding Agency
NASA : JPL
Total Award Amount
$874,999
Award Phase
2
Solicitation Topic Code
S5.02
Principal Investigator
Rob Emanuele

Company Information

Azavea Inc (AKA: Avencia Incorporated)

990 Spring Garden Street 5th floor
Philadelphia, PA 19123
   (215) 925-2600
   info@azavea.com
   www.azavea.com
Location: Single
Congr. District: 02
County: Philadelphia

Phase I

Contract Number: 80NSSC19C0523
Start Date: 8/19/2019    Completed: 2/18/2020
Phase I year
2019
Phase I Amount
$125,000
The goal of this research is to accelerate the use of NASA’s hyperspectral data by combining powerful open source tools into an easily understood workflow that will distribute the computation of hyperspectral data across clusters of machines hosted in public cloud infrastructure, while also leveraging low cost object storage in order to maximize accessibility and use. Hyperspectral data collects and processes information across the electromagnetic spectrum, dividing the spectrum into many more bands than are visible to the human eye. The collected images are combined to form a three-dimensional data cube, where two spatial dimensions of the same scene are joined by a third dimension comprised of a range of spectral wavelengths. These data cubes are capable of supporting many surface biology and geology applications, with particular potential for improving the discovery and management of energy, mineral, and soil resources. That said, the ability to efficiently process hyperspectral data is currently limited by the complexity of the data itself, and the inability to present and store it in formats and public cloud environments useful to data scientists. In addition to building standalone tools that can be used independently, this project will enable efficient consumption, reformatting, and processing of hyperspectral datasets in the NASA-funded Raster Foundry platform, an open source solution for finding, analyzing, and publishing geospatial imagery on the web. In so doing, the proposed research will build on previous agency investments that are making additional remotely sensed imagery from NASA and other public resources more broadly accessible for global application to contemporary geospatial challenges. Furthermore, it will provide access to a rich ecosystem of image processing and scientific computing libraries that will support data science studies in industries ranging from precision agriculture to security and defense. Potential NASA Applications (Limit 1500 characters, approximately 150 words) The proposed research provides potential benefit for the Goddard Space Flight Center and Jet Propulsion Lab, where hyperspectral data tools could support the following missions: Hyperion, Hyperspectral Infrared Imager (HyspIRI), Airborne Visible/Infrared Imaging Spectrometer (AVIRIS), and the planned Mapping Imaging Spectrometer for Europa (MISE). Through its use of GeoTrellis, Raster Foundry, and other open source tools, the work also addresses NASA’s need for robust software solutions supported by open source communities. Potential Non-NASA Applications (Limit 1500 characters, approximately 150 words) Three primary markets have already demonstrated significant interest in hyperspectral data analysis tools: large prime contractors, international nonprofit and multilateral development organizations, and the oil and natural gas industry. Potential use cases include change and anomaly detection, pipeline safety and maintenance, humanitarian relief, and disaster recovery.

Phase II

Contract Number: 80NSSC20C0183
Start Date: 7/6/2020    Completed: 7/5/2022
Phase II year
2020
Phase II Amount
$749,999
The goal of this research is to accelerate the use of NASA’s hyperspectral data by combining powerful open source tools into an easily understood workflow that will distribute the computation of hyperspectral data across clusters of machines hosted in public cloud infrastructure, while also leveraging low cost object storage in order to maximize accessibility and use. Hyperspectral data collects and processes information across the electromagnetic spectrum, dividing the spectrum into many more bands than are visible to the human eye. The collected images are combined to form a three-dimensional data cube, where two spatial dimensions of the same scene are joined by a third dimension comprised of a range of spectral wavelengths. These data cubes are capable of supporting many surface biology and geology applications, with particular potential for improving the discovery and management of energy, mineral, and soil resources. That said, the ability to efficiently process hyperspectral data is currently limited by the complexity of the data itself, and the inability to present and store it in formats and public cloud environments useful to data scientists. In addition to building standalone tools that can be used independently, this project will enable efficient consumption, reformatting, and processing of hyperspectral datasets in the NASA-funded Raster Foundry platform, an open source solution for finding, analyzing, and publishing geospatial imagery on the web. In so doing, the proposed research will build on previous agency investments that are making additional remotely sensed imagery from NASA and other public resources more broadly accessible for global application to contemporary geospatial challenges. Furthermore, it will provide access to a rich ecosystem of image processing and scientific computing libraries that will support data science studies in industries ranging from precision agriculture to security and defense. Potential NASA Applications (Limit 1500 characters, approximately 150 words) The proposed research provides potential benefit for the Goddard Space Flight Center and Jet Propulsion Lab, where hyperspectral data tools could support the following missions: the planned Hyperspectral Infrared Imager (HyspIRI), Airborne Visible/Infrared Imaging Spectrometer (AVIRIS), and the planned Mapping Imaging Spectrometer for Europa (MISE). Through its use of Raster Vision, Franklin, Raster Foundry, and other open source tools, the work also addresses NASA’s need for robust software solutions supported by open source communities. Potential Non-NASA Applications (Limit 1500 characters, approximately 150 words) Several primary markets have already demonstrated significant interest: commercial satellite and aerial imagery data users/customers, large prime contractors, the oil and natural gas industry, insurance industry, and international multilateral development organizations. Potential use cases include oil spill detection, pipeline safety/maintenance, tree mortality analysis, and fire risk modeling.