SBIR-STTR Award

Accelerated in-storage analysis of multi-dimensional data
Award last edited on: 3/9/2024

Sponsored Program
SBIR
Awarding Agency
DOC : NOAA
Total Award Amount
$800,000
Award Phase
2
Solicitation Topic Code
9.1
Principal Investigator
Donpaul Stephens

Company Information

AirMettle Inc

2700 Post Oak Boulevard 21st Floor
Houston, TX 77056
(646) 872-2124
info@airmettle.com
www.airmettle.com
Location: Single
Congr. District: 07
County: Harris

Phase I

Contract Number: NA22OAR0210591
Start Date: 9/1/2022    Completed: 12/31/2022
Phase I year
2022
Phase I Amount
$150,000
The massive volumes of multi-dimensional array-oriented data generated by NOAA programs and the scientific community at large are predominantly stored in the industry-standard Network Common Data Form (netCDF). Key challenges exist in making use of data stored in netCDF: data sets are often too large to be copied and transferred across networks for every user, and each time data is accessed by an analytics tool it must be retrieved, subsets extracted, and subsequently formatted, among other requirements, steps that can account for 80-90% of the total time to insight. To unlock the enormous potential of petabyte-scale netCDF-formatted data stored at different locations, in this SBIR Phase I project AirMettle, Inc., with its research partners from the University of Wisconsin-Madison, proposes to explore the feasibility of integrating in-situ analysis capabilities for multi-dimensional data (netCDF) into our highly innovative real-time smart data lake solution. Dramatically accelerated data analytics performed at the storage layer addresses key challenges noted within the Climate Adaptation and Mitigation Topic. Reducing data traffic between sites, shrinking required compute resources, and lowering costs – all while accelerating climate analyses by an order of magnitude – would bring great benefit to NOAA and the broader scientific community.

Phase II

Contract Number: NA23OAR0210342
Start Date: 8/1/2024    Completed: 7/31/2025
Phase II year
2023
Phase II Amount
$650,000
AirMettle Inc. is transforming big data analytics for NOAA and the broader scientific community with a real-time smart data lake solution. Our innovative method utilizes massively parallel in-storage data processing within a versatile software-defined storage framework, deployable on-premises or as a cloud-based service. Building upon our successful NOAA SBIR Phase I project, we strive to improve the handling of large, multi-dimensional NetCDF4 datasets vital to climate and weather forecasting. This project incorporates on-demand rescaling, allowing users to load only the required data, at their desired resolution, directly from the storage service. We will validate the benefits for climatologists and enhance the solution's commercial robustness. Our goal is to accelerate basic operations by 100x and reduce the data retrieved from storage by over 10x for typical requests. The potential commercial applications span meteorology and climatology across various sectors, making it easier and faster for experts to access and analyze crucial climate and weather data. AirMettle's cutting-edge data lake solution is poised to revolutionize big data analytics and garner widespread interest from both public and private stakeholders.
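
Illustrative sketch (not part of the award record, and not AirMettle's implementation): the toy Python example below shows why subsetting and rescaling on the storage side could shrink what a client has to retrieve. The function names, the 8x8 grid, and the block-averaging method are all assumptions chosen for illustration; the abstract does not say how rescaling is performed.

```python
def coarsen(grid, factor):
    """Block-average a 2-D grid (list of equal-length rows) by an integer factor."""
    rows, cols = len(grid), len(grid[0])
    if rows % factor or cols % factor:
        raise ValueError("grid dimensions must be divisible by factor")
    out = []
    for r in range(0, rows, factor):
        out_row = []
        for c in range(0, cols, factor):
            # Collect the factor x factor block and replace it with its mean.
            block = [grid[i][j]
                     for i in range(r, r + factor)
                     for j in range(c, c + factor)]
            out_row.append(sum(block) / len(block))
        out.append(out_row)
    return out


def subset_and_rescale(grid, row_range, col_range, factor):
    """Hypothetical storage-side request: cut a window, then coarsen it."""
    r0, r1 = row_range
    c0, c1 = col_range
    window = [row[c0:c1] for row in grid[r0:r1]]
    return coarsen(window, factor)


# Hypothetical 8x8 field standing in for one netCDF variable.
field = [[r * 10 + c for c in range(8)] for r in range(8)]

# Request the top-left 4x4 window at half resolution.
result = subset_and_rescale(field, (0, 4), (0, 4), 2)

# Ratio of values stored to values actually returned to the client.
reduction = (len(field) * len(field[0])) / (len(result) * len(result[0]))
print(result, reduction)
```

In this toy case the client receives 4 values instead of 64. The real reduction depends entirely on the request and the dataset, so this ratio should not be read as evidence for the 10x goal stated above.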