SBIR-STTR Award

Robust Medical Data Aggregation to Enable Advanced Approaches to Precision Medicine
Award last edited on: 3/31/2022

Sponsored Program
SBIR
Awarding Agency
NSF
Total Award Amount
$1,816,568
Award Phase
2
Solicitation Topic Code
SH
Principal Investigator
Ganapati Srinivasa

Company Information

Omics Data Automation Inc (AKA: ODA)

12655 Beaverdam Road
Beaverton, OR 97005
   (503) 475-6660
   info@omicsautomation.com
   www.omicsautomation.com
Location: Single
Congr. District: 01
County: Washington

Phase I

Contract Number: 1721343
Start Date: 7/1/2017    Completed: 6/30/2018
Phase I year
2017
Phase I Amount
$224,903
The broader impact/commercial potential of this Small Business Innovation Research (SBIR) Phase I project is to enhance the impact of precision medicine by simultaneously addressing large-scale medical data aggregation and optimized computation that is cost-effective and to extend the utility of medical informatics well beyond current practice. Patient medical information comes in many diverse forms: genomic sequences, medical images, and clinical observations. The integration of these various data sources across patient populations have shown to reveal patterns and similarities among patients, which inform treatment options. With advances in imaging and genomic sequencing technologies, the sheer volume of available information is growing exponentially, straining current computational approaches, and creating an imminent need for scalable data integration. The ability to overcome this data mountain opens the door to support advanced analytics to support precision medicine and provide enhanced services to medical institutions. With these innovations, patients receive faster and more accurate diagnoses and treatments, clinicians deliver verified treatment decisions through patient cohort comparison, hospitals have better standard of care, and society is overall empowered by supporting global treatment options and well informed pharmaceutical development.The proposed project will develop a scalable aggregation and analysis framework to integrate various patient data modalities to inform personalized diagnosis and therapy in precision medicine. Currently, information from different modalities exists in silos, hindering joint analysis and insight. While there has been research trying to leverage machine learning techniques in medical imaging, these efforts have generally focused on a single domain and not been able to integrate facts from other domains. This project will aggregate features from genomics, imaging and clinical characterization of patients into scalable databases and then use a distributed, parallel framework to enable efficient analytics on the resultant joint representation. The resulting platform will enable identification of cohorts based on both genotypes and phenotypes and empower powerful machine learning analyses to inform clinical decision systems or identification of new personalized therapies.

Phase II

Contract Number: 1831085
Start Date: 8/15/2018    Completed: 4/30/2022
Phase II year
2018
(last award dollars: 2021)
Phase II Amount
$1,591,665

The broader impact/commercial potential of this Small Business Innovation Research (SBIR) Phase II project is to increase the impact of precision medicine by simultaneously addressing large-scale medical data aggregation and analytics. Patient medical information comes in many forms: DNA sequences, medical images, clinical observations, etc. Integration of these various data sources across large patient populations can reveal novel insights into clinical disease and inform new treatment options. There is a growing need for scalable data integration to support the delivery of precision medicine. This Small Business Innovation Research (SBIR) Phase II project will build on the data Framework that was created during Phase I work. This Framework supports the aggregation of different types of medical data into a single data core ?pool?, and computation on the pooled data in a scalable cloud environment. The Framework includes a custom database for genomic sequencing data. During Phase II of this project, this custom (sparse matrix) approach to data storage will be adapted to handle microscopy images of cancer and other diseased tissues. The Framework, with its integrated genomic and image databases, will be marketed to healthcare systems, biotechnology companies, and pharmaceutical companies interested in applying advanced analytics on large, complex datasets. This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.