SBIR-STTR Award

An integrated web interface for cloud-based computational reproducibility using the ENCODE analysis pipeline architecture
Award last edited on: 2/28/2024

Sponsored Program
STTR
Awarding Agency
NIH : NHGRI
Total Award Amount
$149,959
Award Phase
1
Solicitation Topic Code
172
Principal Investigator
Karl Sebby

Company Information

XD Bio Inc

210 Parkhill Drive
Whitefish, MT 59937
   (214) 748-3647
   info@xdbio.com
   www.xdbio.com

Research Institution

Stanford University

Phase I

Contract Number: 1R41HG010844-01
Start Date: 9/16/2019    Completed: 3/15/2021
Phase I year
2019
Phase I Amount
$149,959
This project will develop and test a publicly available web interface for executing and monitoring genomic data analysis pipelines described in the Workflow Description Language (WDL). The interface will be hosted on Truwl™ (https://truwl.com), xD Bio™'s community-oriented web application for sharing genomic data analysis methods. Pipeline execution will leverage the reproducibility framework developed by researchers at the Encyclopedia of DNA Elements (ENCODE) consortium Data Coordinating Center, which simplifies running pipelines in cloud environments. The project aims to make analysis methods accessible and usable to the genomics community and to enable biomedical researchers with limited or no computational expertise to analyze genomic data easily. Briefly, Truwl will be extended with a new web interface that provides dynamic forms for specifying pipeline inputs, a backend service will be created to execute and monitor compute jobs, and Google Cloud Platform compute instances will be created and configured with the ENCODE reproducibility framework to execute those jobs. The ENCODE Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) pipeline will serve as the test case for executing pipelines from the new web interface. A range of datasets and input parameters will be used to test the capabilities of the system. This will demonstrate running an ENCODE pipeline from a publicly available web interface for the first time and provide a pattern for making any well-described genomic data analysis pipeline widely available and usable to the biomedical research community.
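As a rough illustration of the "dynamic forms for specifying pipeline inputs" step described above, the following minimal Python sketch turns submitted form values into a WDL inputs JSON document, where each key is prefixed with the workflow name. The function name `build_wdl_inputs`, the workflow name `atac`, and the field names are assumptions chosen for illustration; they are not taken from the award record and do not describe how Truwl is actually implemented.

```python
import json


def build_wdl_inputs(workflow_name, form_values):
    """Map form fields to WDL inputs JSON keys of the form '<workflow>.<input>'.

    Hypothetical helper: blank optional fields are dropped so the
    pipeline's own defaults apply.
    """
    inputs = {}
    for field, value in form_values.items():
        if value is None or value == "":
            continue  # skip optional fields the user left blank
        inputs[f"{workflow_name}.{field}"] = value
    return inputs


if __name__ == "__main__":
    # Illustrative form submission; field names are assumptions.
    form_values = {
        "title": "example run",
        "paired_end": True,
        "description": "",
    }
    inputs = build_wdl_inputs("atac", form_values)
    print(json.dumps(inputs, indent=2, sort_keys=True))
```

A backend service could write the resulting dictionary to an inputs file and hand it to whatever workflow runner executes the job; that step is left out here because the record does not describe it.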

Public Health Relevance Statement:
Narrative: Genomics is key to understanding biology and disease. This project will contribute to the understanding of how the genome works by expanding the capabilities of biomedical researchers to effectively analyze genomic data.

Phase II

Contract Number: ----------
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
----
Phase II Amount
----