SBIR-STTR Award

Software For The Accurate De Novo 3d Structure Prediction Of Rna
Award last edited on: 10/9/12

Sponsored Program
SBIR
Awarding Agency
NIH : NIGMS
Total Award Amount
$847,762
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
John Santalucia

Company Information

DNA Software Inc

334 East Washington Street
Ann Arbor, MI 48104
   (734) 222-9080
   N/A
   www.dnasoftware.com
Location: Single
Congr. District: 12
County: Washtenaw

Phase I

Contract Number: 1R44GM095251-01
Start Date: 9/1/10    Completed: 2/28/11
Phase I year
2010
Phase I Amount
$99,586
The discovery of new and interesting RNA sequences from genome sequencing projects, and the urgency to unravel their functions, has led to a dramatic push for novel structural determination techniques. Current experimental methods for three dimensional structure determination of nucleic acids such as x-ray crystallography and NMR cannot keep pace with the day to day discovery of sequences that need representative structures to be solved or modeled. Thus, there is a clear need to develop tools for 3D structure prediction given only the primary sequence and when available, experimental constraint information. Compared to proteins, RNA structure prediction has received limited resources, and only recently has the field gained attention by the scientific community. As such, RNA prediction has largely relied on protein prediction methodologies despite the vast intrinsic differences between proteins and nucleic acids. Although many of these tools have shown significant advances in the prediction quality, they have also demonstrated low reliability and are often limited to prediction of very small RNAs. In addition, the majority are either manual or semi-automated, which requires an experienced user and a variety of intermediate software packages. To address such concerns, DNA Software, Inc. (DNAS) has developed an RNA homology modeling software, NA-CAD (Nucleic Acid Computer Aided Design) that has a unique force field specifically optimized for RNA. This tool has demonstrated success in homology modeling of large RNA-protein complexes such as the small ribosomal subunit of Pseudomonas aeruginosa. We would now like to extend NA-CAD to include a component for de novo structure prediction. The force field in NA-CAD and the free energy based secondary structure prediction algorithm in our flagship software product Visual OMP provide an advantageous starting point for developing a unified tool that can accurately predict de novo the tertiary structure of medium to large RNA targets. This proposal addresses the engineering of novel algorithms for handling difficult structural motifs such as multiloops, pseudoknots, and multiple domains and incorporating experimental constraints to improve prediction quality. Additionally, coarse-grained models for representing RNA residues and accelerated classical molecular dynamics simulations will be implemented to increase conformational sampling in a tractable computational time frame. , ,

Public Health Relevance:
We propose to develop an accurate, fast, and unified de novo structure prediction tool optimized for medium to large sized RNAs. This tool will generate valuable structural models that will help elucidate the functions of RNAs that do not have solved crystal or NMR structures. The proposed de novo tool will be incorporated into our homology modeling software, NA-CAD, so that it will be able to generate three-dimensional homology models of pharmaceutically relevant RNA targets and to model potential drug-resistant mutants, which will be beneficial to researchers involved in structure-based drug discovery.

Thesaurus Terms:
3' Untranslated Regions;3'utr;3-D Structure;3-Dimensional Structure;3d Modeling;3d Structure;Address;Algorithms;Attention;Biochemical;Catalytic Rna;Cereals;Chemicals;Code;Coding System;Communities;Computer Architectures;Computer Programs;Computer Software;Computer-Aided Design;Computer-Assisted Design;Coupled;Crystallographies;Crystallography;Dna;Data Banks;Data Bases;Data Set;Databank, Electronic;Databanks;Database, Electronic;Databases;Dataset;Deoxyribonucleic Acid;Development;Drug Resistance;E Coli;Engineering;Engineerings;Escherichia Coli;Free Energy;Functional Rna;Gene Action Regulation;Gene Expression Regulation;Gene Products, Rna;Gene Regulation;Gene Regulation Process;Generations;Goals;Grain;Homology Modeling;Human Genome;Hydrogen Bonding;In Vitro;Intervening Sequences;Introns;Investigators;Knowledge;Length;Libraries;Link;Macromolecular Structure;Manuals;Messenger Rna;Method Loinc Axis 6;Methodology;Methods;Methods And Techniques;Methods, Other;Metric;Modeling;Modification;Molecular Configuration;Molecular Conformation;Molecular Dynamics Simulation;Molecular Stereochemistry;Molecular Structure;Non-Coding;Non-Coding Rna;Nucleic Acids;Nucleotides;P. Aeruginosa;P.Aeruginosa;Performance;Phase;Phe-Trna;Physics;Process;Programs (Pt);Programs [publication Type];Proteins;Pseudomonas Aeruginosa;Pseudomonas Pyocyanea;Publishing;Rna;Rna Sequences;Rna, Messenger;Rna, Non-Polyadenylated;Research Personnel;Research Resources;Researchers;Resolution;Resources;Ribonucleic Acid;Ribosomal Rna;Ribosomes;Ribozymes;Running;Sampling;Series;Side;Simulate;Small Rna;Software;Spinal Column;Spine;Structural Models;Structure;Techniques;Testing;Time;Torsion;Torsion (Malposition);Validation;Vertebral Column;Visual;Work;Yeast Model System;Yeastmodel;Yeasts;Aptamer;Backbone;Base;Blind;Clinical Data Repository;Clinical Data Warehouse;Computer Program/Software;Conformation;Conformational State;Data Repository;Develop Software;Developing Computer Software;Drug Discovery;Drug Resistant;Electron Density;Experience;Experiment;Experimental Research;Experimental Study;Flexibility;Gene Product;Genome Sequencing;High End Computer;Improved;Interest;Knowledge Base;Mrna;Molecular Dynamics;Mutant;Network Architecture;Novel;Phenylalanine-Trna;Phenylalanyl-Trna;Programs;Protein Complex;Protein Structure;Prototype;Public Health Relevance;Rrna;Relational Database;Research Study;Resistance To Drug;Resistant To Drug;Restraint;Software Development;Success;Trna, Phenylalanine-;Three Dimensional Structure;Three-Dimensional Modeling;Tool;Validation Studies

Phase II

Contract Number: 4R44GM095251-02
Start Date: 9/1/10    Completed: 9/30/12
Phase II year
2011
(last award dollars: 2012)
Phase II Amount
$748,176

The discovery of new and interesting RNA sequences from genome sequencing projects, and the urgency to unravel their functions, has led to a dramatic push for novel structural determination techniques. Current experimental methods for three dimensional structure determination of nucleic acids such as x-ray crystallography and NMR cannot keep pace with the day to day discovery of sequences that need representative structures to be solved or modeled. Thus, there is a clear need to develop tools for 3D structure prediction given only the primary sequence and when available, experimental constraint information. Compared to proteins, RNA structure prediction has received limited resources, and only recently has the field gained attention by the scientific community. As such, RNA prediction has largely relied on protein prediction methodologies despite the vast intrinsic differences between proteins and nucleic acids. Although many of these tools have shown significant advances in the prediction quality, they have also demonstrated low reliability and are often limited to prediction of very small RNAs. In addition, the majority are either manual or semi-automated, which requires an experienced user and a variety of intermediate software packages. To address such concerns, DNA Software, Inc. (DNAS) has developed an RNA homology modeling software, NA-CAD (Nucleic Acid Computer Aided Design) that has a unique force field specifically optimized for RNA. This tool has demonstrated success in homology modeling of large RNA-protein complexes such as the small ribosomal subunit of Pseudomonas aeruginosa. We would now like to extend NA-CAD to include a component for de novo structure prediction. The force field in NA-CAD and the free energy based secondary structure prediction algorithm in our flagship software product Visual OMP provide an advantageous starting point for developing a unified tool that can accurately predict de novo the tertiary structure of medium to large RNA targets. This proposal addresses the engineering of novel algorithms for handling difficult structural motifs such as multiloops, pseudoknots, and multiple domains and incorporating experimental constraints to improve prediction quality. Additionally, coarse-grained models for representing RNA residues and accelerated classical molecular dynamics simulations will be implemented to increase conformational sampling in a tractable computational time frame.

Public Health Relevance:
We propose to develop an accurate, fast, and unified de novo structure prediction tool optimized for medium to large sized RNAs. This tool will generate valuable structural models that will help elucidate the functions of RNAs that do not have solved crystal or NMR structures. The proposed de novo tool will be incorporated into our homology modeling software, NA-CAD, so that it will be able to generate three-dimensional homology models of pharmaceutically relevant RNA targets and to model potential drug-resistant mutants, which will be beneficial to researchers involved in structure-based drug discovery.

Thesaurus Terms:
3' Untranslated Regions;3'utr;3-D Structure;3-Dimensional Structure;3d Modeling;3d Structure;Address;Algorithms;Attention;Biochemical;Catalytic Rna;Cereals;Chemicals;Code;Coding System;Communities;Computer Architectures;Computer Programs;Computer Software;Computer-Aided Design;Computer-Assisted Design;Coupled;Crystallographies;Crystallography;Dna;Data Banks;Data Bases;Data Set;Databank, Electronic;Databanks;Database, Electronic;Databases;Dataset;Deoxyribonucleic Acid;Development;Drug Resistance;E Coli;Engineering;Engineerings;Escherichia Coli;Free Energy;Functional Rna;Gene Action Regulation;Gene Expression Regulation;Gene Products, Rna;Gene Regulation;Gene Regulation Process;Generations;Goals;Grain;Homology Modeling;Human Genome;Hydrogen Bonding;In Vitro;Intervening Sequences;Introns;Investigators;Knowledge;Length;Libraries;Link;Macromolecular Structure;Manuals;Messenger Rna;Method Loinc Axis 6;Methodology;Methods;Methods And Techniques;Methods, Other;Metric;Modeling;Modification;Molecular Configuration;Molecular Conformation;Molecular Dynamics Simulation;Molecular Stereochemistry;Molecular Structure;Nat;Non-Coding;Non-Coding Rna;Nucleic Acid Amplification Tests;Nucleic Acid Testing;Nucleic Acids;Nucleotides;P. Aeruginosa;P.Aeruginosa;Performance;Phase;Phe-Trna;Physics;Process;Programs (Pt);Programs [publication Type];Proteins;Pseudomonas Aeruginosa;Pseudomonas Pyocyanea;Publishing;Rna;Rna Sequences;Rna, Messenger;Rna, Non-Polyadenylated;Research Personnel;Research Resources;Researchers;Resolution;Resources;Ribonucleic Acid;Ribosomal Rna;Ribosomes;Ribozymes;Running;Sampling;Series;Side;Simulate;Small Rna;Software;Spinal Column;Spine;Structural Models;Structure;Techniques;Testing;Time;Torsion;Torsion (Malposition);Validation;Vertebral Column;Visual;Work;Yeast Model System;Yeastmodel;Yeasts;Aptamer;Backbone;Base;Blind;Clinical Data Repository;Clinical Data Warehouse;Community;Computer Program /Software;Computer Program/Software;Conformation;Conformational State;Data Repository;Develop Software;Developing Computer Software;Drug Discovery;Drug Resistant;Electron Density;Experience;Experiment;Experimental Research;Experimental Study;Flexibility;Gene Product;Genome Sequencing;High End Computer;Hydrogen Bond;Improved;Interest;Knowledge Base;Library;Mrna;Messenger Rna;Model;Molecular Dynamics;Mutant;Network Architecture;Novel;Phenylalanine-Trna;Phenylalanyl-Trna;Programs;Protein Complex;Protein Structure;Prototype;Public Health Relevance;Rrna;Relational Database;Research Study;Resistance To Drug;Resistant To Drug;Restraint;Software Development;Success;Trna, Phenylalanine-;Three Dimensional Structure;Three-Dimensional Modeling;Tool;Validation Studies