SBIR-STTR Award

Toolkit For Whole Transcriptome Analysis Of Neural Tissues
Award last edited on: 7/29/13

Sponsored Program
SBIR
Awarding Agency
NIH : NINDS
Total Award Amount
$1,944,624
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
Jonathan D Buckley

Company Information

Epicenter Software

80 South Lake Avenue Suite 550
Pasadena, CA 91101
   (626) 304-9487
   support@epicentersoftware.com
   www.epicentersoftware.com
Location: Single
Congr. District: 27
County: Los Angeles

Phase I

Contract Number: 1R43NS063540-01A1
Start Date: 00/00/00    Completed: 00/00/00
Phase I year
2008
Phase I Amount
$145,388
Development of an organ as complex as the brain must depend on an intricate interplay of thousands of signaling proteins, orchestrated by an interacting web of regulatory factors. Recent ENCODE data reveal that the large tracts of so-called 'junk' DNA in introns and between genes are, in fact, actively transcribed. The functions of these non-coding transcripts, which number in the millions, are virtually unknown, although one very small sub-class - the microRNAs - is receiving close attention as regulatory RNAs. One process that may be controlled by the non-coding transcripts is alternative exon use - a mechanism that adds significantly to the diversity of cellular proteins (particularly in the CNS), and whose regulation is little understood at this time. Recent technological advances that generate whole-transcriptome data have provided the means to systematically explore both of these factors - the role of non-coding transcripts, and the occurrence and control of splice variation. However, the bioinformatic challenges posed by these technologies are substantial, and the lack of comprehensive, well-designed, and easily used software to manage, visualize, analyze and interpret these data will likely be the limiting factor in this field of research. We propose to develop a bioinformatics toolkit specifically to mine whole-transcriptome data from two fundamentally different technologies: the Affymetrix All Exon microarray, which provides measures of over 1.4 million distinct transcripts, and the Solexa ultra-high-throughput sequencer, which provides for 'digital' expression analysis of the whole transcriptome. The design and development of the software will be guided by prominent scientists engaged in the study of the brain, and the software will be applied to sample data sets derived from neurological tissue, to ensure that the program incorporates functions and annotations relevant to this field.

Public Health Relevance:
While the large-scale array technologies have provided an unprecedented capability to model cellular processes in the brain, both in normal functioning and disease states, this capability is utterly dependent on the availability of complex data management, computational, statistical and informatic software tools. The utility of the next generation of arrays - which focus on critical regulation and control functions of the cell - will be stymied by an initial lack of suitable bioinformatic tools. This proposal initiates an accelerated development of an integrated software package intended to empower biologists in the application and analysis of these powerful new technologies, with broadly reaching impact at all levels of biological and clinical research, and across every discipline.

Thesaurus Terms:
There Are No Thesaurus Terms On File For This Project.

Phase II

Contract Number: 5R43NS063540-02
Start Date: 9/30/08    Completed: 8/31/10
Phase II year
2009
(last award dollars: 2013)
Phase II Amount
$1,799,236

Development of an organ as complex as the brain must depend on an intricate interplay of thousands of signaling proteins, orchestrated by an interacting web of regulatory factors. Recent ENCODE data reveal that the large tracts of so-called 'junk' DNA in introns and between genes are, in fact, actively transcribed. The functions of these non-coding transcripts, which number in the millions, are virtually unknown, although one very small sub-class - the microRNAs - is receiving close attention as regulatory RNAs. One process that may be controlled by the non-coding transcripts is alternative exon use - a mechanism that adds significantly to the diversity of cellular proteins (particularly in the CNS), and whose regulation is little understood at this time. Recent technological advances that generate whole-transcriptome data have provided the means to systematically explore both of these factors - the role of non-coding transcripts, and the occurrence and control of splice variation. However, the bioinformatic challenges posed by these technologies are substantial, and the lack of comprehensive, well-designed, and easily used software to manage, visualize, analyze and interpret these data will likely be the limiting factor in this field of research. We propose to develop a bioinformatics toolkit specifically to mine whole-transcriptome data from two fundamentally different technologies: the Affymetrix All Exon microarray, which provides measures of over 1.4 million distinct transcripts, and the Solexa ultra-high-throughput sequencer, which provides for 'digital' expression analysis of the whole transcriptome. The design and development of the software will be guided by prominent scientists engaged in the study of the brain, and the software will be applied to sample data sets derived from neurological tissue, to ensure that the program incorporates functions and annotations relevant to this field.

Public Health Relevance:
While the large-scale array technologies have provided an unprecedented capability to model cellular processes in the brain, both in normal functioning and disease states, this capability is utterly dependent on the availability of complex data management, computational, statistical and informatic software tools. The utility of the next generation of arrays - which focus on critical regulation and control functions of the cell - will be stymied by an initial lack of suitable bioinformatic tools. This proposal initiates an accelerated development of an integrated software package intended to empower biologists in the application and analysis of these powerful new technologies, with broadly reaching impact at all levels of biological and clinical research, and across every discipline.

Thesaurus Terms:
Anova; Analysis Of Variance; Analysis, Data; Astrocytoma, Grade Iv; Atlases; Attention; Base Sequence; Bayesian Networks; Bio-Informatics; Bioinformatics; Biological; Biological Neural Networks; Body Tissues; Brain; Cell Function; Cell Process; Cell Physiology; Cellular Function; Cellular Physiology; Cellular Process; Clinical; Clinical Research; Clinical Study; Complex; Computer Programs; Computer Software Tools; Computer Software; Data; Data Analyses; Data Banks; Data Bases; Data Set; Databank, Electronic; Databanks; Database, Electronic; Databases; Dataset; Development; Discipline; Disease; Disorder; Elements; Encephalon; Encephalons; Ensure; Evaluation; Exons; Factor Analyses; Factor Analysis; Functional Rna; Gene Expression Profile; Genes; Genome; Genomics; Glioblastoma; Grade Iv Astrocytic Neoplasm; Grade Iv Astrocytic Tumor; Graph; Imagery; Individual; Informatics; Internet; Intervening Sequences; Introns; Junk Dna; Link; Measures; Messenger Rna; Methods; Micro Rna; Micrornas; Mining; Minings; Modeling; Nervous; Nervous System, Brain; Neuroblastoma; Neuroblastoma (Schwannian Stroma-Poor); Neurologic; Neurological; Non-Coding; Non-Coding Rna; Non-Linear Models; Nonlinear Models; Nucleotide Sequence; Organ; Pattern; Phase; Principal Component Analyses; Principal Component Analysis; Process; Proteins; Rna Splicing; Rna, Messenger; Regression Analyses; Regression Analysis; Regression Diagnostics; Regulation; Research; Role; Sbir; Sbirs (R43/44); Sampling; Schizophrenia; Schizophrenic Disorders; Scientist; Screening Procedure; Signaling Protein; Site; Small Business Innovation Research; Small Business Innovation Research Grant; Software; Software Tools; Source; Splicing; Statistical Regression; Subcellular Process; Technology; Time; Tissues; Tools, Software; Transcript; Variance Analyses; Variant; Variation; Visualization; Www; Base; Clinical Data Repository; Clinical Data Warehouse; Comparison Group; Computer Based Statistical Methods; Computer Program/Software; 
Data Management; Data Repository; Dementia Praecox; Depression; Design; Designing; Develop Software; Developing Computer Software; Digital; Disease/Disorder; Empowered; Gene Desert; Gene Expression Signature; Gene Product; Glioblastoma Multiforme; Mrna; Mirna; Neural; Neural Network; New Technology; Next Generation; Nucleic Acid Sequence; Public Health Relevance; Relating To Nervous System; Relational Database; Schizophrenic; Screening; Screenings; Social Role; Software Development; Spongioblastoma Multiforme; Statistical Methods, Computer Based; Tool; Transcriptome; Web; World Wide Web