SBIR-STTR Award

Prediction Of Protein Function At The Industrial Scale
Award last edited on: 6/9/08

Sponsored Program
SBIR
Awarding Agency
NIH : NIGMS
Total Award Amount
$926,426
Award Phase
2
Solicitation Topic Code
-----

Principal Investigator
Acady R Mushegian

Company Information

Axys Pharmaceuticals Inc (AKA: Sequana Therapeutics Inc~Akkadix Corporation)

180 Kimball Way
South San Francisco, CA 94080
   (858) 452-6550
   N/A
   www.axyspharm.com
Location: Single
Congr. District: 14
County: San Mateo

Phase I

Contract Number: 1R43GM058331-01
Start Date: 00/00/00    Completed: 00/00/00
Phase I year
1998
Phase I Amount
$99,422
We propose to facilitate the understanding of protein functions and metabolic processes in living species, by developing a computer system for completely automated prediction of biochemical or biological functions for large sets of protein sequences. Instead of the conventional approaches that typically rely on one round of database search and inherit the annotation from the best-scoring sequence match, we will construct a proprietary structured database of complete proteomes and develop an automated multistep strategy of database searches, feature prediction, and annotation. The cascade of analysis will include: selection of the reference databases, flexible filtering, domain dissection, iterative searches, motif analysis, and rule-based modification of the database annotations. The system to support this approach will be capable of analyzing one protein every five minutes, which is approximately a tenfold increase over the productivity of a trained analyst. In addition, custom heuristic algorithms will result in 20-30 percent greater accuracy in annotations compared to the existing automated approaches. The rapid and accurate prediction of protein functions at the industrial scale established in the proposed project will be applied for exhaustive annotation of newly sequenced proteomes and EST collections, in order to reconstruct in detail biochemical pathways targeted for modification in various organisms. PROPOSED COMMERCIAL APPLICATION: The development of the genome-scale functional annotation of protein sequences at high speed and accuracy will accelerate gene discovery, target validation and metabolism reconstruction. This will provide for rapid and reliable development of human therapeutic molecules, antimicrobials, agricultural products, and industrial enzymes. Many new functional predictions for proteins from different species will be made and the value-added sequence databases will become available.

Phase II

Contract Number: 2R44GM058331-02
Start Date: 00/00/00    Completed: 00/00/00
Phase II year
1999
(last award dollars: 2000)
Phase II Amount
$827,004

We propose to develop a computational approach for completely automated prediction of biochemical functions for large sets of protein-coding sequences. Instead of the conventional approaches that rely either on one round of database search and annotation inheritance from the best-scoring sequence match, or on manual case-by-case reinspection, we will construct proprietary structured database of complete proteomes (Proteome Bank) and develop automated multi-step strategy of database searches, feature prediction, and annotation. The cascade of analysis will include: automated selection of reference databases (Proteome Bank, other proprietary database; public database, etc.), several types of filtering, domain dissection, iterative search, and rule-based modification of the database annotations. The system will analyze one or more proteins every five minutes, which is a ten fold increase over the productivity of a highly trained analyst. In addition, custom heuristic algorithms should result in 20-30 percent greater accuracy in annotations as compared to the existing automated approaches. The ability to predict protein functions rapidly and accurately at the genome scale will be applied for exhaustive annotation of proteomes and EST collections from humans, animals, plants and microbes, in order to reconstruct biochemical pathways and prioritize them as targets for modification. PROPOSED COMMERCIAL APPLICATION The genome-scale functional annotation of protein sequences at high speed and accuracy will accelerate gene discovery, target validation and metabolism reconstruction. This will provide for rapid and reliable development of human therapeutic molecules, antimicrobials, agricultural products, and industrial enzymes. Many new functional predictions for proteins from different species will be made and value-added sequence databases will become available.

Thesaurus Terms:
computer assisted sequence analysis, computer program /software, computer system design /evaluation, information system, protein structure /function genome, information retrieval, protein sequence