Developing optimal nonlinear scoring function for protein design

Biology – Quantitative Biology – Biomolecules

Scientific paper

Rate now

  [ 0.00 ] – not rated yet Voters 0   Comments 0

Details

25 pages, 6 figures, 7 tables. Accepted by Bioinformatics

Scientific paper

Motivation. Protein design aims to identify sequences compatible with a given protein fold but incompatible to any alternative folds. To select the correct sequences and to guide the search process, a design scoring function is critically important. Such a scoring function should be able to characterize the global fitness landscape of many proteins simultaneously. Results. To find optimal design scoring functions, we introduce two geometric views and propose a formulation using mixture of nonlinear Gaussian kernel functions. We aim to solve a simplified protein sequence design problem. Our goal is to distinguish each native sequence for a major portion of representative protein structures from a large number of alternative decoy sequences, each a fragment from proteins of different fold. Our scoring function discriminate perfectly a set of 440 native proteins from 14 million sequence decoys. We show that no linear scoring function can succeed in this task. In a blind test of unrelated proteins, our scoring function misclassfies only 13 native proteins out of 194. This compares favorably with about 3-4 times more misclassifications when optimal linear functions reported in literature are used. We also discuss how to develop protein folding scoring function.

No associations

LandOfFree

Say what you really think

Search LandOfFree.com for scientists and scientific papers. Rate them and share your experience with other people.

Rating

Developing optimal nonlinear scoring function for protein design does not yet have a rating. At this time, there are no reviews or comments for this scientific paper.

If you have personal experience with Developing optimal nonlinear scoring function for protein design, we encourage you to share that experience with our LandOfFree.com community. Your opinion is very important and Developing optimal nonlinear scoring function for protein design will most certainly appreciate the feedback.

Rate now

     

Profile ID: LFWR-SCP-O-457718

  Search
All data on this website is collected from public sources. Our data reflects the most accurate information available at the time of publication.