Medical and Evolution Applications

1. Distinguishing between Neutral and Deleterious Mutants

Most proteins mutants are neutral in their effects.

Stabilities can distinguish some deleterious mutants from neutral ones.  We find neutral mutants in general to have a narrower range of stabilities than deleterious ones.  The deleterious mutants can be either much more or much less stable the neutral ones in general.  See an example below for 98 mutants in 5 different proteins.


 

Chart

 

The five proteins included were:

P00966

Arginosuccinate synthase

ASS1

Citrullinemia

High Blood Pressure

P16219

Acyl-CoA synthase (mitochondrial, short chain specific)

ACADS

ACADSD (Acyl-CoA dehydrogenase short chain deficiency)

Muscle weakness in infants

P45381

Aspartoacylase

ASPA

CAND (Canavan Disease)

Rare, neurodegenerative disease

P51648

Fatty aldehyde dehydrogenase

ALDH3A2

SLS (Sjoegren Larsson syndrome)

Mental retardation, Autosomal recessive disorder

Q04771

Activin receptor type-1

ACVR1

FOP (Fibrodysplasia ossificans progressiva)

Skeletal malformations,

Autosomal dominant disorder

2. Annotating Genes of Unknown Function by Improving Protein Sequence Matching

Diagram

We hypothesize that structural information can help to improve protein sequence matching.  The correlated sequence information for each structural family is collected and the leads to additional biases in possible amino acid substitutions beyond what is being used for independent single amino acid substitutions.  By using a new amino acid substitution matrix that incorporates this information all known cases of where the structures are known to be similar but sequence matches do not indicate any similarity are brought into agreement.  A few examples are shown below:

Diagram

Details are found in Jia K, Jernigan RL. SeqStruct : A New Amino Acid Similarity Matrix Based on Sequence Correlations and Structural Contacts Yields Sequence-Structure Congruence. bioRxiv 2018, 268904.

This new protein matching approach enables the assignment of function to many proteins (genes) presently having no assigned function.

Diagram

Funding Organization: 

Principal Investigator(s):