[Home] [Server] [Help] [Forum]

EvoDesign Results for Your Protein: PD133

[Click on PD133.tar to download the tarball file including the EvoDesign results listed on this page]


  Summary of Input
Scaffold Structure


  Clustering Results

Sequences generated during the Monte Carlo simulation are clustered. The top clusters are listed in the table below. Information in the table includes the relative cluster sizes as well as the number of the top sequences that originate from each cluster. Users are able to download every sequence in each cluster; the files include the sequences as well as the free-energy of each sequence predicted by EvoDesign.

Cluster
Number
Relative Cluster
Size
Download Each
Sequence in Cluster
147.3%Clus 1 Sequences
234.8%Clus 2 Sequences
39.1%Clus 3 Sequences
46.8%Clus 4 Sequences

  Summary of Output

Design Number Sequence Identity (%)[?] Normalized Relative Error [?]
Secondary Structure Solvent Accessibility Torsion Angle φ Torsion Angle ψ
Design 1 25.4 -0.13 -0.07 0.07 0.30
Design 2 28.4 0.47 0.04 0.47 0.73
Design 3 17.9 0.07 0.07 0.05 0.04
Design 4 25.4 -0.13 -0.14 0.49 0.53
Design 5 26.9 -0.27 0.06 0.01 0.28
Design 6 35.8 0.53 0.06 0.24 0.14
Design 7 29.9 0.60 -0.08 0.53 1.51
Design 8 31.3 0.33 0.02 0.53 0.79
Design 9 26.9 -0.27 0.09 0.49 0.27
Design 10 32.8 0.47 0.09 0.25 0.93
Data.zip SI SS SA φ ψ
(a) Sequence Identity: The percent sequence identity between the designed sequence and the scaffold sequence.
(b) Normalized Relative Error (NRE): NRE=(EDSāˆ’ETS)/ETS, where EDS is the error of the neural-network predictions relative to the scaffold structure on the design sequence and ETS is the error of the predictions based on the sequence of the target scaffold. The secondary structure, solvent accessibility and torsion angles for the scaffold structure are assigned by DSSP.
(c) Secondary Structure (SS): SS is predicted by PSSPred for the scaffold and design sequences. The Q3 errors of the design sequence (EDS) and scaffold sequence (ETS) with respect to the scaffold structure are used to calculate the NRE for SS.
(d)Solvent Accessibility (SA): SA for scaffold and design sequences are predicted by neural-network method. The correlation on SA between design sequence and scaffold structure (EDS) and between scaffold sequence and scaffold structure (ETS) is used to calculate NRE on SA.
(e) Torsion Angle (TA): TA is predicted by ANGLOR for scaffold and design sequences. The mean absolute difference of the design sequence (EDS) and scaffold sequence (ETS) from the scaffold structure is used to calculate NRE for TA.

  Final Designed Sequences

Energy:         EvoDesign calculated energy
Iden.: % sequence identity between the design and scaffold sequences.
DSSP_SS: Secondary structure as assigned by DSSP on the scaffold structure.
Scaffold: Scaffold sequence.
Identical residues in the scaffold and design sequences are marked by '|'.
Design: Design sequence.
  Lowest Energy Sequence

Design
Number
EnergyIden. (%)
1-88.63 25.4 DSSP_SS
Scaffold

Design1
CCCHHHEECEEEEEEEECCCCEEEEEEECCCCEEEEEEEHHHHHHCCCCCCCEEEEEEECCEEEEEC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
   | |  || |     |    ||   | ||   |        | |    | | |            
QKAAKNMYKGTVKEIEQGGSETEVYVQIPGGEELTMTMPIQTAEALKLEPGREITVIMPPDSIHVFR

  Cluster 1 Lowest Energy Sequences

Design
Number
EnergyIden. (%)
2-87 28.4 DSSP_SS
Scaffold

Design2
CCCHHCCHCCEEEEEHHHCCHEEEEEECCCCCEEEEEEECCCHHHHCCCCCCEEEEEEEHHEEEECC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
|  | | |||  |  |      |   || ||  |       | |      ||            |  
SKAAENTLKGRIVQVKELGQFTEIIVEIPGGEEIVMLVPVKSAEAMRLEPGAHIAVWIEASKIHIFK
  Cluster 1 Lowest Energy Sequences

Design
Number
EnergyIden. (%)
3-84.03 17.9 DSSP_SS
Scaffold

Design3
CCCCCCECCCEEEEECCCCCCEEEEEEECCCCEEEEECCHHHHHHHCCCCCCCEEEEEECCEEEEEC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
     |   |  |            | | ||           |       ||      |     |  
RKPVENVYEGRIVHIEELGEETQIHLQIPGGQSLMAKVPVECVQTMKLEPGANVVVLMKADRIHIFR
  Cluster 1 Lowest Energy Sequences

Design
Number
EnergyIden. (%)
4-85.38 25.4 DSSP_SS
Scaffold

Design4
CCCCHHHEHCEEEHEHCCCCEEEEEEEECCCCCEEEEEEHHHHHHHCCCCCCEEEEEEECCCCEECC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
   | |   | |     |    |   |  ||           |||  |  |   |       | |  
EKVADNVFEGRVSFIERGGDYCEIHIECGGGENLKILVPVEAVEEMKVEPGKHITVLIPASNVHIFR
  Cluster 1 Lowest Energy Sequences

Design
Number
EnergyIden. (%)
5-82.11 26.9 DSSP_SS
Scaffold

Design5
CCCHCCEECEEEEEEEECCCCEEEEEEECCCCEEEEEECHHHHHHHCCCCCCEEEEEEECCEEEEEC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
   | |  || ||    |    ||     ||   |  |            ||  |   |      | 
CKAADNIFKGRVVEIEMGGDYTEVIVKMPGGQSLTAVIPMKKANQMKIEQGAQVTVYMKADIIHVLR

  Cluster 2 Center

Design
Number
EnergyIden. (%)
6-81.45 35.8 DSSP_SS
Scaffold

fdfddDesign6
CCCHHHHHCCEEEEEEECCHHEEEEEECCCCCCEEEEEEHHCCCCCCCCCCCEEEEEEECHEEEECC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
|  | | |||||    |     |    | || ||   |       || | | |   |     | |  
SSTAENILKGKVIKVEKLGELTEIYIAIPGGEKIAVLIPIECASDLGLKPGDEIVVVFPASSVHIFH

  Cluster 2 Lowest Energy Sequences

Design
Number
EnergyIden. (%)
7-80.46 29.9 DSSP_SS
Scaffold

Design7
CCHHHHHHCCHHHHHHHCCCCEEEEEEECCCCEEEEEEEHHHHHHCCCCCCHEEHEEECCCEEEECC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
|    |  || |  |       |||    ||           || |    |    |  | | | |  
SRAVENLFKGRVDFLESLGDKTEVVMVVPGGQQLVVVVPIQCVEALKISPGQQIVAWFKPTQVHIYR
  Cluster 2 Lowest Energy Sequences

Design
Number
EnergyIden. (%)
8-79.36 31.3 DSSP_SS
Scaffold

Design8
CCHHHHHHECEEEEEEHCCCHHEEEEECCCCCEEEEEECCCCHHHHEHCCCCEEEEECCCCCEEECC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
|      | | ||    |    | |  | || ||       | | | | |||          |    
SKALEYVLEGTVVAVEMGGEFTEIVMQIPGGKKIVVKVPVNSAEALQVTEGASVLVWMPPESVHVIR

  Cluster 3 Center

Design
Number
EnergyIden. (%)
90 26.9 DSSP_SS
Scaffold

fdfddDesign9
CCCHHHEHECEEEEEEHCCCEEEEEEEECCCCEEEEEEEHHHHHHHCCCCCCEEEEEECCCHEEECC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
|      |||  | |  |     |     || | |  |      ||    |           |  | 
SKVTDYVLKGTIVQLELGGTETQVHIAVDGGQKLTVLIPIEQAQELKLQPGRKIIVWLPADQVHVLR


  Cluster 4 Center

Design
Number
EnergyIden. (%)
100 32.8 DSSP_SS
Scaffold

fdfddDesign10
CCCHHHHHEEHHHHHHHCCCCEEEEEECCCCHEEEEEECHCHHHHHCCCCCCEEEEEECCCHEEECC
SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA
  |  |   | |  |    | |||   | ||   |    | ||| |    |      |    | |  
QKSVDNLYEGTVTALQMMGVMAEVYIKIPGGQVLTALVPLNSVEQLKLQPGKHIIIYVPASKVHIMR


  I-TASSER Modeling of Designed Sequences


Models of Top 10 Designed Sequences

Click
to view
Design
#
TM-scoreRMSDC-scoreModel
Structure
1 0.911.02 Download
2 0.930.85 Download
3 0.873.86 Download
4 0.901.04 Download
5 0.901.15 Download
6 0.891.20 Download
7 0.901.12 Download
8 0.930.81 Download
9 0.881.47 Download
10 0.921.06 Download

(a)TM-score is caculated between the predicted structure of the designed sequence and the scaffold structure.
(b)RMSD is caculated between the predicted structure of the designed sequence and the scaffold structure.
(c)C-score typically ranges from [-5,2] and is a quantitative measure of the confidence of each model. The higher the C-score, the higher the confidence in the model.



References: