Summary of Input |
Scaffold Structure |
Clustering Results |
Sequences generated during the Monte Carlo simulation are clustered, and the top clusters are listed in the table below. The table gives the relative size of each cluster and a link to download every sequence in that cluster; each file contains the sequences together with the free energy that EvoDesign predicts for each one. |
Cluster Number | Relative Cluster Size | Download Each Sequence in Cluster |
---|---|---|
1 | 47.3% | Clus 1 Sequences |
2 | 34.8% | Clus 2 Sequences |
3 | 9.1% | Clus 3 Sequences |
4 | 6.8% | Clus 4 Sequences |
Summary of Output |
Design Number | Sequence Identity (%) | NRE: Secondary Structure | NRE: Solvent Accessibility | NRE: Torsion Angle φ | NRE: Torsion Angle ψ |
---|---|---|---|---|---|
Data.zip | SI | SS | SA | φ | ψ |
(a) | Sequence Identity: The percent sequence identity between the designed sequence and the scaffold sequence. |
(b) | Normalized Relative Error (NRE): NRE = (EDS − ETS)/ETS, where EDS is the error of the neural-network predictions relative to the scaffold structure on the design sequence and ETS is the error of the predictions based on the sequence of the target scaffold. The secondary structure, solvent accessibility and torsion angles for the scaffold structure are assigned by DSSP. |
(c) | Secondary Structure (SS): SS is predicted by PSSPred for the scaffold and design sequences. The Q3 errors of the design sequence (EDS) and scaffold sequence (ETS) with respect to the scaffold structure are used to calculate the NRE for SS. |
(d) | Solvent Accessibility (SA): SA for the scaffold and design sequences is predicted by a neural-network method. The correlation in SA between the design sequence and the scaffold structure (EDS) and between the scaffold sequence and the scaffold structure (ETS) is used to calculate the NRE for SA. |
(e) | Torsion Angle (TA): TA is predicted by ANGLOR for scaffold and design sequences. The mean absolute difference of the design sequence (EDS) and scaffold sequence (ETS) from the scaffold structure is used to calculate NRE for TA. |
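The NRE definition in (b) and the Q3-based error in (c) can be illustrated with a minimal Python sketch. The function names are hypothetical, and taking the Q3 error as the fraction of mismatched three-state positions (1 minus Q3 accuracy) is an assumption rather than something this page states; the strings are toy data, not server output. The correlation-based error for SA is not covered, since the page does not say how the correlation is turned into an error.

```python
def q3_error(predicted: str, reference: str) -> float:
    """Fraction of positions where the predicted 3-state secondary structure
    (H/E/C) differs from the reference assignment.
    Assumption: Q3 error = 1 - Q3 accuracy."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("sequences must not be empty")
    mismatches = sum(p != r for p, r in zip(predicted, reference))
    return mismatches / len(reference)


def normalized_relative_error(e_ds: float, e_ts: float) -> float:
    """NRE = (EDS - ETS) / ETS, as defined in item (b)."""
    if e_ts == 0:
        raise ValueError("ETS is zero, so the NRE is undefined")
    return (e_ds - e_ts) / e_ts


# Toy data (hypothetical): reference assignment on the scaffold structure,
# plus predictions made from the design and scaffold sequences.
reference_ss = "CCHHHHEEEE"
design_pred = "CCHHHCEEEC"
scaffold_pred = "CCHHHHEEEC"

e_ds = q3_error(design_pred, reference_ss)
e_ts = q3_error(scaffold_pred, reference_ss)
nre_ss = normalized_relative_error(e_ds, e_ts)
print(f"EDS={e_ds:.2f} ETS={e_ts:.2f} NRE={nre_ss:.2f}")
```

Under this definition, a negative NRE means the predictions made from the design sequence are closer to the scaffold structure than those made from the scaffold sequence itself, and a positive NRE means they are further away.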
Final Designed Sequences |
Energy: Energy calculated by EvoDesign.
Iden.: % sequence identity between the design and scaffold sequences.
DSSP_SS: Secondary structure as assigned by DSSP on the scaffold structure.
Scaffold: Scaffold sequence.
Identical residues in the scaffold and design sequences are marked by '|'.
Design: Design sequence.
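As a rough illustration of the Iden. column and the '|' marker line, the sketch below computes percent identity for two equal-length, ungapped sequences. The function names are hypothetical and this is not EvoDesign's own code; the two sequences are the scaffold and Design 1 from the table below.

```python
def match_line(scaffold: str, design: str) -> str:
    """Return a marker line with '|' where residues are identical, ' ' elsewhere."""
    if len(scaffold) != len(design):
        raise ValueError("sequences must have the same length")
    return "".join("|" if s == d else " " for s, d in zip(scaffold, design))


def percent_identity(scaffold: str, design: str) -> float:
    """Percentage of aligned positions carrying the same residue."""
    markers = match_line(scaffold, design)
    if not markers:
        raise ValueError("sequences must not be empty")
    return 100.0 * markers.count("|") / len(markers)


scaffold_seq = "SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA"
design1_seq = "QKAAKNMYKGTVKEIEQGGSETEVYVQIPGGEELTMTMPIQTAEALKLEPGREITVIMPPDSIHVFR"

print(scaffold_seq)
print(match_line(scaffold_seq, design1_seq))
print(design1_seq)
print(f"Iden. = {percent_identity(scaffold_seq, design1_seq):.1f}%")
```

For this pair, 17 of the 67 positions match, which agrees with the 25.4% listed for Design 1.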
Lowest Energy Sequence |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
1 | -88.63 | 25.4 | DSSP_SS Scaffold Design1 | CCCHHHEECEEEEEEEECCCCEEEEEEECCCCEEEEEEEHHHHHHCCCCCCCEEEEEEECCEEEEEC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA    | |  || |     |    ||   | ||   |        | |    | | |             QKAAKNMYKGTVKEIEQGGSETEVYVQIPGGEELTMTMPIQTAEALKLEPGREITVIMPPDSIHVFR |
Cluster 1 Lowest Energy Sequences |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
2 | -87 | 28.4 | DSSP_SS Scaffold Design2 | CCCHHCCHCCEEEEEHHHCCHEEEEEECCCCCEEEEEEECCCHHHHCCCCCCEEEEEEEHHEEEECC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA |  | | |||  |  |      |   || ||  |       | |      ||            |   SKAAENTLKGRIVQVKELGQFTEIIVEIPGGEEIVMLVPVKSAEAMRLEPGAHIAVWIEASKIHIFK |
Cluster 1 Lowest Energy Sequences |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
3 | -84.03 | 17.9 | DSSP_SS Scaffold Design3 | CCCCCCECCCEEEEECCCCCCEEEEEEECCCCEEEEECCHHHHHHHCCCCCCCEEEEEECCEEEEEC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA      |   |  |            | | ||           |       ||      |     |   RKPVENVYEGRIVHIEELGEETQIHLQIPGGQSLMAKVPVECVQTMKLEPGANVVVLMKADRIHIFR |
Cluster 1 Lowest Energy Sequences |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
4 | -85.38 | 25.4 | DSSP_SS Scaffold Design4 | CCCCHHHEHCEEEHEHCCCCEEEEEEEECCCCCEEEEEEHHHHHHHCCCCCCEEEEEEECCCCEECC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA    | |   | |     |    |   |  ||           |||  |  |   |       | |   EKVADNVFEGRVSFIERGGDYCEIHIECGGGENLKILVPVEAVEEMKVEPGKHITVLIPASNVHIFR |
Cluster 1 Lowest Energy Sequences |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
5 | -82.11 | 26.9 | DSSP_SS Scaffold Design5 | CCCHCCEECEEEEEEEECCCCEEEEEEECCCCEEEEEECHHHHHHHCCCCCCEEEEEEECCEEEEEC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA    | |  || ||    |    ||     ||   |  |            ||  |   |      |  CKAADNIFKGRVVEIEMGGDYTEVIVKMPGGQSLTAVIPMKKANQMKIEQGAQVTVYMKADIIHVLR |
Cluster 2 Center |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
6 | -81.45 | 35.8 | DSSP_SS Scaffold Design6 | CCCHHHHHCCEEEEEEECCHHEEEEEECCCCCCEEEEEEHHCCCCCCCCCCCEEEEEEECHEEEECC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA |  | | |||||    |     |    | || ||   |       || | | |   |     | |   SSTAENILKGKVIKVEKLGELTEIYIAIPGGEKIAVLIPIECASDLGLKPGDEIVVVFPASSVHIFH |
Cluster 2 Lowest Energy Sequences |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
7 | -80.46 | 29.9 | DSSP_SS Scaffold Design7 | CCHHHHHHCCHHHHHHHCCCCEEEEEEECCCCEEEEEEEHHHHHHCCCCCCHEEHEEECCCEEEECC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA |    |  || |  |       |||    ||           || |    |    |  | | | |   SRAVENLFKGRVDFLESLGDKTEVVMVVPGGQQLVVVVPIQCVEALKISPGQQIVAWFKPTQVHIYR |
Cluster 2 Lowest Energy Sequences |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
8 | -79.36 | 31.3 | DSSP_SS Scaffold Design8 | CCHHHHHHECEEEEEEHCCCHHEEEEECCCCCEEEEEECCCCHHHHEHCCCCEEEEECCCCCEEECC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA |      | | ||    |    | |  | || ||       | | | | |||          |     SKALEYVLEGTVVAVEMGGEFTEIVMQIPGGKKIVVKVPVNSAEALQVTEGASVLVWMPPESVHVIR |
Cluster 3 Center |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
9 | 0 | 26.9 | DSSP_SS Scaffold Design9 | CCCHHHEHECEEEEEEHCCCEEEEEEEECCCCEEEEEEEHHHHHHHCCCCCCEEEEEECCCHEEECC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA |      |||  | |  |     |     || | |  |      ||    |           |  |  SKVTDYVLKGTIVQLELGGTETQVHIAVDGGQKLTVLIPIEQAQELKLQPGRKIIVWLPADQVHVLR |
Cluster 4 Center |
Design Number | Energy | Iden. (%) | ||||||
---|---|---|---|---|---|---|---|---|
10 | 0 | 32.8 | DSSP_SS Scaffold Design10 | CCCHHHHHEEHHHHHHHCCCCEEEEEECCCCHEEEEEECHCHHHHHCCCCCCEEEEEECCCHEEECC SISARNQLKGKVVGLKKGVVTAEVVLEIAGGNKITSIISLDSVEELGVKEGAELTAVVKSTDVMILA   |  |   | |  |    | |||   | ||   |    | ||| |    |      |    | |   QKSVDNLYEGTVTALQMMGVMAEVYIKIPGGQVLTALVPLNSVEQLKLQPGKHIIIYVPASKVHIMR |
I-TASSER Modeling of Designed Sequences |
Models of Top 10 Designed Sequences