Input Sequence in FASTA format
>P02818 (100 residues)
MRALTLLALLALAALCIAGQAGAKPSGAESSKGAAFVSKQEGSEVVKRPRRYLYQWLGAP
VPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV
Predicted Secondary Structure
Sequence (residues 1-100) | MRALTLLALLALAALCIAGQAGAKPSGAESSKGAAFVSKQEGSEVVKRPRRYLYQWLGAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV |
Prediction | CCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCSSCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCC |
Confidence | 9213999999999999526888886678754467411465523555044445444555566899902667777425723899999981999999971799 |
| H: Helix; S: Strand; C: Coil |
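The per-residue prediction string can be collapsed into contiguous segments for easier reading. The following is a minimal sketch, assuming the prediction is available as a plain string of H/S/C characters; the function name `ss_segments` is hypothetical and not part of D-I-TASSER.

```python
from itertools import groupby


def ss_segments(prediction):
    """Collapse a per-residue H/S/C string into (state, start, end) runs.

    Residue indices are 1-based and inclusive.
    """
    segments = []
    start = 1
    for state, run in groupby(prediction):
        length = sum(1 for _ in run)
        segments.append((state, start, start + length - 1))
        start += length
    return segments


# Prediction row copied from the table above.
prediction = "CCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCSSCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCC"

segments = ss_segments(prediction)
# Keep only the helical runs (hypothetical downstream use).
helices = [seg for seg in segments if seg[0] == "H"]
```

Each tuple gives the state letter and the first and last residue of the run, so the helical regions can be read off directly from `helices`.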
Predicted Solvent Accessibility
Sequence (residues 1-100) | MRALTLLALLALAALCIAGQAGAKPSGAESSKGAAFVSKQEGSEVVKRPRRYLYQWLGAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV |
Prediction | 7430100101100000013536654554656644220346424512444453444444345434543454344156344044004430134015424358 |
| Values range from 0 (buried residue) to 8 (highly exposed residue). |
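The accessibility row is one digit per residue, so it can be parsed directly. The following is a minimal sketch, assuming the row is available as a digit string; the function names and the `cutoff` used to call a residue buried are hypothetical choices for illustration, not values defined by D-I-TASSER.

```python
def parse_accessibility(digits):
    """Convert a digit string into integer accessibility values (0-8)."""
    values = [int(ch) for ch in digits]
    if any(v > 8 for v in values):
        raise ValueError("accessibility values must lie between 0 and 8")
    return values


def buried_positions(values, cutoff=1):
    """Return 1-based residue indices whose value is at or below `cutoff`.

    The default cutoff is an assumption made for this example only.
    """
    return [i for i, v in enumerate(values, start=1) if v <= cutoff]


# First seven values of the prediction row above.
example = parse_accessibility("7430100")
buried = buried_positions(example)
```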
Predicted Contact, Hydrogen and Distance Map Used in D-I-TASSER simulation
Contact Map | Distance Map | Hydrogen bond networks |
D-I-TASSER simulation is guided by the consensus contact map
(left figure), distance map (middle figure), and hydrogen bond network (right figure), derived from the confidence scores of
DeepPotential.
In all three maps, the axes mark the residue index along the sequence.
In the contact map, each dot represents a residue pair predicted to be in contact;
in the distance map and hydrogen bond network, a color scale represents a distance of 1-20+ angstroms or an angle of 0-180 degrees.
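The relationship between a distance map and a contact map can be illustrated with a short example. The following is a minimal sketch, assuming a square matrix of pairwise distances in angstroms; the 8 angstrom cutoff is a common convention and an assumption here, since the page does not state the threshold used by D-I-TASSER, and the function name is hypothetical.

```python
def contacts_from_distances(dist, cutoff=8.0, min_separation=1):
    """List residue pairs (1-based, i < j) closer than `cutoff` angstroms.

    `dist` is a square list of lists of pairwise distances.
    `min_separation` skips pairs that are too close along the sequence.
    """
    n = len(dist)
    pairs = []
    for i in range(n):
        for j in range(i + min_separation, n):
            if dist[i][j] < cutoff:
                pairs.append((i + 1, j + 1))
    return pairs


# Toy three-residue distance matrix (made-up values, in angstroms).
toy = [
    [0.0, 3.8, 9.0],
    [3.8, 0.0, 5.0],
    [9.0, 5.0, 0.0],
]
toy_contacts = contacts_from_distances(toy)
```

Each returned pair corresponds to one dot in a contact map plotted with residue indices on both axes.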
|
Top 10 threading templates used by D-I-TASSER
Rank | PDB hit | ID1 | ID2 | Cov | Norm. Zscore | Download alignment | | Alignment (residues 1-100)
| | | | | | SS | | CCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCSSCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCC |
| | | | | | Seq | | MRALTLLALLALAALCIAGQAGAKPSGAESSKGAAFVSKQEGSEVVKRPRRYLYQWLGAPVPYPDPLEPRREVCELNPDCDELADHIGFQEAYRRFYGPV |
1 | 1q8hA | 0.88 | 0.30 | 8.44 | 1.37 | SPARKS-K | | ---------------------------------------------------------------PDPLP--RRVC-LNPDCDELADHIGFQEAYRRFYGIA |
2 | 1q8hA | 0.88 | 0.30 | 8.44 | 2.74 | HHsearch | | ---------------------------------------------------------------PDPLP--RRVC-LNPDCDELADHIGFQEAYRRFYGIA |
3 | 2r79A2 | 0.08 | 0.08 | 3.16 | 0.44 | CEthreader | | VPQRAEAAELDYRQRLRRQADWIAAAQKSQPAPGVLLVIGNAGGQLLVAGRNTGGDATHEGYKPISVEALAALLEGDAARAALLKQNPGLAPTRAARDGR |
4 | 7jtkE | 0.04 | 0.04 | 2.08 | 0.55 | EigenThreader | | ATQRMEAAERRKLEEKERRMQQERERVERERVVRQKVAASAFARGYLSGIVNTVFDRLVDPELEAVRRRPTFVLRELKAVEAAAAELTDIDILSYMMDKG |
5 | 1q8hA | 0.74 | 0.25 | 7.09 | 0.73 | FFAS-3D | | -----------------------------------------------------------------PDPLPRRVCLN-PDCDELADHIGFQEAYRRFYGIA |
6 | 1vzmA | 0.41 | 0.15 | 4.42 | 1.21 | SPARKS-K | | -----------------------------------------------------------KELTLAQT-SLR-VC-TNMACDM-ADAQGIVAAYQAFYGPI |
7 | 1q8hA | 0.86 | 0.32 | 9.01 | 0.79 | CNFpred | | ---------------------------------------------------------------PDPLMPRRMVCMLNPDCDELADHIGFQEAYRRFYGIA |
8 | 5n8pA | 0.06 | 0.06 | 2.55 | 0.83 | DEthreader | | AQQTANSGTGLTTSSTIVTITDSILLIVNGFGGFANFVANVGFNLVNIATGLAVAVTDSLTLIFDIVILL--A-AFGA--A--VTLGATLQYLDAAVFSA |
9 | 6i0dM | 0.09 | 0.09 | 3.43 | 0.68 | MapAlign | | VGSLPMLAAVLGARLLFRFAIPLAPEGFAQAQGLLLFLAALSALYGAWVAFAAKDFKTLLAYAGLSHMGVAALGSGTPEGAMGGLYLPGEFLTLLGAYKA |
10 | 1q8hA | 0.88 | 0.30 | 8.44 | 0.89 | MUSTER | | ---------------------------------------------------------------PDPLP--RRVC-LNPDCDELADHIGFQEAYRRFYGIA |
(a) | ID1 is the number of template residues identical to query divided by number of aligned residues. |
(b) | ID2 is the number of template residues identical to query divided by query sequence length. |
(c) | Cov is the number of aligned template residues divided by query sequence length. |
(d) | Norm. Zscore is the normalized Z-score of the threading alignments. A Normalized Z-score >1 means a good alignment and is highlighted in bold. |
(e) | Download alignment lists the threading program used to identify the template and provides the 3D structure of the aligned regions of the threading templates (threading[1-10].pdb.gz). |
(f) | Template residues identical to query sequence are highlighted in color. |
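The definitions of ID1, ID2, and Cov in footnotes (a) to (c) can be written out as a short calculation. The following is a minimal sketch, assuming the query is ungapped and the template row has the same length with `-` marking unaligned positions, as in the table layout above; the function name and the toy sequences are hypothetical.

```python
def alignment_stats(query, template):
    """Return (ID1, ID2, Cov) for a template row aligned to an ungapped query.

    ID1 = identical residues / aligned residues
    ID2 = identical residues / query length
    Cov = aligned residues / query length
    """
    if len(query) != len(template):
        raise ValueError("query and template rows must have equal length")
    if not query:
        raise ValueError("query must not be empty")
    aligned = [(q, t) for q, t in zip(query, template) if t != "-"]
    identical = sum(1 for q, t in aligned if q == t)
    id1 = identical / len(aligned) if aligned else 0.0
    id2 = identical / len(query)
    cov = len(aligned) / len(query)
    return id1, id2, cov


# Toy example: 6 of 10 positions aligned, 5 of them identical.
id1, id2, cov = alignment_stats("ACDEFGHIKL", "--DEYGHI--")
```

Because ID1 is normalized by the aligned length and ID2 by the full query length, a short but highly similar template can have a high ID1 and a low ID2.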
|
Top 1 final models from D-I-TASSER
Rank (a) | Download | Estimated TM-score (b) |
1 | model1.pdb.gz | 0.364 |
(a) | D-I-TASSER simulations generate a large ensemble of structural conformations, i.e., decoys. These decoys are clustered by SPICKER based on pairwise structure similarity to report up to five final models from the five largest clusters. Models are ranked in descending order of cluster size. If the simulations converge well, fewer than five models may be generated, which usually indicates good model quality. |
(b) | Model confidence is quantified by the estimated TM-score (eTM-score), calculated from the significance of the threading template alignments, the contact map satisfaction rate, the mean absolute error between the model distances and the distances predicted by DeepPotential, and the convergence of the D-I-TASSER simulations. The eTM-score is typically in the range [0, 1], where a higher eTM-score signifies higher model confidence. |
|
Proteins with similar structure
|
Top 10 structural analogs in PDB (as identified by TM-align)
(a) | Query structure is shown in cartoon, while the structural analog is displayed using backbone trace. |
(b) | Ranking of proteins is based on TM-score of the structural alignment between the query structure and known structures in the PDB library. |
(c) | RMSDa is the RMSD between residues that are structurally aligned by TM-align. |
(d) | IDENa is the percentage sequence identity in the structurally aligned region. |
(e) | Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein. |
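The TM-score, RMSD, and coverage reported for a structural alignment can be related through a small calculation. The following is a minimal sketch, assuming the two structures are already superposed and the distances between aligned residue pairs are known. It uses the standard TM-score formula with d0 = 1.24 (L - 15)^(1/3) - 1.8, but it does not perform the superposition search that TM-align carries out; the function names and the 0.5 angstrom floor on d0 are assumptions made for this example.

```python
import math


def tm_d0(target_length):
    """Length-dependent distance scale d0 of the TM-score, in angstroms."""
    if target_length <= 15:
        raise ValueError("this sketch only handles lengths above 15")
    # The 0.5 angstrom floor is an assumption for very short proteins.
    return max(1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8, 0.5)


def tm_score(distances, target_length):
    """TM-score for a fixed superposition, normalized by `target_length`."""
    d0 = tm_d0(target_length)
    return sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances) / target_length


def aligned_rmsd(distances):
    """RMSD over the structurally aligned residue pairs only."""
    if not distances:
        raise ValueError("no aligned residues")
    return math.sqrt(sum(d * d for d in distances) / len(distances))


def coverage(distances, target_length):
    """Aligned residues divided by the length of the query protein."""
    return len(distances) / target_length


# Made-up example: 34 aligned pairs, each 2 angstroms apart, 100-residue query.
example_distances = [2.0] * 34
example_tm = tm_score(example_distances, 100)
example_cov = coverage(example_distances, 100)
```

Because unaligned residues contribute nothing to the sum, a low coverage caps the TM-score even when the aligned region superposes closely.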
|
Predicted Gene Ontology (GO) Terms
|
GO term | CscoreGO | Name |
GO:0005198 | 0.56 | structural molecule activity |
GO:0008147 | 0.55 | structural constituent of bone |
(a) | CscoreGO is the confidence score of predicted GO terms. CscoreGO values range from 0 to 1, where a higher value indicates higher confidence in the function predicted using the template. |
(b) | The graph shows the predicted terms within the Gene Ontology hierarchy for Molecular Function. Confidently predicted terms are color coded by CscoreGO: |
| [0.4,0.5) | [0.5,0.6) | [0.6,0.7) | [0.7,0.8) | [0.8,0.9) | [0.9,1.0] |
(a) | CscoreGO is the confidence score of predicted GO terms. CscoreGO values range from 0 to 1, where a higher value indicates higher confidence in the function predicted using the template. |
(b) | The graph shows the predicted terms within the Gene Ontology hierarchy for Biological Process. Confidently predicted terms are color coded by CscoreGO: |
| [0.4,0.5) | [0.5,0.6) | [0.6,0.7) | [0.7,0.8) | [0.8,0.9) | [0.9,1.0] |
(a) | CscoreGO is the confidence score of predicted GO terms. CscoreGO values range from 0 to 1, where a higher value indicates higher confidence in the function predicted using the template. |
(b) | The graph shows the predicted terms within the Gene Ontology hierarchy for Cellular Component. Confidently predicted terms are color coded by CscoreGO: |
| [0.4,0.5) | [0.5,0.6) | [0.6,0.7) | [0.7,0.8) | [0.8,0.9) | [0.9,1.0] |
Predicted Enzyme Commission (EC) Numbers
|
Top 5 enzyme homologs in PDB
(a) | CscoreEC is the confidence score for the Enzyme Commission (EC) number prediction. CscoreEC values range from 0 to 1, where a higher score indicates a more reliable EC number prediction. |
(b) | TM-score is a measure of global structural similarity between query and template protein. |
(c) | RMSDa is the RMSD between residues that are structurally aligned by TM-align. |
(d) | IDENa is the percentage sequence identity in the structurally aligned region. |
(e) | Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein. |
|
Predicted Ligand Binding Sites
|
Template proteins with similar binding site:
(a) | CscoreLB is the confidence score of the predicted binding site. CscoreLB values range from 0 to 1, where a higher score indicates a more reliable ligand-binding site prediction. |
(b) | BS-score is a measure of local similarity (sequence & structure) between the template binding site and the predicted binding site in the query structure. Based on large-scale benchmarking analysis, a BS-score >1 reflects a significant local match between the predicted and template binding sites. |
(c) | TM-score is a measure of global structural similarity between query and template protein. |
(d) | RMSDa is the RMSD between residues that are structurally aligned by TM-align. |
(e) | IDENa is the percentage sequence identity in the structurally aligned region. |
(f) | Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein. |
|