The HPSF (Human Proteome Structure and Function) database is a repository
of structure and function annotations on the 'missing proteins'
of the human proteome.
The missing proteins that have not been validated at protein level are
first extracted from the
neXtProt database.
The structure folding simulations are then generated by
I-TASSER
with all homologous templates excluded from the threading libraries.
Finally, the functional insights of each protein, including enzyme commission, gene ontology, ligand-binding
and subcellular location, are provided by the structure-based function annotation tool,
COFACTOR.
One goal of the HPSF database is to construct a comprehensive repository consisting of annotations on
the folding and function of all missing proteins in human proteome using the cutting-edge bioinformatics
methods, which should provide important help to recognize possible protein-coding genes
from the 'missing proteins' and to guide further protein characterization experiments.
(>>more about HPSF database).
HPSF News
Links to related database
- uPE1 protein annotation: COFACTOR function prediction for 66 uPE1 proteins on human chromosome 17.
- neXtProt: A comprehensive human-centric protein-related data.
- Ensembl: Automatic annotation on selected eukaryotic genomes
- HGNC: Responsible for approving unique symbols and names for human loci to allow unambiguous scientific communication.
- UniProt: A comprehensive, high-quality and freely accessible resource of protein sequence and functional information.
- Vega: A repository for high-quality gene models produced by the manual annotation of vertebrate genomes
- GENECODE: Encyclopaedia of genes and gene variants
-
Qiwen Dong, Rajasree Menon, Gilbert S. Omenn and Yang Zhang. Structural Bioinformatics Inspection of neXtProt PE5 Proteins in the Human Proteome. J Proteome Res, 2015. 14(9): 3750-3761 [PDF]