


The ocean harbors a huge diversity of microbes that provide nearly half of the primary energy production on this planet, through either photosynthesis or chemosynthesis. This page lists the results of structure and function modeling of previously unknown Pfam families, obtained by combining marine microbiome sequences with deep-learning contact-map prediction and ab initio folding simulations, followed by structure-based function annotation. More details can be found in: Yan Wang, Qiang Shi, Pengshuo Yang, Chengxin Zhang, S. M. Mortuza, Zhidong Xue, Kang Ning, Yang Zhang. Fueling ab initio folding with oceanic metagenomics enables structure and function predictions of new protein families. (in press, 2019)


Representative sequences

Multiple Sequence Alignment

Predicted 3D structures and structure-based function annotations

Pfam ID | Representative sequence | Predicted structure | Predicted function | Estimated TM-score
PF02326 >sp|P38462|YMF19_MARPO [Download]
MPQLDQFTYLTQFVWLCVFYMTFYVLLYNDGLPKISRIIKLRKRLVSQEK
VGAEQSNDRVEQDVVFKECFQASANYLYSSVSGASKWCKGMVQLANAHKL
QRMNKDYVCSLGEISVSQVIKKNALSTLSPSTYQTTSLASRQTTALNKIY
VLRGQKRTLAKIKNGPRKKKIS
MetaGO result 0.468
PF04380 >sp|Q46868|YQIC_ECOLI [Download]
MIDPKKIEQIARQVHESMPKGIREFGEDVEKKIRQTLQAQLTRLDLVSRE
EFDVQTQVLLRTREKLALLEQRISELENRSTEIKKQPDPETLPPTL
MetaGO result 0.699
PF05939 >sp|P03737|TIPM_LAMBD [Download]
MKTFRWKVKPGMDVASVPSVRKVRFGDGYSQRAPAGLNANLKTYSVTLSV
PREEATVLESFLEEHGGWKSFLWTPPYEWRQIKVTCAKWSSRVSMLRVEF
SAEFEQVVN
MetaGO result 0.600
PF06067 >sp|P18005|YUBP_ECOLI [Download]
MRLASRFGRYNSIHRERPLTDDELMQFVPSVFSGDKHESRSERYTYIPTI
NIINKLRDEGFQPFFACQSRVRDLGRREYSKHMLRLRREGHINGQEVPEI
ILLNSHDGSSSYQMIPGIFRFVCTNGLVCGNNFGEIRVPHKGDIVGQVIE
GAYEVLGVFDKVTDNMEAMKEIHLNSDEQHLFGRAALMVRYEDENKTPVT
PEQIITPRRWEDKQNDLWTTWQRVQENMIKGGLSGRSASGKNTRTRAITG
IDGDIRINKALWVIAEQFRKWKS
MetaGO result 0.504
PF06698 >sp|B0C1E1|RL29_ACAM1 [Download]
MALSKMADLKNLSVDEIDAKVQELKKELFDLRFQKATKETIQPHRFKHIR
HEIAQLLTLKQQQS
MetaGO result 0.827
PF07583 >k87_4201897_1_1192157_1 [Download]
LFAIKPGDVEDSELIYRIFAKDADELMPPKESKLALTAEQKALLKQWIAE
GAEYEPHWAYEKPKREAVPTVKNSQWSRNDVDRFILAQLEKRGWKPSKAA
DKHALIRRVSLDLTGLPPTPVEVAAFVADTSPDAYEKLVDRLLAKPAYGE
HWARQWLDLARYADSTGYADDQPRTIWGYRDWVIRALNSNMPFDQFTREQ
LAGDLLDKPTNDQLIATAFHRNTQTNNEGGTSDEEFRNVAVIDRVNTTMQ
VWMGTTMACAQCHDHKYDPLSQEEYFRMFAIFNNTEDSDKRDEAPFVSLF
TDEQKKQ
MetaGO result 0.534
PF07586 >k87_2239226_1_1294060_1 [Download]
RAAFVFFPNGAIMPSWKPTGSGTDFELSQTLKPLEPFRSELNIFTGLAQD
NGRAKGDGPGDHARCAASYLTGAHPVKTSGANIKVGVSVDQVAAQQIGKR
TRLPSLEIGIERGRNAGNCDSGYSCAYSSNVAWKTATTPVAKEINPRAVF
NRLFGSTEDAKVRKRRSRIRKSILDLVAADAQRLQKTLGKNDQAKLDEYF
TSVREIEQRIARAEQDAEQRRPREFKVPDQTPRNLTEHVNLMYDLLALAF
QTDTTRVSTYMLGNAGSNRTYPMVGVNSGWHSLSHHRDEKAKVDQLQKID
QWHVEQFA
MetaGO result 0.507
PF07587 >k87_701284_6_155137_1 [Download]
NRLGLAQWLVHPEHPLTARVTVNRMWQHHFGTGLVKTAEDFGVQGEYPSH
PALLDWLAVDFIESGWDVKRLHKLLVMSATYQQSSRVDAAKLAADPENRL
LSRGARFRLDAEVIRDQALAVSGLLVPEIGGKSVRPYQPPGLWKPVGFGG
SNTSVFKQDTGDKLYRRSMYTFWKRTSPPPSMTAFDAPDRETCQVRRART
NTPLQALVLMNDVQFVEAARKFAECVVRQGGSSVEERVTFAYRSLLSRRP
TASELASVTKTF
MetaGO result 0.558
PF07624 >k87_7071864_1_1569173_1 [Download]
DGTPFAGPQELKSIVREKKTLFARCLAEKMLTYALGRGVEYYDRPTVERI
VKQLADQDYKFSTLVAEIVKSDPF
MetaGO result 0.783
PF07626 >k87_5295747_1_1242780_1 [Download]
YCDKCHGPDTDNGGLRFDKYRSTADVQADRKTWAKAMQYLELGAMPPPDH
DKQPDAKARESVVKYLDKTLFDIDCDIVRDPGRVTIRRLNRAEYNNTVRD
LLGVDFEPAKNFPSDDVGEGFDNIGDVLSVSPLLVEKYLDAAEQVAAKAI
VPVDPFTSRNRKFTGGQFNKNLGVNAQGSSFGIFSRGHVGIVIDFPSDGE
YVFRINASADQAGPDPARLTLSVDGKGVKTIDIKSKNRGDRKNHEHRLKM
VRGRRRIRVAF
MetaGO result 0.533
PF07627 >k87_10638706_1_1481988_1 [Download]
RGGILRHGSILLATSYATRTSPVIRGKWILDNILGVPPPPPPANVPELEE
TQTGQQALSMRERLAEHRANPACASCHRLMDPIGFAFENYDAVGRWRTTD
MetaGO result 0.628
PF07631 >k87_15041902_1_1674288_1 [Download]
LFRVERDPAGVRPDSPYEISDFELASRLSFFLWSSIPDDELLDLATEGQL
SRPEVLEQQVRRMLADERSRTLLTNFVGQWLHLRNLESATPDMRLFPDFD
DNLRQAMRRETELFLENVIREDRSVLDLLTADYTFLNERLAKHYEIPH
MetaGO result 0.635
PF07637 >k87_1921734_2_1214167_1 [Download]
SLPQFEKLYNEANEERIADNRILLARVKDVKAVARQPRGRPQRTSSTKPG
NVAIDWVEVRGPILPKTASAKSLVFFVTPKSDEEEPTVAREIIGRFAHRA
FRRPVDAAEVDRFVSLYEKVTARGDDFEDSVKLALAGVLVSPYFLFRVEA
GPGDDEFRLNDFQ
MetaGO result 0.365
PF07864 >k87_10715412_1_1903987_1 [Download]
MGGSSACEPEGWLIDPNKHWSLRFHRDQKSWRSDLFVFMDKGRAMPDGSP
ALLKSRRHLPKRDAVEIWNKLRADGWHRVEPQWGVGLDP
MetaGO result 0.639
PF08855 >k87_1367713_1_1135291_1 [Download]
MAFFESEIVQEEAKRLFNDYQQLMQLGSEYGKFDREGKKMFIDTMEELME
RYRVFMKRFELSEDFQAKMTVEQLRTQLGQFGVTPEQMFEQMNQTLSRMK
SQLEQDSR
MetaGO result 0.711
PF09834 >k87_10288621_25_1863378_1 [Download]
LGTKLVEKKRHIAKTITWRIVGTIDTILLSWLITGDPLMGLSIGAVEVIT
KMVLYYVHERVWFNVNLSKEGKLLESRKRHLVKTLTWRLVGTLDTMTLAW
LITGNPIAGLQIGLAEVFSKMLLYYLHERVWYKSTYGLRTANKKGYE
MetaGO result 0.560
PF09923 >sp|Q9ZDG9|Y359_RICPR [Download]
MKLLQIIFIITIYINLHIFVLAENLESVEDSENDVFIPLDKNHPILNQKD
NIYSAEFKNYTNGKIIALNKITATSEEIGLKAGEEKYFGNIKIKLHKCIK
NLDPYNQDNYLLMTITEYKIDEDPTLLFQGWMVSSSISLSTFEHPIYEIF
AKDCF
MetaGO result 0.525
PF10985 >k87_361919_1_163629_1 [Download]
MEKKQFSVEQIDRIIEMAWEDRTPFDAIEYQFGIKEQEVISIMRNNLKPS
SFKLWRKRVQGRKTKHSKQRGEDVNRFKCSRQRGKSNNKVSKR
MetaGO result 0.655
PF11233 >k87_9285995_22_1724173_1 [Download]
MQAKIGTLGLAFTAVLTLAACGNSRDPSLMNLRSSHQGPDEFGVIPTKPL
EMPENLDALPPPTPGGKNITDPTPEADAVAALGGNPRRLDETGQVPTSDR
ALVSAASRYGSQSGIRQTLAAEDYQWRKDHDGRLLERLFQVSVYYKAYKP
MELDQSAELERWRRLGVVTPGAPPSGEAQRAVK
MetaGO result 0.422
PF11297 >k87_5365323_1_1 [Download]
MGEQKRKEATKSDFIFGKKNYKFMLIGLAIIILGFILMSGGGSDDPNVWN
PDVFSWRRIRLAPTLVLIGFGIEVYAILLNPNKK
MetaGO result 0.623
PF11351 >k87_12002477_1_1542064_1 [Download]
MGLIERIFTAIFGGERNVIRDTVEVFRENAEAGAQRAHAVQGQAMAQYGA
EFAQARQGGFDRFMDGVNRLPRPMLALGTLGLFVTAMVNPLWFAERMQGI
ALVPEPLWWLLGVIVSFYFGARQQVKSQEFQRAIVGTIARVPQVVENIET
LREMRADSPQVADTGADSCLALAAVAPSENAALDAWRRSRG
MetaGO result 0.421
PF11360 >k87_199046_16_120644_1 [Download]
MSKVYALLFNIRTENEAIHTVQIGDRNKILMFESEDDAMRYSLLLEAQDF
PVPTIESFDSDEIKEFCKQADYDWEIISNEELAIPPEKNVEKFSWKNEED
LKIVTDEVQMSTEEMDMMRNKLEGLL
MetaGO result 0.516
PF11753 >sp|P03781|NUCK_BPT7 [Download]
MGLLDGEAWEKENPPVQATGCIACLEKDDRYPHTCNKGANDMTEREQEMI
IKLIDNNEGRPDDLNGCGILCSNVPCHLCPANNDQKITLGEIRAMDPRKP
HLNKPEVTPTDDQPSAETIEGVTKPSHYMLFDDIEAIEVIARSMTVEQFK
GYCFGNILKYRLRAGKKSELAYLEKDLAKADFYKELFEKHKDKCYA
MetaGO result 0.320
PF12322 >sp|P13335|GP26_BPT4 [Download]
MYEYKFDVRVGSKIINCRAFTLKEYLELITAKNNGSVEVIVKKLIKDCTN
AKDLNRQESELLLIHLWAHSLGEVNHENSWKCTCGTEIPTHINLLHTQID
APEDLWYTLGDIKIKFRYPKIFDDKNIAHMIVSCIETIHANGESIPVEDL
NEKELEDLYSIITESDIVAIKDMLLKPTVYLAVPIKCPECGKTHAHVIRG
LKEFFELL
MetaGO result 0.579
PF14108 >sp|Q8LFP9|ABA4_ARATH [Download]
MGFSSFISQPLSSSLSVMKRNVSAKRSELCLDSSKIRLDHRWSFIGGSRI
SVQSNSYTVVHKKFSGVRASWLTTTQIASSVFAVGTTAVLPFYTLMVVAP
KAEITKKCMESSVPYIILGVLYVYLLYISWTPETLKYMFSSKYMLPELSG
IAKMFSSEMTLASAWIHLLVVDLFAARQVYNDGLENQIETRHSVSLCLLF
CPVGIVSHFVTKAIINNQYK
MetaGO result 0.429
PF15461 >sp|Q9HNE6|BLH_HALSA [Download]
MGASPVALTPLTARARRTLARPALALGWVAISIAALPAITGVSLSPTARY
APLVASAVVFGMPHGAIDYLALPRAVTGTVTVRWLAVVGVLYLVLGGGYA
AAWFFAPVPAAFAFVAITWLHWGQGDLYPLLDFLDVDYLDTRPRRAATVL
IRGGLPMLVPLLGFPERYRSVVDAFAAPFGGSVGDLAVFDPRVRLWLGVA
FAAATVAVLAAGRRRTHSPGAWRVDAAETLLLWVFFFVVPPVFAVGVYFC
VWHSVRHVARAIAVDGSVHPSLRAGDILGPLARFGVEAAPMTAAALALGG
VLWWAVPNPPTTLESGAALYLVLIAVLTLPHVAVVTWMDRVQGVL
MetaGO result 0.414
PF16316 >k87_6521621_3_1292723_1 [Download]
MNELLLLLTDLEIFGADLIDKKDFFELLVKAVFNFLVIGYIVRYLYYPAT
KNKDYLFTYLLISVTVFCLCFLLENVKLELGFALGLFAIFGIIRYRTDPI
PIKEMTYLFIVIGVSVINALANKKISHAELLFTNLLVVAVTYGLEKIWLL
KHESRKTITYEKIELITPDKHDELVADLKERTGLNVTRVEIRKIDFLRDT
AQLRVFYFEETED
MetaGO result 0.549
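
The listing above can be read programmatically. The following is a minimal Python sketch, not part of the original page or its script package, that parses records in the layout shown here: a `PFxxxxx >name` line, one or more sequence lines, and a `MetaGO result` line ending in a number. It assumes that this trailing number is the value of the "Estimated TM-score" column. The names `Entry`, `parse_listing`, and `FOLD_CUTOFF` are hypothetical, and the 0.5 cutoff is a commonly used rule of thumb for a correct fold rather than a threshold stated on this page.

```python
import re
from typing import List, NamedTuple


class Entry(NamedTuple):
    pfam_id: str         # e.g. "PF04380"
    seq_name: str        # text after ">" on the header line
    sequence: str        # sequence lines joined into one string
    est_tm_score: float  # assumed to be the "Estimated TM-score" column


# Header line: "PF04380 >sp|Q46868|YQIC_ECOLI [Download]"
HEADER = re.compile(r"^(PF\d{5}) >(\S+)")
# Closing line of a record: "MetaGO result 0.699"
RESULT = re.compile(r"^MetaGO result (\d*\.\d+)$")

# Assumption: rule-of-thumb cutoff, not taken from this page.
FOLD_CUTOFF = 0.5


def parse_listing(text: str) -> List[Entry]:
    """Parse the flattened table into one Entry per Pfam family."""
    entries: List[Entry] = []
    pfam_id = seq_name = None
    chunks: List[str] = []
    for raw in text.splitlines():
        line = raw.strip()
        header = HEADER.match(line)
        result = RESULT.match(line)
        if header:
            pfam_id, seq_name = header.groups()
            chunks = []
        elif result and pfam_id is not None:
            score = float(result.group(1))
            entries.append(Entry(pfam_id, seq_name, "".join(chunks), score))
            pfam_id = None  # wait for the next header line
        elif pfam_id is not None and line:
            chunks.append(line)  # a sequence line inside a record
    return entries


# Three records copied from the listing above.
SAMPLE = """\
PF04380 >sp|Q46868|YQIC_ECOLI [Download]
MIDPKKIEQIARQVHESMPKGIREFGEDVEKKIRQTLQAQLTRLDLVSRE
EFDVQTQVLLRTREKLALLEQRISELENRSTEIKKQPDPETLPPTL
MetaGO result 0.699
PF06698 >sp|B0C1E1|RL29_ACAM1 [Download]
MALSKMADLKNLSVDEIDAKVQELKKELFDLRFQKATKETIQPHRFKHIR
HEIAQLLTLKQQQS
MetaGO result 0.827
PF07637 >k87_1921734_2_1214167_1 [Download]
SLPQFEKLYNEANEERIADNRILLARVKDVKAVARQPRGRPQRTSSTKPG
NVAIDWVEVRGPILPKTASAKSLVFFVTPKSDEEEPTVAREIIGRFAHRA
FRRPVDAAEVDRFVSLYEKVTARGDDFEDSVKLALAGVLVSPYFLFRVEA
GPGDDEFRLNDFQ
MetaGO result 0.365
"""

entries = parse_listing(SAMPLE)
# Families whose estimated TM-score reaches the assumed cutoff.
confident = [e.pfam_id for e in entries if e.est_tm_score >= FOLD_CUTOFF]

for e in entries:
    print(e.pfam_id, e.seq_name, len(e.sequence), e.est_tm_score)
print("At or above cutoff:", confident)
```

Passing the full text of the listing to `parse_listing` instead of `SAMPLE` should yield one entry per family, provided the copied text keeps the same line layout.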

Database and script package download
Reference
