GenomeNet

Database: Pfam
Entry: ENCP4
LinkDB: ENCP4
Original site: ENCP4 
#=GF ID   ENCP4
#=GF AC   PF08967.14
#=GF DE   Putative type 4B encapsulin shell protein
#=GF PI   DUF1884;
#=GF AU   Mistry J;0000-0003-2479-5322
#=GF AU   Sammut SJ;0000-0003-4472-904X
#=GF SE   pdb_1she
#=GF GA   28.60 28.60;
#=GF TC   28.60 30.30;
#=GF NC   28.50 25.00;
#=GF BM   hmmbuild HMM.ann SEED.ann
#=GF SM   hmmsearch --cpu 4 -E 1000 -Z 75585367 HMM pfamseq
#=GF TP   Domain
#=GF CL   CL0373
#=GF RN   [1]
#=GF RM   34362927
#=GF RT   Large-scale computational discovery and analysis of
#=GF RT   virus-derived microbial nanocompartments.
#=GF RA   Andreas MP, Giessen TW;
#=GF RL   Nat Commun. 2021;12:4748.
#=GF DR   INTERPRO; IPR014418;
#=GF DR   SO; 0000417; polypeptide_domain;
#=GF CC   Proteins in this entry may be the encapsulin shell protein in a
#=GF CC   type 4 A-domain encapsulin nanocompartment system. It has been
#=GF CC   shown that bacterial/archaeal encapsulin-like systems and
#=GF CC   HK97-type viruses share a common ancestor and it is likely that
#=GF CC   encapsulins have evolved from HK97-type phages [1].
#=GF SQ   8
#=GS ENCP4_PYRFU/4-81   AC P61996.1
#=GS ATPG_PROM4/92-176  AC A9BCD8.1
#=GS Q5JFJ5_THEKO/6-96  AC Q5JFJ5.1
#=GS Q8U2E0_PYRFU/4-95  AC Q8U2E0.1
#=GS F0LIR3_THEBM/1-79  AC F0LIR3.1
#=GS C6A2H1_THESM/2-93  AC C6A2H1.1
#=GS Q5JEV9_THEKO/2-81  AC Q5JEV9.1
#=GS F0LMI5_THEBM/2-93  AC F0LMI5.1
ENCP4_PYRFU/4-81              ....NNIMSQVKEIIEAAIKELEDDGFEPDIILAGPIFIRYLPEDVR-...........--LK..VYEIEEL..GSDAIIADSKYLGQ.IKKAAKRISIDP..
ATPG_PROM4/92-176             gynt-----NIIKRTEQRYNELKRQGFTPDLVLIGRKAIGYFQNRSSQ...........YKIRafFQDLEQVptSKDAESVTSEILAEfLSKSTDRIEV--iy
Q5JFJ5_THEKO/6-96             ....RGDLIRILSSVEEKANELKLEGFEPDVVLVGKEAYEFIKEQVNEefggeeevfelSGLR..VRILEEL..GGDAVVVDTKALGY.AP-ATRRFKVVP..
Q8U2E0_PYRFU/4-95             ....RGDLIRILGEIEEKMNELKMDGFNPDIILFGREAYNFLSNLLKKemeeegpfthvSNIK..IEILEEL..GGDAVVIDSKVLGL.VPGAAKRIKIIK..
F0LIR3_THEBM/1-79             .mke-----EIYELVKTTINELREEGLNPDIMLAGPEFLKHASEVLKE...........CHLA..VYEIKEL..NSDAVIADSQYLGQ.LKRASRRISI--gl
C6A2H1_THESM/2-93             ....RGELIRILGSVEEKANELKLDGFEPDVVLFGKEAYEFLKNQVNQefggedsvseiSGLS..IRVVDEF..GKDAVVVDSKVLGL.GLGGAKRLKVIK..
Q5JEV9_THEKO/2-81             ..pp---MSEILEVIERIIGELRSEGMNPNIMLAGPQFIEYSKDALKQ...........INLK..IYRIEEL..GYDAVIADSNYLGQ.IKKASRRVSVEP..
F0LMI5_THEBM/2-93             ....RGDLIRILSAVEEKANELKMDGFEPDIVLFGKEAYEFLKAQVDEefgederitevSGLK..VRVLEEL..GRDAVVIDSKMLGI.GLGGARRIRIIK..
#=GC seq_cons                 ....ps.h.cILptlEc+hNELKh-GFpPDIlLhG+EAhcalpsplcp...........ssL+..lphlEEL..GpDAVllDSKhLG..lhtAu+RIcl....
//
DBGET integrated database retrieval system