#=GF ID WSS_VP
#=GF AC PF12175.12
#=GF DE White spot syndrome virus structural envelope protein VP
#=GF AU Mistry J;0000-0003-2479-5322
#=GF AU Gavin OL;
#=GF SE pdb_2edm
#=GF GA 24.00 24.00;
#=GF TC 24.50 24.60;
#=GF NC 21.10 19.80;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch --cpu 4 -E 1000 -Z 75585367 HMM pfamseq
#=GF TP Domain
#=GF RN [1]
#=GF RM 17409146
#=GF RT Crystal structures of major envelope proteins VP26 and VP28 from
#=GF RT white spot syndrome virus shed light on their evolutionary
#=GF RT relationship.
#=GF RA Tang X, Wu J, Sivaraman J, Hew CL;
#=GF RL J Virol. 2007;81:6709-6717.
#=GF DR INTERPRO; IPR022004;
#=GF DR SO; 0000417; polypeptide_domain;
#=GF CC This family of proteins is found in viruses. Proteins in this
#=GF CC family are approximately 210 amino acids in length. There is a
#=GF CC conserved NNT sequence motif. These proteins are structural
#=GF CC envelope proteins in viruses. This is the beta-barrel C-terminal
#=GF CC domain. There is a protruding N-terminal domain which completes
#=GF CC the proteins. Three of the four major envelope proteins in white
#=GF CC spot syndrome virus share sequence homology with each other and
#=GF CC are present in this family: VP24, VP26 and VP28. VP19 is the
#=GF CC other major envelope protein but shares no sequence homology
#=GF CC with the other three. These proteins are essential for entry
#=GF CC into cells of the crustacean host.
#=GF SQ 7
#=GS Q77JA7_WSSVS/1-208 AC Q77JA7.1
#=GS Q77J04_WSSVS/1-200 AC Q77J04.1
#=GS Q9ICB7_WSSV/1-200 AC Q9ICB7.1
#=GS Q8QTG3_WSSV/1-208 AC Q8QTG3.1
#=GS Q9ICG6_WSSV/1-203 AC Q9ICG6.1
#=GS Q77J23_WSSVS/1-203 AC Q77J23.1
#=GS A0A268SP15_9BACL/8-69 AC A0A268SP15.1
Q77JA7_WSSVS/1-208 ..MHM---WGVYAAILAGL.....TLILVVISIVVTNIELNKKL--.......DKKDKDAYPVESEIINLTINGvaRGNHFNFVNGTLQTRNYGKVYVAgQGTSDSELVKKKGDIILTSLlgDGDHTLNVNKAESKELELYARVYNNTKRDITVDSVSLSPGL.......NATGREFSANKFVLYFKPTVLKKNRINTLVFGATFDEDIddtnrhyllSMRFSPGNdLFKVG-----ek......
Q77J04_WSSVS/1-200 ..MDLSFTLSVVSAILAIT.....AVIAVFIVIFRYHNTVTKTIEThtdnietNMDENLRIPVTAEV-------..GSGYFKMTDVSFDSDTLGKIKIR.NGKSDAQMKEEDADLVITPV..EG-RALEVTVGQNLTFEGTFKVWNNTSRKINITGMQMVPKI.......NP-SKAFVGSSNTSSFTPVSIDEDEVGTFVCGTTFGAPI.........AAT-AGGN.LFDMYVHVT-ysg.....
Q9ICB7_WSSV/1-200 ..MDLSFTLSVVSAILAIT.....AVIAVFIVIFRYHNTVTKTIEThtdnietNMDENLRIPVTAEV-------..GSGYFKMTDVSFDSDTLGKIKIR.NGKSDAQMKEEDADLVITPV..EG-RALEVTVGQNLTFEGTFKVWNNTSRKINITGMQMVPKI.......NP-SKAFVGSSNTSSFTPVSIDEDEVGTFVCGTTFGAPI.........AAT-AGGN.LFDMYVHVT-ysg.....
Q8QTG3_WSSV/1-208 ..MHM---WGVYAAILAGL.....TLILVVISIVVTNIELNKKL--.......DKKDKDAYPVESEIINLTINGvaRGNHFNFVNGTLQTRNYGKVYVAgQGTSDSELVKRKGDIILTSLlgDGDHTLNVNKAESKELELYARVYNNTKRDITVDSVSLSPGL.......NATGREFSANKFVLYFKPTVLKKNRINTLVFGATFDEDIddtnrhyllSMRFSPGNdLFKVG-----ek......
Q9ICG6_WSSV/1-203 ..MEFGNLTNLDVAIIAILsiaiiALIVIMVIMIVFNTRVGRSVVA.......NYDQMMRVPIQRRAKVMSIRG..--------ERSYNT-PLGKVAMK.NGLSDKDMKDVSADLVISTV..TAPRTDPAGTGAENS-NMTLKILNNTGVDLLINDITVRPTViagnikgNTMSNTYFSSKDIKSSSSKITLIDVCSKFEDGAAFEATM.........NIGFTSKN.VIDIKDEIKK........
Q77J23_WSSVS/1-203 ..MEFGNLTNLDVAIIAILsiaiiALIVIMVIMIVFNTRVGRSVVA.......NYDQMMRVPIQRRAKVMSIRG..--------ERSYNT-PLGKVAMK.NGLSDKDMKDVSADLVISTV..TAPRTDPAGTGAENS-NMTLKILNNTGVDLLINDITVRPTViagnikgNTMSNTYFSSKDIKSSSSKITLIDVCSKFEDGAAFEATM.........NIGFTSKN.VIDIKDEIKK........
A0A268SP15_9BACL/8-69 sf--LSGIYGVAAAILGLI.....ALIWVIIVLSNTNTYLKMLI-H.......DRKERQKIPLTKALAN-----..-----------------------.--------------------..-----------------------------------------.......---------------------------------------.........--------.----------plvnqerq
#=GC seq_cons ..Mchu.hhuVsuAILAll.....ALIhVhIllhshNspls+pl.t.......shc-p.+lPlppclhs...............phohpo.shGKlhht.pGhSDtphhc.puDlllosl..pu.+s..sshutp.p.phhh+lhNNTthcl.lsshph.Ptl.......Ns.upta.usp.h...psh.h..s.hsph..GssFttsh.........sht.sstN.lhchh.............
//