#=GF ID Astro_VPg
#=GF AC PF19416.3
#=GF DE Astrovirus VPg protein
#=GF AU Bateman A;0000-0002-6982-4660
#=GF SE P0C6K4
#=GF GA 25.00 25.00;
#=GF TC 27.60 144.30;
#=GF NC 24.00 21.60;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch --cpu 4 -Z 75585367 -E 1000 HMM pfamseq
#=GF TP Family
#=GF RN [1]
#=GF RM 22787221
#=GF RT Identification of human astrovirus genome-linked protein (VPg)
#=GF RT essential for virus infectivity.
#=GF RA Fuentes C, Bosch A, Pinto RM, Guix S;
#=GF RL J Virol. 2012;86:10070-10078.
#=GF DR INTERPRO; IPR045836;
#=GF DR SO; 0100021; polypeptide_conserved_region;
#=GF CC This entry represents the presumed VPg protein from human
#=GF CC astrovirus. Viral genome-linked proteins (VPgs) are
#=GF CC virus-encoded small proteins that are covalently linked to the
#=GF CC 5' terminus of many RNA viral genomes through a phosphodiester
#=GF CC bond. Viral genome-linked proteins (VPgs) have been identified
#=GF CC in several single-stranded positive-sense RNA virus families.
#=GF CC The protein resulting from this putative VPg coding region is a
#=GF CC highly disordered protein [1]. A common feature of VPgs is that
#=GF CC they are rich in basic amino acids [mostly Lys (K), Gly (G), Thr
#=GF CC (T), and Arg (R)], which favors the interaction with the
#=GF CC negatively charged RNA. Tyr-693 at the conserved TEEEY-like
#=GF CC motif has been postulated to be the residue responsible for the
#=GF CC covalent linkage to viral RNA. Mutagenesis of Tyr-693 in the VPg
#=GF CC protein is lethal for HAstV replication [1].
#=GF SQ 4
#=GS NS1AB_HASV1/664-755 AC Q67726.1
#=GS NS1A_TASV1/795-889 AC Q9JH70.1
#=GS NS1A_HASV1/664-755 AC P0C6K4.1
#=GS NS1AB_TASV1/795-889 AC Q9JH69.3
NS1AB_HASV1/664-755 QKKKGKT-------KHGRGRVRRNLRKG----VKLLTEEEYRELLEKGLDRETFLDLIDRIIGERSGYPDY-DDEDYYDEDDDGWGMVGDDVEFDYTEVINFDQ
NS1A_TASV1/795-889 QKKKGKTKRTARGGKHALG--KKYLSKAHFSRMRMLTEEEYNKMVEDGFSPDEIKEVVDQL--REQAWQNYLIDNDIGEDDDLDW---YDDMLED--ERLNEEI
NS1A_HASV1/664-755 QKKKGKT-------KHGRGRVRRNLRKG----VKLLTEEEYRELLEKGLDRETFLDLIDRIIGERSGYPDY-DDEDYYDEDDDGWGMVGDDVEFDYTEVINFDQ
NS1AB_TASV1/795-889 QKKKGKTKRTARGGKHALG--KKYLSKAHFSRMRMLTEEEYNKMVEDGFSPDEIKEVVDQL--REQAWQNYLIDNDIGEDDDLDW---YDDMLED--ERLNEEI
#=GC seq_cons QKKKGKT.......KHuhG..++.LpKu....h+hLTEEEYpchlEcGhs.-phh-llDpl..ccpua.sY..DpDhh--DD.sW...hDDh..D..EhlN.-.
//