#=GF ID HMUDK_HMUD1
#=GF AC PF18748.5
#=GF DE 5-hmdU DNA kinase
#=GF PI Ploopntkinase1;
#=GF AU Iyer LM;0000-0002-4844-2022
#=GF AU Aravind L;0000-0003-0771-253X
#=GF AU Burroughs AM;0000-0002-2229-8771
#=GF AU El-Gebali S;0000-0003-1378-5495
#=GF SE Iyer LM
#=GF GA 27.00 27.00;
#=GF TC 27.00 42.60;
#=GF NC 26.70 24.10;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -Z 75585367 --cpu 4 -E 1000 HMM pfamseq
#=GF TP Family
#=GF CL CL0023
#=GF RN [1]
#=GF RM 23814188
#=GF RT Computational identification of novel biochemical systems
#=GF RT involved in oxidation, glycosylation and other complex
#=GF RT modifications of bases in DNA.
#=GF RA Iyer LM, Zhang D, Burroughs AM, Aravind L;
#=GF RL Nucleic Acids Res. 2013;41:7635-7655.
#=GF RN [2]
#=GF RM 34522950
#=GF RT Pathways of thymidine hypermodification.
#=GF RA Lee YJ, Dai N, Muller SI, Guan C, Parker MJ, Fraser ME, Walsh
#=GF RA SE, Sridar J, Mulholland A, Nayak K, Sun Z, Lin YC, Comb DG,
#=GF RA Marks K, Gonzalez R, Dowling DP, Bandarian V, Saleh L, Correa
#=GF RA IR, Weigele PR;
#=GF RL Nucleic Acids Res. 2022;50:3001-3017.
#=GF RN [3]
#=GF RM 29555775
#=GF RT Identification and biosynthesis of thymidine hypermodifications
#=GF RT in the genomic DNA of widespread bacterial viruses.
#=GF RA Lee YJ, Dai N, Walsh SE, Muller S, Fraser ME, Kauffman KM, Guan
#=GF RA C, Correa IR Jr, Weigele PR;
#=GF RL Proc Natl Acad Sci U S A. 2018;115:E3116.
#=GF DR INTERPRO; IPR040924;
#=GF DR SO; 0100021; polypeptide_conserved_region;
#=GF CC 5-hmdU DNA kinase (HMUDK) and 5-hmdU DNA kinase 1 (HDMU1) are
#=GF CC P-loop nucleotide kinases that phosphorylates
#=GF CC 5-hydroxymethyluracil (5hmdU) into
#=GF CC 5-phosphomethyl-2'-deoxyuridine (5-PmdU) on DNA as a step in the
#=GF CC pathway leading to thymidine hypermodifications in the viral
#=GF CC genome [2]. HMUDK also transfers glutamate to
#=GF CC 5-pyrophosphoryloxymethyldeoxyuridine (5-PPmdU) to produce
#=GF CC 5-Nalpha-glyutamylthymidine (Nalpha-GluT) [2,3]. These
#=GF CC modifications probably prevent degradation of viral genome by
#=GF CC the host restriction-modification antiviral defense system
#=GF CC [1,3].
#=GF SQ 24
#=GS K7YIY0_9CAUD/13-208 AC K7YIY0.1
#=GS A0A4P2WVH0_9CAUD/1-111 AC A0A4P2WVH0.1
#=GS A0A0C5PQK6_9CAUD/14-209 AC A0A0C5PQK6.1
#=GS A0A482MZM3_9CAUD/14-209 AC A0A482MZM3.1
#=GS S6CGF3_9CAUD/14-209 AC S6CGF3.1
#=GS A0A2Z3DPE0_9CAUD/14-209 AC A0A2Z3DPE0.1
#=GS A0A2D0WBE1_9CAUD/1-165 AC A0A2D0WBE1.1
#=GS K4I4U2_9CAUD/14-209 AC K4I4U2.1
#=GS I0J2S0_9CAUD/14-209 AC I0J2S0.1
#=GS A0A6G6XUL2_9CAUD/14-209 AC A0A6G6XUL2.1
#=GS A0A2K8I3V5_9CAUD/2-171 AC A0A2K8I3V5.1
#=GS HMUD1_BPSAV/14-209 AC E1XT70.1
#=GS C8XUG4_9CAUD/14-209 AC C8XUG4.1
#=GS A0A7S9SMU6_9CAUD/14-209 AC A0A7S9SMU6.1
#=GS HMUDK_BPS10/23-198 AC F8WQ30.1
#=GS A0A1W6DYL9_9CAUD/11-218 AC A0A1W6DYL9.1
#=GS A0A248H6W9_9CAUD/14-209 AC A0A248H6W9.1
#=GS A0A0A0YSQ1_9CAUD/11-206 AC A0A0A0YSQ1.1
#=GS A0A7S5UQA0_9CAUD/1-180 AC A0A7S5UQA0.1
#=GS A0A0N7CCV4_9CAUD/1-178 AC A0A0N7CCV4.1
#=GS A0A7D3UYA3_9CAUD/23-199 AC A0A7D3UYA3.1
#=GS A0A5B9N1P7_9CAUD/14-209 AC A0A5B9N1P7.1
#=GS A9J510_BPPYU/2-170 AC A9J510.1
#=GS A0A2D0W9H6_9CAUD/1-166 AC A0A2D0W9H6.1
K7YIY0_9CAUD/13-208 ..................HTFVRPPKVEVPRQPVGDIYYVKGCNGSGKSTVPSYLSEKDPDAYV...CLL.........G....SR.V.....LLTVFPSFG.ILAFGKYDK....T....KSK.GVDSLSDYAEIELALALSER...EDLVKYD.AFFEGIIPATILHTW......IEKLN.....RPAR...RLVTLFLDTPLATCLSRVDTRNGG..E....DYNRDLVAEKFRRIESHRVRHKELFPTVPAGMIRSEGKTMEQMVE.............
A0A4P2WVH0_9CAUD/1-111 ..................----------------------------------------------...---.........-....--.-.....---------.---------....-....---.----------MLFALSIADL...PEYQVYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVQDHRQRHKGLFPTVAAGMMKSHGLTVDQAV-m............
A0A0C5PQK6_9CAUD/14-209 ..................HLFVKPPAVEGEYSARGELYYIKGSNGSGKSTVPSYLAENDPQAYV...VTH.........N....SK.I.....MLTVCPSYN.IVCVGKYDK....S....KSK.GVDSLKDTEQMLFALSIADQ...PEYLKYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVHNHRGRHKELFPTVCAGMMKSHGLTVDQAV-m............
A0A482MZM3_9CAUD/14-209 ..................HLFHKPPAVVGNYPARGELYYVKGSNGSGKSTVPSWMAENDPQAYV...VTH.........N....GK.V.....MLTVCPSFN.IICVGKYDK....S....KSK.GVDSLKDTEQMLFAVSITEQ...PEYIGYD.VIFEGIIPATLLNTW......IERLN.....RPTR...RLVVLFLDTPKEICLARVSSRNGG..E....DFKHELVLEKWKRVNSHRERHKELFPNITAGMMKSNGLTVDQAVS.............
S6CGF3_9CAUD/14-209 ..................HLFHKPPAVVGEYPARGELYYVKGSNGSGKSTVPSWMAENDPQAYV...VTH.........N....GK.V.....MLTVCPSFN.IICVGKYDK....S....KSK.GVDSLKDTEQMLFAVSITEQ...PEYIGYD.VIFEGIIPATLLNTW......IERLN.....RPTR...RLVVLFLDTPKEVCLARVSSRNGG..E....DFKHELVLEKWKRVNSHRERHKELFPNIPAGMMKSNGLTVDQAVS.............
A0A2Z3DPE0_9CAUD/14-209 ..................HLFVKPPAVEGEYPARGELYYVKGSNGSGKSTVPSYLAENDPQAYV...VAR.........D....SK.I.....MLTVCPSYN.IICIGKYDK....S....KSK.GVDSLKDTEQMLFALSIADQ...PEYLKYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....EFNESLVVEKWERVHDHRQRHKGLFPTVPAGMMKSHGLTVDQAV-m............
A0A2D0WBE1_9CAUD/1-165 ...............mtk------------------LINVRGCNGSGKTXLLRCLGRGEGVTVVegsVPD.........H....KP.I.....PITYTPE-G.FAIIGDYTPaaagA....TTA.GLDRIKTQAAAKAIIEFAAS...NTAVR-A.VLFEGVVVSTIYGPW......QE-WS.....KANG...GMIWAFLDTPLEVCLKRIQERNGG..K....PIKEDQVAAKHKTIAR--VRE------------------------kaladg.......
K4I4U2_9CAUD/14-209 ..................HTFIKPPVVKGIYPARGQLYYVKGSNGSGKSTVPSQLAERDPQAYV...VTH.........D....GK.I.....MLTVCPSFN.VVCIGKYDK....S....KSK.GVDSLKDTEQMLFALSIADL...PEYQVYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVQDHRQRHKGLFPTVAAGMMKSHGLTVDQAV-m............
I0J2S0_9CAUD/14-209 ..................HTFTKPPVVKGIYPARGQLYYIKGSNGSGKSTVPSQLAERDHQAYV...VTH.........D....GK.I.....MLTVCPSFN.VVCVGKYDK....S....KSK.GVDSLKDTEQMLFALSIADL...PEFQVYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVHDHRERHKGLFPKVAAGMMKSHGLTVDQAV-m............
A0A6G6XUL2_9CAUD/14-209 ..................HTFIKPPVVKGIYPARGQLYYVKGSNGSGKSTVPSQLAERDPQAYV...VTH.........D....GK.I.....MLTVCPSFN.VVCIGKYDK....S....KSK.GVDSLKDTEQMLFALSIADL...PEYQVYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVQDHRQRHKGLFPTVAAGMMKSHGLTVDQAV-m............
A0A2K8I3V5_9CAUD/2-171 ...............kyi--------------------NVRGCNGSGKTTLLRCLA-RDPLCRV...INVi......vpD....HKpI.....PVTYAP-DG.IAIIGDYTP....AaagaTTA.GLDRIKTQAAAKAVAELVGR...DPDVK-A.VLFEGVVVSTIYGPW......QE-WS.....KANG...GMIWAFLDTPLEVCLKRIQERNGG..K....PIKEDQVADKHRTIAR--VRDK-----------------------aladgetvrdih.
HMUD1_BPSAV/14-209 ..................HLFVKPPAVEGEYPARGELYYVKGSNGSGKSTVPSYLAENDPQAYV...VTY.........N....GK.I.....MLTVCPSYN.IICIGKYDK....S....KSK.GVDSLKDTEQMLFALSIADQ...PEYLKYD.VLFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCVSRVKSRNGG..A....DFNESLVVEKWERVHDHSQRHKGLFPTVPAGMMKSNGLTIEQAVF.............
C8XUG4_9CAUD/14-209 ..................HTFIKPPVVKGIYPARGQLYYVKGSNGSGKSTVPSQLAERDPQAYV...VTH.........D....GK.I.....MLTVCPSFN.VVCIGKYDK....S....KSK.GVDSLKDTEQMLFALSIADL...PEYQVYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVQDHRQRHKGLFPTVAAGMMKSHGLTVDQAV-m............
A0A7S9SMU6_9CAUD/14-209 ..................HLFVKPPAVEGEYPARGELYYVKGSNGSGKSTVPSYLAENDPQAYV...VTY.........N....GK.I.....MLTVCPSYN.IICIGKYDK....S....KSK.GVDSLKDTEQMLFALSIADQ...PEYLKYD.VLFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCVSRVKSRNGG..A....DFNESLVVEKWERVHDHRQRHKGLFPTVPAGMMKSNGLTIEQAVF.............
HMUDK_BPS10/23-198 ......fgykiqyrqdlv------------------LVNIRGCNGAGKSTVPMQMLQTDPGAFM...LTL.........D....GK.D.....KATVFPSYG.FVAMGRYF-....S....KTG.GLDGFKNNEETLKVLKLLWE...---LPFS.IIMEGVISSTIFSTYcdlfkeLEQRN.....NPKR...AVGVLNLLPPFEVCLERIKKRTPEkfD....SIKKDQIEGKWRTVNRNAQKFR-----------------------dagvt........
A0A1W6DYL9_9CAUD/11-218 fkkvelsftppgvmnmpi------------------FFEVRGTNGSGKSTVPFMLQAGDPEAFK...AVE.........KdpvlGD.L.....VLTCSPNSR.TIIVGSYPI....G....RAVgGCDTISGSEKIEGHLAFAKRlleTHSGKFDkVFFEGIMTSTSNSRYtk..flLQELK.....VPQE...QLVVGWCNTPLEVCIERIYGRT-G..K....EFNHSLVEGKHDQLSRQPQNHRDLFPDVNRVV-------------ydcmcsketmlen
A0A248H6W9_9CAUD/14-209 ..................HTFTKPPVVKGIYPARGQLYYIKGSNGSGKSTVPSQLAERDHQAYV...VTH.........D....GK.I.....MLTVCPSFN.VVCVGKYDK....S....KSK.GVDSLKDTEQMLFALSIADL...PEFQVYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVHDHRERHKGLFPKVAAGMMKSHGLTVDQAV-m............
A0A0A0YSQ1_9CAUD/11-206 ..................HGLMKPKPVQTRYPARGELFYVKACNGAGKSTIPSYCAERDPEAYT...VSQ.........D....GR.I.....LLTVMPSYG.MLAFGKYDK....S....KSK.GVDSLKDYDEMKKAVELSEL...EEFIGYT.AFFEGVIPSTILHTW......VEYLN.....RPAR...PLTTVFIDTDLETCLSRVNSRNGG..A....EFNADLVTEKFNRVMSHKTRHKELFPSVPAVTIRSQGITIDEMVS.............
A0A7S5UQA0_9CAUD/1-180 ..............miin---------------------IRGTNGSGKTTLARTFQ-DHPSARV...VNLvdypaptkrD...pAA.IkfvtgVVTDLPGVGsVCCVGSYSQ....A....Q-G.GLDTVPNFELQRMAIAHA--...-AAICDH.VICEGVLASTVAGSW......LEFFV.....RTQMaglKVAVCYLDTPLDVCLARIKERQER..AgkvrDIKEDLVADKVKAIDATRAK-------------------------fdaagist.....
A0A0N7CCV4_9CAUD/1-178 ..................------------------MYYIKGSNGSGKSTVPSYLAENDPQAYV...VTH.........N....SK.I.....MLTVCPSYN.IVCVGKYDK....S....KSK.GVDSLKDTEQMLFALSIADQ...PEYLKYD.VIFEGIIPSTLLSSW......IPRLT.....RPPR...ELVVLFMDTPLETCIARVKSRNGG..A....DFNESLVVEKWERVHDHRQRHKGLFPTVPAGMMKSDGLTVEQAVF.............
A0A7D3UYA3_9CAUD/23-199 ..........aeylkkng----------AKYPDGMKMVNIRGCNGAGKSTVPMEFLFNDPAVYL...LTY.........E....GK.D.....VATVFPTYG.WVAMGKY-R....T....KTG.GLDGYKNGEQTRNMLQLL--...-WCLPFN.IIMEGVIASTIYSTY......ADLFNeykshKIKR...EIGVMNLLPPFEVIKDRLEKRNGG..K....EIKWEQVESKYRTVKKNAQK-------------------------fleagli......
A0A5B9N1P7_9CAUD/14-209 ..................HLFHKPPAVVGEYPARGELYYVKGSNGSGKSTVPSWMAENDPQAYV...VTH.........N....GK.V.....MLTVCPSFN.IICVGKYDK....S....KSK.GVDSLKDTEQMLFAVSITEQ...PEYIGYD.VIFEGIIPATLLNTW......IERLN.....RPTR...RLVVLFLDTPKEVCLARVSSRNGG..E....DFKHELVLEKWKRVNSHRERHKELFPNIPAGMMKSNGLTVDQAVS.............
A9J510_BPPYU/2-170 ...............kyi--------------------NVRGCNGSGKTTLLRCLA-RDPLCRV...INVi......vpD....HKpI.....PVTYAP-DG.IAIIGDYTP....AaagaTTA.GLDRIKTQAAAKAVAELVGR...DPDVK-A.VLFEGVVVSTIYGPW......QE-WS.....KANG...GMIWAFLDTPLEVCLKRIQERNGG..K....PIKEDQVADKHRTIAR--VRDK-----------------------aladgetvrdi..
A0A2D0W9H6_9CAUD/1-166 ................mt-----------------KLINVRGCNGSGKTTLLRCLGRGEGVTVVegsVPD.........H....KP.I.....PITYTPE-G.FAIIGDYTPaaagA....TTA.GLDRIKTQAAAKAIIEFAAS...NTAVR-A.VLFEGVVVSTIYGPW......QE-WS.....KANG...GMIWAFLDTPLEVCLKRIQERNGG..K....PIKEDQVAAKHKTIAR--VRE------------------------kaladge......
#=GC seq_cons ............................s.h.shhpLaYVKGoNGSGKSTVPShLAEpDPpAYV...Vop.........c....uK.I.....hLTVsPSas.llslGKYDK....S....KSK.GVDSLKDsEpMLhALulA-p...sEaltYD.VIFEGIIPSTlLuoW......I.+Lo.....RPsR...cLVVLFhDTPLEsCluRVcSRNGG..t....DFpEsLVsEKWcRVpcHRpRHKtLFPslsAGMh+SpGlTl-QhV.h............
//