#=GF ID Endonuc-MspI
#=GF AC PF09208.14
#=GF DE Restriction endonuclease MspI
#=GF AU Sammut SJ;0000-0003-4472-904X
#=GF SE pdb_1sa3
#=GF GA 25.60 25.60;
#=GF TC 29.10 103.80;
#=GF NC 24.70 24.40;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -E 1000 --cpu 4 -Z 75585367 HMM pfamseq
#=GF TP Domain
#=GF CL CL0236
#=GF RN [1]
#=GF RM 15341737
#=GF RT An asymmetric complex of restriction endonuclease MspI on its
#=GF RT palindromic DNA recognition site.
#=GF RA Xu QS, Kucera RB, Roberts RJ, Guo HC;
#=GF RL Structure. 2004;12:1741-1747.
#=GF RN [2]
#=GF RM 22638584
#=GF RT Sequence, structure and functional diversity of PD-(D/E)XK
#=GF RT phosphodiesterase superfamily.
#=GF RA Steczkiewicz K, Muszewska A, Knizewski L, Rychlewski L, Ginalski
#=GF RA K;
#=GF RL Nucleic Acids Res. 2012;40:7016-7045.
#=GF DR INTERPRO; IPR015291;
#=GF DR SCOP; 1sa3; fa;
#=GF DR SO; 0000417; polypeptide_domain;
#=GF CC Members of this family of prokaryotic restriction endonucleases
#=GF CC recognise the palindromic tetranucleotide sequence 5'-CCGG and
#=GF CC cleave between the first and second nucleotides, leaving 2 base
#=GF CC 5' overhangs. They fold into an alpha/beta architecture, with a
#=GF CC five-stranded mixed beta-sheet sandwiched on both sides by
#=GF CC alpha-helices [1].
#=GF SQ 15
#=GS B7GLB6_ANOFW/142-394 AC B7GLB6.1
#=GS A0A1I1AKL7_9CLOT/4-263 AC A0A1I1AKL7.1
#=GS B8I860_RUMCH/140-395 AC B8I860.1
#=GS A0A1T4XDB8_9CLOT/139-388 AC A0A1T4XDB8.1
#=GS K0J324_AMPXN/140-395 AC K0J324.1
#=GS A0A1T4PRY9_9ENTE/143-402 AC A0A1T4PRY9.1
#=GS A5N0N6_CLOK5/140-395 AC A5N0N6.1
#=GS Q5QUR0_IDILO/179-434 AC Q5QUR0.1
#=GS A0A4U7J5S5_9FIRM/140-395 AC A0A4U7J5S5.1
#=GS A0A7L4ZPU9_9FLAO/140-415 AC A0A7L4ZPU9.1
#=GS A0A1M6GWM1_9CLOT/139-388 AC A0A1M6GWM1.1
#=GS I0X5S0_9SPIR/140-392 AC I0X5S0.1
#=GS A0A3P1SB27_9FIRM/140-392 AC A0A3P1SB27.1
#=GS A0A419SG00_9BACL/140-394 AC A0A419SG00.1
#=GS A0A0B0HFY8_9BACI/4-251 AC A0A0B0HFY8.1
B7GLB6_ANOFW/142-394 ..................................LERFIESKGLENEKTGKRKAVEGKNFEKQIVSLLNHPSNIEKWNQSNA---TEDGYFYPTFQQIMNLIGISKN..EKIIKIKATNE...IPKL......E....TGGQPKTDILLTITTD..RRSKETFTFSCKRSSATYVSIHEYTAESFIEVLDIK.DEK.LQEAFRKLQEVGGFTKLKEQYDDLYKVMEKE..LPN..LNKKLAQWAYAGIGGYG.--DQKIHWANYIIVFKNDTKELE.....MEKMDDYITNILQNV....-KGQMGTSFQWTYPSKKKGKAIQLKG.
A0A1I1AKL7_9CLOT/4-263 ...................kikeaykkynikalh--------------YGEIGDKLGDAYESFVVNVFSDKKYLSMFDKLDENKLDE-----FIFKSIIIKEKIEVS..-EIMKIEATNK...IPKR......D....NGGNAKTDVWVKIYTM..KGQVINIPISVKQTTVPKVAMAEYDVDTILNETGIK.NFE.VERLMKKHQCDASAINFSKEE---KEILTRE..LEKdnNKDKLLRWILTMSPEKK.--YNDIRVPRYLIKFQLKRETLDvietgVYDIDEYIHHITTDRrgkpAKGGFGTGLAWTYATGSKGRKIQFKG.
B8I860_RUMCH/140-395 ..................................LRQKIVEKASQNIAQGLRANVLGNDAETSIVNLLNDLKNKALWNDYQNAQQTIKSSTYKIYKEILEKIDLKEG.fDKILEVTATND...IPLL......S....NRGKPKTDVSVTIKTN..TKEL-IRNISIKNTREKTVTIHEGSVSDLISALKLS.ESDpLSQALIHFEKVGSKKKLIAEHPNSDKILEEN..LKL..YNRELIEFFIFGLHSP-.LVNDKIQMVDLIIF----TNKFA.....VWNRDDYIKHYIEEY...sGKGQFGTPFKWTYPSKKRGQKIQIKG.
A0A1T4XDB8_9CLOT/139-388 ...............................lya---MIEKKYLADRITGQQKALQGLNFEEQIEVILNSQKNFAKWANIDE---LETGLFYPYFKQIMDSINITNP..VIIKQISATRD...IELL......P....SGGKPKTDVLLIVTFN..DGSTKNYTFSCKRTSSDWVSVHEYPVDKFIDVLKIT.DKK.LIQTLELFQELGGLKALG---KELTQYLEKE..LPK..YNRRLSLWVYGGVGGDG.--NPETQWADYIITYQNETSEFK.....IHKLEEYIEDILT-I....NDGHFGTPFRWTYPSGGKGKRIQLKG.
K0J324_AMPXN/140-395 ..................................LRQKIIEKATQNIEQGLRSNILGSDAETSIVNLLNDKRNANLWNDYEALKHTVKSSTYHIYKSILGKLNLSEG.iDKIIEVSATDN...IPLL......S....NRGKPKTDIYVKIKTE..KLKI-CSSISVKNTSKKTVTVHEGNVSDIIKALNLL.ESNpLTQALKDFEKVGSKKNLLLKHPDSCRILDEK..LKY..YNKELVGLFIFGLHSP-.LVNNNAQIADLIIF----TNNFA.....IWSQDEYIDYYINEY...cTKGQFGTPFRWTYPSKKRGQKIQIKG.
A0A1T4PRY9_9ENTE/143-402 ..................................LREKILAHISKNTSQGIKSNILGKDAEVEIVKLLNNVNNRKLWNDYSNYQKTIKSSTFDIYKEILSTAGLTFG.kHLISNVVATDS...IPKL......NidgqKKGYPKTDVSFKVTSD..KGES-SHTISVKNTEAKMVTVHEGRVSDLITALQIKvDSE.LATALRQFEIGGSKKYLEDKYPELLSVMNNQ..LRK..YNNTLTKFFFFGDNSP-.LVYDPIQIADMILY----TKNFS.....IWTKQDYVEYYNNSY...sNCGQFGTPFSWTYPSKKRGKKIQVKG.
A5N0N6_CLOK5/140-395 ..................................LRQKIVEKASQNIAQGLRANVLGNDAETSIVNLLNDLKNKALWNDYQNAQQTIKSSTYKIYKEILEKIDLKEG.fDKILEVTATND...IPLL......S....NRGKPKTDVSVTIKTN..TKEL-IRNISIKNTREKTVTIHEGSVSDLISALKLS.ESDpLSQALIHFEKVGSKKKLIAEHPNSDKILEEN..LKL..YNRELIEFFIFGLHSP-.LVNDKIQMVDLIIF----TNKFA.....VWNRDDYIKHYIEEY...sGKGQFGTPFKWTYPSKKRGQKIQIKG.
Q5QUR0_IDILO/179-434 ...............................qrs---FSDELAQVEEKTGSYYGKSGNKFERFLVEELNEPNNLSAYQANEKACFEYD----TVLDSVVSELPIEKQ..-HIQLLEATDT...ITKL......K....NGGSPKTDIHLRVYLS..PKEFHIANISVKNTIATRVSCHDYQAKDFVRVIAPD.DSD.FRNLVEIFQEAGSWKEFNSLTSEKGLVLDTDevLNP..YMEKIIQWAVTGQHDADyLIDEKIQLANFILTRNADSGACK.....MQSAKSYIEELKAAI....GKGR-GAPFSWTYPSKRRGQRIQLKM.
A0A4U7J5S5_9FIRM/140-395 ..................................LRQKIVEKASQNIAQGLRANVLGNDAETSIVNLLNDLKNKALWNDYQNAQQTVKSSTYKIYKEILEKIDLKEG.fDKILEVTATND...IPLL......S....NRGKPKTDVSVTIKTN..TKEL-IRNISIKNTREKTVTIHEGSVSDLISALKLS.ESDpLSQALIHFEKVGSKKKLIAEHPNSDKILEEN..LKL..YNRELIEFFIFGLHSS-.LVNDKIQMVDLIIF----TNKFA.....VWNRDDYIKHYIEEY...sGKGQFGTPFKWTYPSKKRGQKIQIKG.
A0A7L4ZPU9_9FLAO/140-415 hlftlsefveflqtyyeeksslfediqseqkfks-----------IREAGSFYGIQGNKLEKEISEWLNNKTYLKRYKAIKE-YSTYD----IIIDTILKKYQLNKN..-DIIKIHTTNS...IPLL......K....NGGNPKTDLFIQITTI..DGEIISETISIKNTTKKRVSCHDYKADDFIRVLNCA.GTK.LETYLKLFQNYPTYSEFEDNLPINYTIEEFSnlMKG..KAKLLTEWCLKGSHDIEnLIDSSKQISNFVLI--NSNGKIH.....FFEYDKYIDYIMKNS....-TLKFSTPFSWTYPSKQRGKRIQLKM.
A0A1M6GWM1_9CLOT/139-388 ...............................fyt---MVEEKHLRGMITGQQKALQGLNFEEQIEVILNNQKNFAKWANIDE---LETGLFFPYFKQIMDSINVTNP..AVIKEISATRD...IELL......P....SGGKPKTDVLLIVTFN..DESTKNYTFSCKRTSSDWVSVHEYPVDKFIDVLEIT.DKK.LIQTLELFQEVGGMKALGK---ELTQYLEKE..MPK..YNRRLSLWVYGGVGGDG.--NPETQWADYIITYQNETSEFK.....IHKLDEYIDNILK-I....NDGHFGTPFRWTYPSGGKGKRIQLKG.
I0X5S0_9SPIR/140-392 ................................fy--EFIENYANKQLSVGKRKDKEGNNFERRISNILSYAENLNKYKTNDP---RITGLNYPFFKKIMDCLNIDIKqvVGIKAICDSDI...IGRL......M....TGGKPKTDIIVTVYFSddYKQSQSFTISCKKTRFTKVSVHQYTADSFANVLAPE.NEK.LRVLLRAFQTAGNLKTFGLQNQEE---LTKE..LRP..YLEKLIRWVLGGYGGQ-.-IRDELQLANYILV--NDNSEIY.....IHTLDGYINLLVSSN....TKGNFGTPFQWTYASKRKGKDIQLK-c
A0A3P1SB27_9FIRM/140-392 .............................lynll-----EASATSSLTSGQIKDLQGRNFESLVATTLSNTDNLHKWKTNNKVST---GMHYNIFKAIVSYLNLDSS..-VIEKIKATSDskvIGKL......P....SGGNPKTDVMMDIIFK..DSTQSRVTISCKRSSDKKVSVHEYSAESFANILDSK.NSS.LRLLLENFQATPSLSSFGETNID---ALTKE..LAP..YSDKLSQWVLAGIHGYG.--DKNKHWASHILTYDNNDNTIS.....FHDINTYIELLKNNG....GSGHFGTLFTWTYPSKKRGKSIQLK-c
A0A419SG00_9BACL/140-394 ..............................lyfl----VEEKAFERFGVGARKDRQGKAFEKLLVDILEHPANLEIWNGEGD---KDLGFNYPWFKKIISIYGAKAK..EKLSFIEATDE...IPDLppkagqR....RGGKPKTDILVRLIFD..SAPAETFTISSKRTSSDWVAIHQYSADTYIDVLGIA.EDE.LKSALLELERVGAPTMIAPIYQ---QYITEQ..LPN..YYERLAKWAYAGIGGEG.--DPATQMAEYFTIYKNETKELE.....ISHLDDYIARILTEV....-EGQLGSPFRFTYTG-TRGTNIQLRG.
A0A0B0HFY8_9BACI/4-251 .......................ikkvwkmktle----------------KEKATEGKNFEKRLVSILNHSSNIKKWNQSSS---TEDGYFYPTFQQIMSLIGLSRN..EKILKIEATDK...IPRL......E....TGGQPKTDILLTITTD..RRSEEMFTFSCKRSSAKYVSIHEYTAESFIEVLDIK.DKK.LQEAFLKLQEVGGFTELKKRYDALYKVMEKE..LPI..LNKKLAEWAYAGIGGYG.--DPKIHWANYIIVFKNDTKTLE.....MEKIDVYIANILQNV....-EGQMGTSFQWTYPSKKKGKAIQLKG.
#=GC seq_cons ................................h....hl.p+u.pshtpGpcpslpGpsaEppIVslLNchcNhshWsshss...s.cu.hYslaKpIlsplslsps..tcIhcIpATsc...IPhL......s....sGGpPKTDlhlpIpos..cpph.shTISsKpToscpVolHEYss-caIslLslp.-sc.LppsLcpFQclGohKpLttphsp.hplL-cc..Lph..YN++LscWslsGltu.t...ssclQhAcaIIh..spTschp.....lashD-YIcpllpph....scGpFGTPFpWTYPSK+RG++IQLKG.
//