#=GF ID HOCHOB
#=GF AC PF17943.5
#=GF DE Homeobox-cysteine loop-homeobox
#=GF AU El-Gebali S;0000-0003-1378-5495
#=GF SE WormBase
#=GF GA 26.30 26.30;
#=GF TC 26.60 30.60;
#=GF NC 26.10 25.10;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch --cpu 4 -Z 75585367 -E 1000 HMM pfamseq
#=GF TP Domain
#=GF RN [1]
#=GF RM 26024448
#=GF RT The Homeobox Genes of Caenorhabditis elegans and Insights into
#=GF RT Their Spatio-Temporal Expression Dynamics during Embryogenesis.
#=GF RA Hench J, Henriksson J, Abou-Zied AM, Luppert M, Dethlefsen J,
#=GF RA Mukherjee K, Tong YG, Tang L, Gangishetti U, Baillie DL, Burglin
#=GF RA TR;
#=GF RL PLoS One. 2015;10:e0126947.
#=GF DR INTERPRO; IPR040960;
#=GF DR SO; 0000417; polypeptide_domain;
#=GF CC This domain is considered a double homeodomain, termed HOCHOB,
#=GF CC present in the C. elegans genome. Family members include CEH-91
#=GF CC and CEH-93 that share extended sequence similarity with each
#=GF CC other upstream of their typical HDs (Homeodomains). CEH-92,
#=GF CC another family member, has three copies of this domain. The
#=GF CC domain consists of two divergent HDs that are separated by a
#=GF CC linker of about 17 residues. The linker has a number of
#=GF CC conserved positions, two of which are cysteine residues
#=GF CC suggesting that they could be involved in metal binding. Hence,
#=GF CC the name HOCHOB (Homeobox-cysteine loop-homeobox). Furthermore,
#=GF CC there are two conserved histidine residues, one in each HD (in
#=GF CC CEH-91 displaced by two positions), and there is also a
#=GF CC conserved aspartic acid. It is speculated that the HOCHOB domain
#=GF CC is an evolutionary novelty that is derived from two HDs and may
#=GF CC have gained metal-binding capacity [1].
#=GF SQ 27
#=GS A0A261B459_9PELO/716-835 AC A0A261B459.1
#=GS A0A8R1HZN6_CAEJA/246-366 AC A0A8R1HZN6.1
#=GS G0PI76_CAEBE/420-539 AC G0PI76.1
#=GS A8XHE9_CAEBR/1451-1570 AC A8XHE9.1
#=GS Q95Q08_CAEEL/351-471 AC Q95Q08.2
#=GS Q86DC2_CAEEL/173-293 AC Q86DC2.5
#=GS A0A261A5G5_9PELO/83-202 AC A0A261A5G5.1
#=GS A0A261B611_9PELO/20-139 AC A0A261B611.1
#=GS G0PI76_CAEBE/103-223 AC G0PI76.1
#=GS A8XHE9_CAEBR/770-889 AC A8XHE9.1
#=GS Q95Q08_CAEEL/585-703 AC Q95Q08.2
#=GS A0A261B611_9PELO/237-354 AC A0A261B611.1
#=GS G0PI76_CAEBE/783-903 AC G0PI76.1
#=GS G0P3X6_CAEBE/156-276 AC G0P3X6.1
#=GS A0A261B459_9PELO/446-565 AC A0A261B459.1
#=GS G0MF91_CAEBE/98-219 AC G0MF91.1
#=GS G0P7S8_CAEBE/98-219 AC G0P7S8.1
#=GS E3MIE5_CAERE/827-946 AC E3MIE5.1
#=GS Q95Q08_CAEEL/80-199 AC Q95Q08.2
#=GS E3MXB6_CAERE/192-312 AC E3MXB6.1
#=GS A0A261B459_9PELO/132-251 AC A0A261B459.1
#=GS E3MIE6_CAERE/20-138 AC E3MIE6.1
#=GS E3MIE6_CAERE/277-394 AC E3MIE6.1
#=GS Q7K6J1_CAEEL/310-433 AC Q7K6J1.1
#=GS E3MIE5_CAERE/132-251 AC E3MIE5.1
#=GS E3MIE5_CAERE/476-595 AC E3MIE5.1
#=GS A8XHE9_CAEBR/1091-1210 AC A8XHE9.1
A0A261B459_9PELO/716-835 ...DFEIFQANRHPTILE...MINISYDTGVSYEQVFFRFKHFRKLNNEECSPGDP.CEKVEKFSEANPD...LG..GV..LNK..RTLGIIYEEFEHL.IHL.GYYLPLGYIH..MIMEKVDLPASVIRLQYGEWYRK-r.......
A0A8R1HZN6_CAEJA/246-366 ..s--ERFEENRHPNATE...MVKIAESCGVRYRTVFDDFENRRIAKRVKCEKDDA.CERIRTFFVRDNEp.lTY..NK..VKE..DIRAKLEEEIELH.LLS.RKRFSLGYVH..IIMEKTDLPASYIRGQYENWKKK-m.......
G0PI76_CAEBE/420-539 ...DFVYFEKNRHPSIQE...MINISKRMGITYRQVYYRFRDFRCAFKVNCPENDI.CRKVQKFFAQRAV...FS..GV..LDA..TSVNTMYDLFEHY.GHV.GKNPDIGYTH..LIAEKVDLPAFIVREQYKEWFAE-t.......
A8XHE9_CAEBR/1451-1570 ...DIDIFEANRHPSIQD...MVNVSHDVGVSYEQVYFRFKHLRTLKNEVCEDGDD.CQKVESFSQMNPV...FS..GT..LDA..KTLAYLHMEFDKL.IDF.GYYLPLGYIH..LILEKVNLPSRVIRAQYKEWYSKK........
Q95Q08_CAEEL/351-471 ...DFELFNRNRHPTIQE...MIGISCRTGVDYSRVFHRFQEFRAILKEQCPSTDDpCQKVANLFAAAGT...AA..QP..ERT..EIGDSVMEIFEIL.GSA.GRYPNEGYIH..VVAEKLNLSPGTVRKCYSDWLNR-k.......
Q86DC2_CAEEL/173-293 nas----FARNCHPTAIG...MQKLSERCGIRYKTIFDNFETIRKDGKIECETNDA.CCRIREFLSRETD...TT..SYieITN..EVQTVLEDEIELH.LLS.RKRFTLGYVH..VIMEKTDLAPSYIRAQYENWRRK-l.......
A0A261A5G5_9PELO/83-202 .sd---AFQLNNHPTALE...MQRLAEECAVRYRTVFDNFEERRKTKQIKCEDGDP.CEKIKRFIAMESE...QVphIT..LSE..EEKALLEEAIETH.LLS.RKKLSVGYLH..VIMEIVELPANYIRGQ--------cdsmrrr.
A0A261B611_9PELO/20-139 ...DQEFYDRNPHPSIEE...MICISKETNIDYTQIFKRFSELRLKNGEICSKNDT.CIKVFKYFNCDDT...FN..GT..IDK..RLLRMMNDEFEFV.GRR.-EELPIENLH..LLMQGTRLPEKTIVEEYTKWK---lqkd....
G0PI76_CAEBE/103-223 ...DLSYFEQNKHPSILH...MLKISEETGVRYDNVFYRFCQLRVMNNAVCHSNDS.CERVLKYFDPKTV...YA.iPS..LEE..KDMPLLHEEFKRF.GYA.GPVLCTGNVH..LIAEKFFLPPNVIREYFFEWYSN-s.......
A8XHE9_CAEBR/770-889 ...DKKYFEANRHPSITD...MIRISGSATVSYQQVFYRFSELRLFYSEKCSANDT.CRRVAHYFQSNIR...YK..GR..LTM..DSLEAMRQEFNKF.CHH.GPVLNIGSVH..LLVDKLEISPGMVQKCYKKWLEE-s.......
Q95Q08_CAEEL/585-703 ...DLELFEKNRHPSIQE...MIHISECFGISYEKVFIRFQDLRSIANEHCEPEDI.CEKVRRFSQMYPK...LS..GN..LDE..--IPALHVEFEKL.VRFgGTQLPIGYVH..LVMEKVELEPRVIREQFMEWFRR-r.......
A0A261B611_9PELO/237-354 ...DLEYFERNRHPSIQT...MINISIEVDVPYETVIKRYFELRMVVREKCKRNDT.CRKIFQFLEEKED...--..VL..LDE..KNKKLLEKEFSNI.KYA.DRYQVVGQFH..LIMDKICLPLHFVHRKYKEWFEM-r.......
G0PI76_CAEBE/783-903 ...DSEIFDKNRHPSIQD...MIDISQDVGVTYEQVFQRFQHLRKLKNETCTVGDI.CQKVEMYSQSYSS...LN..GV..LDE..TVRNTMKEEFAKFpIEF.GPILPLGHVH..LIMEKVGLPWSVVSVQYAEWYAE-m.......
G0P3X6_CAEBE/156-276 ..l--KSFNENPHPTALV...MQQLAEECFARYRTVVDYFENRRLTKQIKCERNDP.CERIKQFFMRENE...ETalTR..ISD..DVRSALEEEIEAH.LAL.KKKFSVGYLH..VIMERTNLPPTFIRNQYDN-----vkrrk...
A0A261B459_9PELO/446-565 ...DFQLFELNRHPSIGE...MLAISKMTGAKYKQVFYRFRDFRNALKEPCSRNDN.CRKVLKFFAHRTE...YD..GI..LHG..KSVNKMYNLFKKF.GAD.GKSPDIGYIH..LVAEQVDLPAFVVREQYKDWFMS-m.......
G0MF91_CAEBE/98-219 ..g--DLFSRNPHPSVSE...MNELTEFYSVKYRTIFETMEMKRNSQKIICNRGDN.CERIRDYFRQQSVepyAS..PT..LPP..DVKQLLETLIENQ.ILT.KKRFELGYVH..MVMEKTNLPFGAVRSSYVKWRK--rl......
G0P7S8_CAEBE/98-219 ..a--DLFSRNPHPSVSE...MNELTEFYSVKYRTIFETMELKRNSQQIICNRGDN.CERIREYFRQQSVepyAS..PT..IPP..DVKQLLETLIENQ.LFT.RKRFELGYVH..MVMEKTNLPFGAVRSSYAKWRR--rl......
E3MIE5_CAERE/827-946 ...DFEIFQANRHPTILE...MINISYDTGVSYEQVFFRFKHFRKLNNEVCSPGDP.CEKVEKFSEANPD...LG..GV..LDK..RTLGIIYEEFEHL.IHL.GYYLPLGYIH..MIMEKVDLSASVIRLQYGEWYSKR........
Q95Q08_CAEEL/80-199 ...DLRYFKEFKHPTIQQ...MIYISEDSGYRYERVFNRFCELRKLQGLKCLRNDT.CRRVAKYFSPLTV...YD..GA..DDV..TARNLMLQYFKRF.NYS.GPLPSTGCIH..LVVDRLVLPPDLIRDYYVNWYKF-s.......
E3MXB6_CAERE/192-312 .sd---AFQLNNHPTALE...MQRLAEECAVRYRTVFDNFEERRKTKQIKCEDGDP.CEKIKRFIAMESE...QVphIT..LSE..EEKSLLEEAIETN.LLS.RKKLSVGYLH..VIMEIVELPANYIRGQ--------cdsmrrrm
A0A261B459_9PELO/132-251 ...DVIYFGVNKHPSIHD...MIDISEEVFVPYDQVFHRFSELRIINQERCEKNDT.CERVSKYFISKIG...YN..GL..LTE..NTVAMMHHEFKNF.IHL.GPMTDTGRIH..LIVDKLDLPPDTVRKFYYKWFIK-s.......
E3MIE6_CAERE/20-138 ...DQEFYDRNPHPSIEE...MICISKETSIDYSQIFKRFSELRLKNGETCSKNDT.CIKVFKYFNCDDT...FN..GT..INK..RLLRMMNDEFEFV.GRR.-EKLPIENLH..LLMQATRLPEKTIVEEYTKWRQK-e.......
E3MIE6_CAERE/277-394 ...DLEYFERNRHPSIQT...MINISIEVDVPYETVIKRYFELRMVVRERCKRNDT.CRKIFQFLEEEED...--..VL..LDE..RNKKLLEKEFSGI.KYA.DRYQVVGQFH..LIMDKICLPLHFVHRKYKEWFEM-r.......
Q7K6J1_CAEEL/310-433 ..e--NLYQSNRHPSMEDinsLLDFSGDQ-RSIEAIFGFFERRRNDDNEMCTGDDA.CSSLRRYLEMELH...PH..GP..LLEppEIRCRMIKMFEKH.WAK.YRYV-FRDLHsyMFSDALGLPPKYIRQRFEHFKE--rk......
E3MIE5_CAERE/132-251 ...DVIYFGVNKHPSIHD...MIDISEEVFVPYDQVFHRFSELRIINHERCEKNDT.CERVSKYFISKIG...YN..GL..LTE..NTVAMMHHEFKNF.IHL.GPMTDTGRIH..LIVDKLDLAPDTVRKFYYKWFIK-s.......
E3MIE5_CAERE/476-595 ...DFQLFELNRHPSIGE...MLAISKMTGAKYKQVFYRFRDFRNALKEPCPRNDN.CRKVLKFFAHRTE...YD..GI..LHG..KSVNKMYDLFEKF.GAD.GKSPDIGYIH..LVAEQVDLPAFVVREQYKEWFMS-m.......
A8XHE9_CAEBR/1091-1210 ...DYQYFMKNQHPTIQE...MIDISKSTGTKYKQVFYRFLDFRCAFNVKCPPGDN.CDKVLKFFAHRAA...YD..GI..LNG..DTVNRMYDIFEML.GPA.GKNPDTGYMH..LVASEIDLPPFIIRDQYKEWLSEK........
#=GC seq_cons ...D.phFppN+HPSIp-...MlpISccsulpYcpVFtRFpchRthpptpCpcsDs.Cc+Vt+ahtpcss...hs..uh..Lsp..cshshhpctFEph.hht.schhslGalH..llhEKlsLPsphlRppYtcWhpp.p.......
//