#=GF ID CDCA
#=GF AC PF18484.5
#=GF DE Cadmium carbonic anhydrase repeat
#=GF AU El-Gebali S;0000-0003-1378-5495
#=GF SE PDB:3BOB
#=GF GA 29.20 29.20;
#=GF TC 30.80 29.90;
#=GF NC 28.80 21.40;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch --cpu 4 -Z 75585367 -E 1000 HMM pfamseq
#=GF TP Repeat
#=GF RN [1]
#=GF RM 17222138
#=GF RT Diversity of the cadmium-containing carbonic anhydrase in marine
#=GF RT diatoms and natural waters.
#=GF RA Park H, Song B, Morel FM;
#=GF RL Environ Microbiol. 2007;9:403-413.
#=GF RN [2]
#=GF RM 18322527
#=GF RT Structure and metal exchange in the cadmium carbonic anhydrase
#=GF RT of marine diatoms.
#=GF RA Xu Y, Feng L, Jeffrey PD, Shi Y, Morel FM;
#=GF RL Nature. 2008;452:56-61.
#=GF DR INTERPRO; IPR040931;
#=GF DR SO; 0001068; polypeptide_repeat;
#=GF CC This domain is the cadmium carbonic anhydrase repeat unit of the
#=GF CC beta-carbonic anhydrase of a marine diatom [1], that uses both
#=GF CC zinc and cadmium for catalysis of the reversible hydration of
#=GF CC carbon dioxide for use in inorganic carbon acquisition for
#=GF CC photosynthesis (thus being a cambialistic enzyme). Compared with
#=GF CC alpha- and gamma-carbonic anhydrases that use three histidines
#=GF CC to coordinate the zinc-atom, this beta-carbonic anhydrase has
#=GF CC two cysteines and one histidine, and rapidly binds cadmium [2].
#=GF SQ 17
#=GS R1FN91_EMIHU/129-211 AC R1FN91.1
#=GS A0A0F7KI59_9PROT/71-296 AC A0A0F7KI59.1
#=GS R1CFN9_EMIHU/328-466 AC R1CFN9.1
#=GS A0A2V3W9Q9_9PROT/86-305 AC A0A2V3W9Q9.1
#=GS R4PXW2_9BACT/39-250 AC R4PXW2.1
#=GS C1N5U2_MICPC/39-222 AC C1N5U2.1
#=GS R1EGY0_EMIHU/119-239 AC R1EGY0.1
#=GS B8CG97_THAPS/57-237 AC B8CG97.1
#=GS A0A3Q8WUF0_9ACTO/38-248 AC A0A3Q8WUF0.1
#=GS A0A850GFC3_9DELT/28-222 AC A0A850GFC3.1
#=GS C1ECX3_MICCC/38-227 AC C1ECX3.1
#=GS C1ECX3_MICCC/260-449 AC C1ECX3.1
#=GS A0A1I2EGV4_9PROT/81-299 AC A0A1I2EGV4.1
#=GS K0RDT8_THAOC/94-276 AC K0RDT8.1
#=GS F9ZEW7_9PROT/84-303 AC F9ZEW7.1
#=GS A6FY58_9DELT/28-226 AC A6FY58.1
#=GS A5KSD2_9BACT/39-259 AC A5KSD2.1
R1FN91_EMIHU/129-211 .......LGPQYGVGLHHDSSGYG---..------------W....GKAGARET--..........---LQE...YIDELgllQVIAAVPSVVA..TDGLKhpahFLQCTELK......-------------..............................---KAGFSA...M..SLAII...A.E--...----------------------------------.......------------------------------.....----------nrli.........................
A0A0F7KI59_9PROT/71-296 ......n-IPVESNLPEICVDGRTDKN.gSRKRVPSAAGGTL....SIVYGFDLGNsesv.dkkteIELTAE...VID-I..lKNKKHTTAVHG..DDHSD....-CGCGACAkapdiyRYIIKEIDAIATLtnnygisisdt........ekayvtktaekRLNQSDFFA...E..DRSSV...I.EAArshGADYEELVDAHNELGIALNVKAGTTVDRAAIRREfghqydlFVVDAWTFDNA-------------------.....----------arelnaenhpevadriskaiaiqn.....
R1CFN9_EMIHU/328-466 .......LGPQYGVGLHHDSSGYG---..------------W....GKAGARET--..........---LQE...YIDELgllQVIAAVPSVIA..TDGLKhpahFLECTELK......-------------..............................---KAGLSA...M..SLAII...A.EVA...SAVMIIFHGLALVGLLPLSAKLAKG---------.......FAGLVWWRRRP-------ATKPERPPKGAA.....GLRSNRLPK-a............................
A0A2V3W9Q9_9PROT/86-305 ......n-IPVSGVVPEICVDGRTDKD.gKRKEAPSAAGGTL....SIVYGSDLGGtpt....andIDEMQLtkrIINTL...KEKGHSTGVHG..DDHSS....-CGCGACAkaktiyQHIAERINDIAALasqlgidltev........ekssivrqaknRLSQSDFFA...E..DRAAI...L.RAAqenGAIFEELIGAHNELGIALNTKPGTTVDRSAIRAKygpqydmFVVDAWAFA-----------TAANDINSNG.....SDEYAQRI--an...........................
R4PXW2_9BACT/39-250 ......y-VPVNPKAKTRCIDGRHDPAldEGMLGPQVPGGAI....GGALAYRLGVdkddltrgtfYTDTET...MIDSY...LRLGLAPGGHR..DNREHe..hGVGCGAIDg....mDAILDCLLDSGLIednkrlvraildtrfdrdrylrvlgagtvlESHADQYFA..gR..DEIFT...VlEKK..sPGSVSVLEGHHNEKLLIVNFVPSTTLASNRFARDhgg.lqaFGYDIW------------------------.....----------rskqlar......................
C1N5U2_MICPC/39-222 .......LVDVSPTGYLKCVDGRAVDH..NNTAGPKMLGGVY....AIAHNRGKKT..........TADLEA...ICAEV...AKAGHVPSVHG..DGDGN....MLGCGYCK......LWLTGKFADLDPV..............................KGAPPTYSA...D..EGAAA...V.KSG...GGKVEMCKGKHAEKFVYINFVADKTVEPNGDNQK.......FVVDAWCAKKFKLDIPSYLVTAAATVERLG.....GPKIAKLVV-p............................
R1EGY0_EMIHU/119-239 .......LGPQYGVGLHHDSSGYG---..------------W....GKAGARET--..........---LQE...YIDELgllQVIAAVPSVIA..TDGLKhpahFLECTELK......-------------..............................---KAGLSA...M..SLAII...A.EVA...SAVMIIFHGLALVGLLPLSAKLAKG---------.......FAGLVW------------------------.....----------ftltagfli....................
B8CG97_THAPS/57-237 .......MVEVDPAGILKCVDGRGSDN..TRMAGPKMPGGIY....AIAHNRGTTS..........VDGLKE...ITKEV...ASKGHVPSVHG..DHSAD....MLGCGFFR......LWVTGEFDSMG--..............................-YPRPEFDA...D..QGAAA...V.KES...GGVIEMHHGSHTEKVVYINLVENKTLEPDENDQR.......FIVDGWAAIKFNLDVVKFLVAAAATVEMLG.....GPRIAKIVVA.............................
A0A3Q8WUF0_9ACTO/38-248 .dellpg----------RCIDGRRPTT.pYQTIAPCAPGASLslliGLAATRGIAD.........pMWGAGE...LAARL...TEAGMEPHIHTgpDDHSS....--GCGALD......-WAPDIIA-LANEteplvreca............ealdmrvptTYPLASLTG...EalDSAGIyriF.HED..pGSRVTPLRGVHSEIAIVVNHEKGTTIDQNTLDRLgg..vdvFDVDVWSLE---------------------.....----------vaadwladqfgvdreralsamnaftiatl
A0A850GFC3_9DELT/28-222 ......i-LDVDSEGLMKCVDGRPSSH..AAMNGPKTLGGVY....AIASMRGARD..........LEGLTQ...ATRDV...AAAGHVPSVHG..DDHAQp..pAMGCGYFK......LWKTGKLADLAPEggss.....................eglppGLEPPRYSA...E..EGSEA...V.RAA...GGEYETLTGAHEEQEVIINLVDSTTFAPNADSQR.......FVVDAWVAEKFGVDPARYLTAAAKTVELLC.....EVRKARIIV-d............................
C1ECX3_MICCC/38-227 .......LVDVDPAGFLKCVDGRGSDAvgKQQHGPKMLGGVY....GIAVNRGIKT..........TKELEA...ICQEV...KAAGHVPTVHG..DEGGI....-LGCGFCK......LWMNGKFTDEGGV..............................ATAPPDFTA...D..QGAAC...V.KAA...GGVVENHVAKHTEKYVILNFVPGKTFVPNGKDQR.......FIVDCWALGKFNLDITKYALTAAATVEKLNpgqkpCPWKAYIVT-p............................
C1ECX3_MICCC/260-449 .......LVKVSPNGFLKCVDGRGSDAkgDQQRGPKMLGGVY....GIAVNRGIKT..........TKELEA...ICQEV...KAAGHVPTVHG..DEGGI....-LGCGFCK......LWLNDKFADEGMV..............................NESKPKFSA...E..DGSKT...V.EKA...GGVVENHVGKHTEKVVYLNFIDGMTLEPNADDQR.......FIVDAWAAGKFNLDVPKYCVTAAATVEKLNpgqapCPWKAVLIV-p............................
A0A1I2EGV4_9PROT/81-299 ......n-IPVQGMVPEICVDGRTDKD.gKRKEAPSAAGGTL....SIVYGSDLGSaa.....nneNDEIQL...TTQTInllTSKGHATGVHG..DDHSS....-CGCGACAkaktiyQHITERINDIASLtsqyginltea........ekefivqkarnRLNQPGFFA...E..DRASV...L.HTAqqnGSIFEELVGVHNELGIALNTKPGTTVDRSAIRAKygpqydmFVVDAWTFG-----------TAAKEVNPAG.....NDKDTQRIT-k............................
K0RDT8_THAOC/94-276 .......LVPVESSGYLKCVDGRGVDH..TNTRGPKMLGGVY....AIAHNRGLKT..........TDDLQD...ICREV...SEKGYIPSVHG..DGDGN....MLGCGYCK......LWLTGKFADLDPV..............................KGAPPTYSA...D..DGAAA...V.KAK...GQ-VEMCKGSHAEKFVYINFVEDQTIEPNHDDQK.......FVVDAWAAMKFDLDVPSYLVTAAATVERLG.....GPKIAKLVV-p............................
F9ZEW7_9PROT/84-303 .......NVPVNGSVPEICVDGRTNKS.gYRKSAPCAAGGTL....SIVYGGDLGSnsa....atdINELQL...TTQTInklKEKGHQTGVHG..DDHSD....-CGCGACSkaptiyQHITERINDLASLisklginitgs........ekesivqqaknRLDQAGFFA...E..NRASI...I.QAAqdtGAAYEELVGQHNELGIALNTRVGTTVDRSAIRSKygpqydvFVVDAWAFG-----------TAAKDINSTA.....NEEDEQRIAK.............................
A6FY58_9DELT/28-226 .......IVDVGGDGLMKCVDGRPSFH..PAMNGPKTLGGVY....AIASMRDARD..........VAGLVQ...ATRDV...AAFGHVPSVHG..DQHAEp..pPMGCGYFK......LWKTGKLMNLAPEgkedef.................kaselpkGIVPPNYSA...E..EGSEI...V.LSE...GGVYETLEGAHEEQEVVINLVTDTTFEPSRESQR.......FVVDAWITDKFNIDAGRYLTVAAKTVELLS.....DVRKARIIVN.............................
A5KSD2_9BACT/39-259 ittderi--------PRRCIDGRSPAVggFHDAAPNSAGGSL....TLLVADELIGrhvhvegestAADLSR...LLKTL...KQKGYQVGGHT..DTHAHg..nTSGCGANDklpailQFVSEH-----DTviretaaalnv.......vvdepthrqiveGTKKSRTFAsgaE..ILSVL...R.AEA...GQNVDILDGDHNEGIVVINTRPGTTLDRNSLKKVygsdlqaFNVDIWSFGEA-------------------.....----------araiaredneaaqkaiamv..........
#=GC seq_cons ........lsVpssuhh+CVDGRsspt...phtuPphhGGsh....uIAhsRshts..........httLpp...hhpcl...pttGasPuVHG..D-cup.....hGCGhhc.......ahstchss.us...............................thstssasA...-..stAth...l.csu...GuhhE.hhGtHsEthlhlNhhsupTl-.st.ppc.......FlVDsWsht...........hstt..p..s.....t...t.h...s............................
//