#=GF ID CagA_N
#=GF AC PF18971.4
#=GF DE CagA protein
#=GF AU Bateman A;0000-0002-6982-4660
#=GF SE Bateman A
#=GF GA 27.00 27.00;
#=GF TC 2049.50 2043.90;
#=GF NC 20.30 20.20;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -Z 75585367 -E 1000 --cpu 4 HMM pfamseq
#=GF TP Family
#=GF RN [1]
#=GF RM 22817985
#=GF RT Tertiary structure-function analysis reveals the pathogenic
#=GF RT signaling potentiation mechanism of Helicobacter pylori
#=GF RT oncogenic effector CagA.
#=GF RA Hayashi T, Senda M, Morohashi H, Higashi H, Horio M, Kashiba Y,
#=GF RA Nagase L, Sasaya D, Shimizu T, Venugopalan N, Kumeta H, Noda NN,
#=GF RA Inagaki F, Senda T, Hatakeyama M;
#=GF RL Cell Host Microbe. 2012;12:20-33.
#=GF DR INTERPRO; IPR045157;
#=GF DR SO; 0100021; polypeptide_conserved_region;
#=GF CC The Helicobacter pylori type IV secretion effector CagA is a
#=GF CC major bacterial virulence determinant and critical for gastric
#=GF CC carcinogenesis [1]. X-ray crystallographic analysis of the
#=GF CC N-terminal CagA fragment (residues 1-876) revealed that the
#=GF CC region has a structure comprised of three discrete domains.
#=GF CC Domain I constitutes a mobile CagA N terminus, while Domain II
#=GF CC tethers CagA to the plasma membrane by interacting with
#=GF CC membrane phosphatidylserine. Domain III interacts
#=GF CC intramolecularly with the intrinsically disordered C-terminal
#=GF CC region, and this interaction potentiates the pathogenic
#=GF CC scaffold/hub function of CagA [1].
#=GF SQ 1
#=GS CAGA_HELPY/1-876 AC P55980.1
CAGA_HELPY/1-876 MTNETIDQTRTPDQTQSQTAFDPQQFINNLQVAFIKVDNVVASFDPDQKPIVDKNDRDNRQAFDGISQLREEYSNKAIKNPTKKNQYFSDFIDKSNDLINKDNLIDVESSTKSFQKFGDQRYQIFTSWVSHQKDPSKINTRSIRNFMENIIQPPIPDDKEKAEFLKSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGGPTGGDWLDIFLSFIFNKKQSSDVKEAINQEPVPHVQPDIATTTTDIQGLPPEARDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVSLLYAGNGGFGDKHDWNATVGYKDQQGNNVATLINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIRNKVDFMEFLAQNNTKLDNLSEKEKEKFQNEIEDFQKDSKAYLDALGNDRIAFVSKKDTKHSALITEFNNGDLSYTLKDYGKKADKALDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKNPNKGVGATNGVSHLEAGFNKVAVFNLPDLNNLAITSFVRRNLENKLTAKGLSLQEANKLIKDFLSSNKELAGKALNFNKAVAEAKSTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQKDEIFALINKEANRDARAIAYTQNLKGIKRELSDKLEKISKDLKDFSKSFDEFKNGKNKDFSKAEETLKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDVIINQKVTDKVDNLNQAVSVAKAMGDFSRVEQVLADLKNFSKEQLAQQAQKNEDFNTGKNSELYQSVKNSVNKTLVGNGLSGIEATALAKNFSDIKKE
#=GC seq_cons MTNETIDQTRTPDQTQSQTAFDPQQFINNLQVAFIKVDNVVASFDPDQKPIVDKNDRDNRQAFDGISQLREEYSNKAIKNPTKKNQYFSDFIDKSNDLINKDNLIDVESSTKSFQKFGDQRYQIFTSWVSHQKDPSKINTRSIRNFMENIIQPPIPDDKEKAEFLKSAKQSFAGIIIGNQIRTDQKFMGVFDESLKERQEAEKNGGPTGGDWLDIFLSFIFNKKQSSDVKEAINQEPVPHVQPDIATTTTDIQGLPPEARDLLDERGNFSKFTLGDMEMLDVEGVADIDPNYKFNQLLIHNNALSSVLMGSHNGIEPEKVSLLYAGNGGFGDKHDWNATVGYKDQQGNNVATLINVHMKNGSGLVIAGGEKGINNPSFYLYKEDQLTGSQRALSQEEIRNKVDFMEFLAQNNTKLDNLSEKEKEKFQNEIEDFQKDSKAYLDALGNDRIAFVSKKDTKHSALITEFNNGDLSYTLKDYGKKADKALDREKNVTLQGSLKHDGVMFVDYSNFKYTNASKNPNKGVGATNGVSHLEAGFNKVAVFNLPDLNNLAITSFVRRNLENKLTAKGLSLQEANKLIKDFLSSNKELAGKALNFNKAVAEAKSTGNYDEVKKAQKDLEKSLRKREHLEKEVEKKLESKSGNKNKMEAKAQANSQKDEIFALINKEANRDARAIAYTQNLKGIKRELSDKLEKISKDLKDFSKSFDEFKNGKNKDFSKAEETLKALKGSVKDLGINPEWISKVENLNAALNEFKNGKNKDFSKVTQAKSDLENSVKDVIINQKVTDKVDNLNQAVSVAKAMGDFSRVEQVLADLKNFSKEQLAQQAQKNEDFNTGKNSELYQSVKNSVNKTLVGNGLSGIEATALAKNFSDIKKE
//