KEGG   Homo sapiens (human): 29894
Entry
29894             CDS       T01001                                 
Symbol
CPSF1, CPSF160, HSU37012, MYP27, P/cl.18
Name
(RefSeq) cleavage and polyadenylation specific factor 1
  KO
K14401  cleavage and polyadenylation specificity factor subunit 1
Organism
hsa  Homo sapiens (human)
Pathway
hsa03015  mRNA surveillance pathway
Disease
H02041  Myopia
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09122 Translation
   03015 mRNA surveillance pathway
    29894 (CPSF1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03021 Transcription machinery [BR:hsa03021]
    29894 (CPSF1)
   03019 Messenger RNA biogenesis [BR:hsa03019]
    29894 (CPSF1)
Transcription machinery [BR:hsa03021]
 Eukaryotic type
  RNA polymerase II system
   Other transcription-related factors
    Transcription termination factor
     29894 (CPSF1)
Messenger RNA biogenesis [BR:hsa03019]
 Eukaryotic type
  mRNA processing factors
   3' end processing
    Cleavage and polyadenylation specificity factor (CPSF) complex
     29894 (CPSF1)
SSDB
Motif
Pfam: CPSF_A MMS1_N UspB
Other DBs
NCBI-GeneID: 29894
NCBI-ProteinID: NP_037423
OMIM: 606027
HGNC: 2324
Ensembl: ENSG00000071894
UniProt: Q10570
Structure
Position
8:complement(144393231..144409335)
AA seq 1443 aa
MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDAEALTKNDRSTEGK
AHREKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSL
HYFEEPELRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEG
QRSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSI
VAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVAL
NSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAF
HFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSK
KKRVDATAGWSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAV
GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA
PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTSGF
ATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMS
AEGHVTMFLLKSDSYGGRHHRLALHKPPLHHQSKVITLCLYRDLSGMFTTESRLGGARDE
LGGRSGPEAEGLGSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPADRDPAPFRAE
PTHWCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEARREEATRQ
GELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNIN
FREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGA
LRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRC
TAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLIS
PVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIL
IMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASEL
TGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDF
MVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAT
EGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRA
FRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVT
AHF
NT seq 4332 nt   +upstreamnt  +downstreamnt
atgtacgccgtgtacaaacaggcgcatccgcccaccggtctggagttctccatgtactgc
aacttcttcaacaacagcgagcgcaacctggtagtggccgggacctcgcagctctacgtg
taccgcctcaaccgcgacgccgaggctctgaccaagaatgacaggagcacagaggggaag
gcccaccgggagaagctcgagcttgctgcctccttctccttctttggcaacgtcatgtcc
atggccagcgtgcagctggcaggagccaagcgggatgccctgctcctaagcttcaaggat
gccaagctgtctgtggtggagtacgacccgggcacccatgacctgaagaccctgtcactg
cactactttgaggagcctgagcttcgggacgggtttgtgcagaatgtacacacgccgcga
gtgcgggtggaccccgacgggcgctgtgcagccatgcttgtctacggcacgcggctggtg
gtcctgcccttccgcagggagagcctggctgaggagcacgaggggctcgtgggtgagggg
cagaggtccagcttcctgcccagctacatcatcgacgtgcgggccctagacgagaagctg
ctcaacatcatcgacctgcagttcctgcatggctactacgagcctaccctcctcatcctg
tttgagcccaaccagacctggcctgggcgcgtggccgtgcggcaggacacgtgctccatt
gtggccatctcactgaacatcacgcagaaggtgcaccccgtcatctggtccctcaccagc
ctgccctttgactgcacccaggctctggctgtgcccaagcccataggtggggtggtggtg
tttgccgtcaactcgctgttgtacctgaaccagagcgtccccccgtatggcgtggctctc
aacagcctcaccacaggaaccacggctttcccgcttcgcacccaggagggtgtgcggatc
accctggactgcgcccaggccaccttcatctcctacgacaagatggtcatctccctcaag
ggcggcgagatctacgtgctgaccctcatcaccgacggcatgcgcagtgtccgagcgttc
cactttgacaaggcggccgccagcgtcctcaccaccagcatggtcaccatggagcccggg
tacctgttcctgggttctcgcctgggcaattccctcctcctcaagtacacggagaagctg
caggagcccccggccagtgctgtccgtgaggctgccgacaaggaagagcctccctcaaag
aagaagcgagtggatgcgacggccggctggtcagctgcgggtaagtcggtgccgcaggat
gaggtggacgagattgaagtgtacggcagcgaggcccagtcgggaacacagctggccacc
tactcctttgaggtgtgtgacagcatcctgaacattggaccctgtgccaatgccgccgtg
ggcgagcctgccttcctctctgaagagtttcagaacagccccgagccggacctggagatt
gtggtttgctccggccacgggaagaacggggctttgtcggtgctgcagaagagcatccgg
ccccaggtggtgacaacctttgagcttcccggctgctatgacatgtggacagtcatcgcc
ccggtgcgtaaggaggaggaggacaatcccaagggggagggcacagagcaggaacccagc
accacccctgaagcagacgacgacggccgcagacacggattcctgattctgagccgggaa
gactccaccatgatcctgcagacggggcaggagatcatggagctggacaccagtggcttc
gccactcagggccccacggtctttgctgggaacatcggggacaaccgctacattgtccaa
gtgtcaccactgggcatccgcctgctggaaggagtgaatcagctgcacttcatccccgtg
gacctgggcgcccccatcgtgcagtgcgccgtggccgacccctatgtggtcatcatgagt
gccgagggccacgtcaccatgttcctgctgaagagtgactcctacggtggccgccaccac
cgcctggcgctgcacaagcccccgctgcaccatcagtccaaggtgattacgctgtgcctg
taccgagacctcagcggcatgttcaccactgagagccgcctgggtggggcccgtgacgag
ctcgggggccgcagtggcccggaggccgagggcctgggctcagagactagccccacagtg
gatgacgaggaggagatgctgtatggggattcgggctccctcttcagccccagcaaggag
gaggcccgaagaagcagccagccccctgctgaccgggaccctgcacccttccgggcagag
cctacccactggtgcctgctggtgcgggagaatggcaccatggagatctaccagcttccc
gactggcggctggtgttcctggtgaagaacttccctgtggggcagcgggtccttgtggac
agctcctttggacagcccactacacagggcgaggcccgcagggaggaggccacgcgccag
ggggagctgcccctcgtcaaggaggtgctgctggtggcgctgggcagccgccagagcagg
ccctacctgctggtgcatgtggaccaagagctgcttatctacgaggccttcccccacgac
tctcagctcggccagggcaatctcaaagtccgctttaagaaggtccctcacaacatcaac
ttccgtgagaagaagccaaagccatccaagaagaaagcagaaggtggcggcgcagaggag
ggggctggggcccggggccgcgtggcgcgtttccgctacttcgaggatatttatggctac
tcaggggtcttcatctgcggcccctcccctcactggctcttggtgaccggccgaggggct
ctgcggctacaccccatggccatcgacggcccggtcgactctttcgctccattccacaat
gtcaactgtccccgcggcttcctgtacttcaacagacagggcgagctgaggatcagtgtc
ctgcctgcctacctgtcctatgatgccccatggcctgtcaggaagatcccgctgcgctgc
acggcccactatgtggcttaccacgtggagtctaaggtgtatgctgtggccaccagcacc
aacacgccgtgtgcccgcatcccacgcatgactggcgaggagaaggagtttgagaccatc
gagagagatgagcggtacatccacccccagcaggaggccttctccatccagctcatctcc
ccggtcagctgggaggctattcccaatgccaggatcgagctgcaggagtgggagcatgtg
acctgcatgaagacagtgtctctgcgcagtgaggagaccgtgtcgggcctcaaaggctac
gtggccgccgggacctgcctcatgcagggggaggaggtcacgtgccgagggcggatcttg
atcatggatgtgattgaggtggtgcccgagcctggccagcccttgaccaagaacaagttc
aaagtcctttacgagaaggagcagaaggggcccgtgaccgccctgtgccactgcaatggc
cacctggtgtcggccatcggccagaagattttcctgtggagcctgcgggccagcgagctg
acgggcatggccttcatcgacacgcagctctacatacaccagatgatcagcgtcaagaac
ttcatcctggcagccgacgtcatgaagagcatttcgctgctgcgctaccaggaggaaagc
aagacgctgagcctggtgtcgcgggatgccaagcccctggaggtgtacagcgtggacttc
atggtggacaatgcccagctgggttttctggtgtctgaccgcgaccgcaacctcatggtg
tacatgtacctgcccgaagccaaggagagtttcgggggcatgcgcctgctgcgtcgggca
gacttccacgtgggtgcccacgtgaacacgttctggaggaccccgtgccggggggccact
gaagggctcagcaaaaagtcggtcgtgtgggagaataagcacatcacgtggtttgccacc
ctggacggcggcatcgggctgctgctgcccatgcaggagaagacctaccggcggctgctg
atgctgcagaacgcgctgaccaccatgctgccacaccacgccggcctcaacccccgcgcc
ttccggatgctgcacgtggaccgccgcaccctccagaatgccgtgcgcaacgtgctggat
ggggagctgctcaaccgctacctgtacctgagcaccatggagcgcagcgagctagccaag
aagatcggcaccacaccagacataatcctggacgacttgctggagacggaccgcgtcacc
gcccacttctag

DBGET integrated database retrieval system