Gene: TET1

Basic information

Tag Content
Uniprot ID Q8NFU7; Q5VUP7; Q7Z6B6; Q8TCR1; Q9C0I7;
Entrez ID 80312
Genbank protein ID CAD28467.3; AAH53905.1; AAM88301.1; BAB21767.1;
Genbank nucleotide ID NM_030625.2
Ensembl protein ID ENSP00000362748
Ensembl nucleotide ID ENSG00000138336
Gene name Methylcytosine dioxygenase TET1
Gene symbol TET1
Organism Homo sapiens
NCBI taxa ID 9606
Cleft type
Developmental stage
Data sources Homology search
Reference
Functional description Dioxygenase that catalyzes the conversion of the modified genomic base 5-methylcytosine (5mC) into 5-hydroxymethylcytosine (5hmC) and plays a key role in active DNA demethylation. Also mediates subsequent conversion of 5hmC into 5-formylcytosine (5fC), and conversion of 5fC to 5-carboxylcytosine (5caC). Conversion of 5mC into 5hmC, 5fC and 5caC probably constitutes the first step in cytosine demethylation. Methylation at the C5 position of cytosine bases is an epigenetic modification of the mammalian genome which plays an important role in transcriptional regulation. In addition to its role in DNA demethylation, plays a more general role in chromatin regulation. Preferentially binds to CpG-rich sequences at promoters of both transcriptionally active and Polycomb-repressed genes. Involved in the recruitment of the O-GlcNAc transferase OGT to CpG-rich transcription start sites of active genes, thereby promoting histone H2B GlcNAcylation by OGT. Also involved in transcription repression of a subset of genes through recruitment of transcriptional repressors to promoters. Involved in the balance between pluripotency and lineage commitment of cells: it plays a role in embryonic stem cell maintenance and inner cell mass cell specification. Plays an important role in the tumorigenicity of glioblastoma cells. TET1-mediated production of 5hmC acts as a recruitment signal for the CHTOP-methylosome complex to selective sites on the chromosome, where it methylates H4R3 and activates the transcription of genes involved in glioblastomagenesis (PubMed:25284789). Binds preferentially to DNA containing cytidine-phosphate-guanosine (CpG) dinucleotides over CpH (H=A, T, and C), hemimethylated-CpG and hemimethylated-hydroxymethyl-CpG (PubMed:29276034).
Sequence
MSRSRHARPS RLVRKEDVNK KKKNSQLRKT TKGANKNVAS VKTLSPGKLK QLIQERDVKK 60
KTEPKPPVPV RSLLTRAGAA RMNLDRTEVL FQNPESLTCN GFTMALRSTS LSRRLSQPPL 120
VVAKSKKVPL SKGLEKQHDC DYKILPALGV KHSENDSVPM QDTQVLPDIE TLIGVQNPSL 180
LKGKSQETTQ FWSQRVEDSK INIPTHSGPA AEILPGPLEG TRCGEGLFSE ETLNDTSGSP 240
KMFAQDTVCA PFPQRATPKV TSQGNPSIQL EELGSRVESL KLSDSYLDPI KSEHDCYPTS 300
SLNKVIPDLN LRNCLALGGS TSPTSVIKFL LAGSKQATLG AKPDHQEAFE ATANQQEVSD 360
TTSFLGQAFG AIPHQWELPG ADPVHGEALG ETPDLPEIPG AIPVQGEVFG TILDQQETLG 420
MSGSVVPDLP VFLPVPPNPI ATFNAPSKWP EPQSTVSYGL AVQGAIQILP LGSGHTPQSS 480
SNSEKNSLPP VMAISNVENE KQVHISFLPA NTQGFPLAPE RGLFHASLGI AQLSQAGPSK 540
SDRGSSQVSV TSTVHVVNTT VVTMPVPMVS TSSSSYTTLL PTLEKKKRKR CGVCEPCQQK 600
TNCGECTYCK NRKNSHQICK KRKCEELKKK PSVVVPLEVI KENKRPQREK KPKVLKADFD 660
NKPVNGPKSE SMDYSRCGHG EEQKLELNPH TVENVTKNED SMTGIEVEKW TQNKKSQLTD 720
HVKGDFSANV PEAEKSKNSE VDKKRTKSPK LFVQTVRNGI KHVHCLPAET NVSFKKFNIE 780
EFGKTLENNS YKFLKDTANH KNAMSSVATD MSCDHLKGRS NVLVFQQPGF NCSSIPHSSH 840
SIINHHASIH NEGDQPKTPE NIPSKEPKDG SPVQPSLLSL MKDRRLTLEQ VVAIEALTQL 900
SEAPSENSSP SKSEKDEESE QRTASLLNSC KAILYTVRKD LQDPNLQGEP PKLNHCPSLE 960
KQSSCNTVVF NGQTTTLSNS HINSATNQAS TKSHEYSKVT NSLSLFIPKS NSSKIDTNKS 1020
IAQGIITLDN CSNDLHQLPP RNNEVEYCNQ LLDSSKKLDS DDLSCQDATH TQIEEDVATQ 1080
LTQLASIIKI NYIKPEDKKV ESTPTSLVTC NVQQKYNQEK GTIQQKPPSS VHNNHGSSLT 1140
KQKNPTQKKT KSTPSRDRRK KKPTVVSYQE NDRQKWEKLS YMYGTICDIW IASKFQNFGQ 1200
FCPHDFPTVF GKISSSTKIW KPLAQTRSIM QPKTVFPPLT QIKLQRYPES AEEKVKVEPL 1260
DSLSLFHLKT ESNGKAFTDK AYNSQVQLTV NANQKAHPLT QPSSPPNQCA NVMAGDDQIR 1320
FQQVVKEQLM HQRLPTLPGI SHETPLPESA LTLRNVNVVC SGGITVVSTK SEEEVCSSSF 1380
GTSEFSTVDS AQKNFNDYAM NFFTNPTKNL VSITKDSELP TCSCLDRVIQ KDKGPYYTHL 1440
GAGPSVAAVR EIMENRYGQK GNAIRIEIVV YTGKEGKSSH GCPIAKWVLR RSSDEEKVLC 1500
LVRQRTGHHC PTAVMVVLIM VWDGIPLPMA DRLYTELTEN LKSYNGHPTD RRCTLNENRT 1560
CTCQGIDPET CGASFSFGCS WSMYFNGCKF GRSPSPRRFR IDPSSPLHEK NLEDNLQSLA 1620
TRLAPIYKQY APVAYQNQVE YENVARECRL GSKEGRPFSG VTACLDFCAH PHRDIHNMNN 1680
GSTVVCTLTR EDNRSLGVIP QDEQLHVLPL YKLSDTDEFG SKEGMEAKIK SGAIEVLAPR 1740
RKKRTCFTQP VPRSGKKRAA MMTEVLAHKI RAVEKKPIPR IKRKNNSTTT NNSKPSSLPT 1800
LGSNTETVQP EVKSETEPHF ILKSSDNTKT YSLMPSAPHP VKEASPGFSW SPKTASATPA 1860
PLKNDATASC GFSERSSTPH CTMPSGRLSG ANAAAADGPG ISQLGEVAPL PTLSAPVMEP 1920
LINSEPSTGV TEPLTPHQPN HQPSFLTSPQ DLASSPMEED EQHSEADEPP SDEPLSDDPL 1980
SPAEEKLPHI DEYWSDSEHI FLDANIGGVA IAPAHGSVLI ECARRELHAT TPVEHPNRNH 2040
PTRLSLVFYQ HKNLNKPQHG FELNKIKFEA KEAKNKKMKA SEQKDQAANE GPEQSSEVNE 2100
LNQIPSHKAL TLTHDNVVTV SPYALTHVAG PYNHWV 2136
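
The sequence block above is laid out in groups of ten residues, with a running position counter at the end of each line. A minimal sketch, assuming the block is available as plain text in that layout, shows how it could be joined into one contiguous residue string and how the counters could be checked. The helper names `parse_sequence_block` and `counters_are_consistent` are hypothetical, and only the first two lines of the record are used as sample input.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a grouped, counter-annotated sequence block into one string.

    Assumption: residues are uppercase letters and everything else
    (spaces, newlines, position counters) can be discarded.
    """
    return "".join(re.findall(r"[A-Z]+", block))


def counters_are_consistent(block: str) -> bool:
    """Check that each line's trailing counter equals the cumulative length."""
    total = 0
    for line in block.strip().splitlines():
        *groups, counter = line.split()
        total += sum(len(group) for group in groups)
        if int(counter) != total:
            return False
    return True


# First two lines of the TET1 sequence, copied from the record above.
example = """
MSRSRHARPS RLVRKEDVNK KKKNSQLRKT TKGANKNVAS VKTLSPGKLK QLIQERDVKK 60
KTEPKPPVPV RSLLTRAGAA RMNLDRTEVL FQNPESLTCN GFTMALRSTS LSRRLSQPPL 120
"""

sequence = parse_sequence_block(example)
print(len(sequence), counters_are_consistent(example))
```

Applied to the full block, the same check would be expected to end at the final counter shown in the record (2136).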

Abbreviations:
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Gene expression information

Gene expression in different tissues (GTEx V7)

  

Gene expression in different tissues (ENCODE)

  

Protein structural annotations

3D structure in PDB database



Protein disorder information

Orthologous information

Relation | Gene symbol | Entrez ID | UniProt ID | Cleft type | Developmental stage | Species | Evidence
1:1 ortholog | TET1 | 102186373 | A0A452E1W7 | - | - | Capra hircus | Prediction
1:1 ortholog | TET1 | 80312 | Q8NFU7 | - | - | Homo sapiens | Prediction
1:1 ortholog | Tet1 | - | Q3URK3 | CPO | - | Mus musculus | Publication
1:1 ortholog | TET1 | 450497 | H2Q1Z7 | - | - | Pan troglodytes | Prediction
1:1 ortholog | TET1 | - | F1SUI3 | - | - | Sus scrofa | Prediction
1:1 ortholog | Tet1 | - | F1LUQ3 | - | - | Rattus norvegicus | Prediction
1:1 ortholog | tet1 | 101883702 | F1R3C6 | - | - | Danio rerio | Prediction

Other genetic variants/mutations


Disease or phenotype associated information


Gene Ontology (GO)/biological pathways

GO:Molecular Function

GO ID GO Term Evidence
GO:0003677 DNA binding IDA
GO:0005506 iron ion binding IDA
GO:0005506 iron ion binding IBA
GO:0008270 zinc ion binding IDA
GO:0008327 methyl-CpG binding IDA
GO:0070579 methylcytosine dioxygenase activity IDA
GO:0070579 methylcytosine dioxygenase activity IMP
GO:0070579 methylcytosine dioxygenase activity IBA

GO:Biological Process

GO ID GO Term Evidence
GO:0001826 inner cell mass cell differentiation ISS
GO:0006211 5-methylcytosine catabolic process IBA
GO:0006325 chromatin organization IEA
GO:0006493 protein O-linked glycosylation ISS
GO:0008284 positive regulation of cell population proliferation IMP
GO:0019827 stem cell population maintenance ISS
GO:0031062 positive regulation of histone methylation IMP
GO:0045944 positive regulation of transcription by RNA polymerase II ISS
GO:0045944 positive regulation of transcription by RNA polymerase II IBA
GO:0070989 oxidative demethylation IBA
GO:0080111 DNA demethylation IMP
GO:0080111 DNA demethylation IBA
GO:0090310 negative regulation of methylation-dependent chromatin silencing IMP

GO:Cellular Component

GO ID GO Term Evidence
GO:0005634 nucleus IC
GO:0005634 nucleus IBA
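
Each GO row above pairs a GO ID and term with an evidence code (IDA, IBA, IMP, ISS, IEA or IC in this record). A minimal sketch, assuming rows are available as text lines that end in one of those codes, splits a row into its three fields and tallies the codes. The names `parse_go_row` and `EVIDENCE_CODES` are hypothetical, and the suffix match is a heuristic that covers only the codes seen in this record. It accepts the code with or without a separating space.

```python
import re
from collections import Counter

# Assumption: only the evidence codes that occur in this record are handled.
EVIDENCE_CODES = ("IDA", "IBA", "IMP", "ISS", "IEA", "IC")

# GO ID, then the term (matched lazily), then an evidence code at line end.
ROW = re.compile(
    r"^(GO:\d{7})\s+(.*?)\s*(" + "|".join(EVIDENCE_CODES) + r")$"
)


def parse_go_row(line: str) -> tuple[str, str, str]:
    """Split one GO row into (GO ID, term, evidence code)."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognised GO row: {line!r}")
    return match.group(1), match.group(2), match.group(3)


# Sample rows copied from the record; the code may or may not be space-separated.
rows = [
    "GO:0003677 DNA binding IDA",
    "GO:0070579 methylcytosine dioxygenase activity IMP",
    "GO:0045944 positive regulation of transcription by RNA polymerase IIISS",
    "GO:0005634 nucleusIC",
]

parsed = [parse_go_row(row) for row in rows]
evidence_counts = Counter(code for _, _, code in parsed)
print(parsed)
print(evidence_counts)
```

Because the term is matched lazily and the code is anchored at the end of the line, a term that itself ends in capital letters (such as "RNA polymerase II") is kept intact as long as the remaining suffix is a listed code.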

Reactome Pathway

Reactome ID Reactome Term Evidence
R-HSA-212165 Epigenetic regulation of gene expression IEA
R-HSA-5221030 TET1,2,3 and TDG demethylate DNA IEA
R-HSA-74160 Gene expression (Transcription) IEA

Drugs and compounds information


Functional annotations

Keywords

Keyword ID Keyword Term
KW-0002 3D-structure
KW-0010 Activator
KW-0156 Chromatin regulator
KW-0160 Chromosomal rearrangement
KW-0223 Dioxygenase
KW-0238 DNA-binding
KW-0325 Glycoprotein
KW-0408 Iron
KW-0479 Metal-binding
KW-0539 Nucleus
KW-0560 Oxidoreductase
KW-0597 Phosphoprotein
KW-0621 Polymorphism
KW-1185 Reference proteome
KW-0678 Repressor
KW-0804 Transcription
KW-0805 Transcription regulation
KW-0862 Zinc
KW-0863 Zinc-finger

Interpro

InterPro ID InterPro Term
IPR024779 2OGFeDO_noxygenase_dom
IPR040175 TET1/2/3
IPR002857 Znf_CXXC

PROSITE

PROSITE ID PROSITE Term
PS51058 ZF_CXXC

Pfam

Pfam ID Pfam Term
PF12851 Tet_JBP
PF02008 zf-CXXC

Protein-protein interaction

Protein-miRNA interaction