Gene: COL4A4

Basic information

Tag: Content
UniProt ID: P53420; A8MTZ1; Q53RW9; Q53S42; Q53WR1
Entrez ID: 1286
GenBank protein ID: BAA25065.1; BAA04214.1; CAA56943.1; AAY24061.1; CAA76763.1; AAY14670.1
GenBank nucleotide ID: NM_000092.4; XM_005246281.3
Ensembl protein ID: ENSP00000379866
Ensembl nucleotide ID: ENSG00000081052
Gene name: Collagen alpha-4(IV) chain
Gene symbol: COL4A4
Organism: Homo sapiens
NCBI taxa ID: 9606
Cleft type:
Developmental stage:
Data sources: Manually collected
Reference: 16953426
Functional description: Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
Sequence:
MWSLHIVLMR CSFRLTKSLA TGPWSLILIL FSVQYVYGSG KKYIGPCGGR DCSVCHCVPE 60
KGSRGPPGPP GPQGPIGPLG APGPIGLSGE KGMRGDRGPP GAAGDKGDKG PTGVPGFPGL 120
DGIPGHPGPP GPRGKPGMSG HNGSRGDPGF PGGRGALGPG GPLGHPGEKG EKGNSVFILG 180
AVKGIQGDRG DPGLPGLPGS WGAGGPAGPT GYPGEPGLVG PPGQPGRPGL KGNPGVGVKG 240
QMGDPGEVGQ QGSPGPTLLV EPPDFCLYKG EKGIKGIPGM VGLPGPPGRK GESGIGAKGE 300
KGIPGFPGPR GDPGSYGSPG FPGLKGELGL VGDPGLFGLI GPKGDPGNRG HPGPPGVLVT 360
PPLPLKGPPG DPGFPGRYGE TGDVGPPGPP GLLGRPGEAC AGMIGPPGPQ GFPGLPGLPG 420
EAGIPGRPDS APGKPGKPGS PGLPGAPGLQ GLPGSSVIYC SVGNPGPQGI KGKVGPPGGR 480
GPKGEKGNEG LCACEPGPMG PPGPPGLPGR QGSKGDLGLP GWLGTKGDPG PPGAEGPPGL 540
PGKHGASGPP GNKGAKGDMV VSRVKGHKGE RGPDGPPGFP GQPGSHGRDG HAGEKGDPGP 600
PGDHEDATPG GKGFPGPLGP PGKAGPVGPP GLGFPGPPGE RGHPGVPGHP GVRGPDGLKG 660
QKGDTISCNV TYPGRHGPPG FDGPPGPKGF PGPQGAPGLS GSDGHKGRPG TPGTAEIPGP 720
PGFRGDMGDP GFGGEKGSSP VGPPGPPGSP GVNGQKGIPG DPAFGHLGPP GKRGLSGVPG 780
IKGPRGDPGC PGAEGPAGIP GFLGLKGPKG REGHAGFPGV PGPPGHSCER GAPGIPGQPG 840
LPGYPGSPGA PGGKGQPGDV GPPGPAGMKG LPGLPGRPGA HGPPGLPGIP GPFGDDGLPG 900
PPGPKGPRGL PGFPGFPGER GKPGAEGCPG AKGEPGEKGM SGLPGDRGLR GAKGAIGPPG 960
DEGEMAIISQ KGTPGEPGPP GDDGFPGERG DKGTPGMQGR RGEPGRYGPP GFHRGEPGEK 1020
GQPGPPGPPG PPGSTGLRGF IGFPGLPGDQ GEPGSPGPPG FSGIDGARGP KGNKGDPASH 1080
FGPPGPKGEP GSPGCPGHFG ASGEQGLPGI QGPRGSPGRP GPPGSSGPPG CPGDHGMPGL 1140
RGQPGEMGDP GPRGLQGDPG IPGPPGIKGP SGSPGLNGLH GLKGQKGTKG ASGLHDVGPP 1200
GPVGIPGLKG ERGDPGSPGI SPPGPRGKKG PPGPPGSSGP PGPAGATGRA PKDIPDPGPP 1260
GDQGPPGPDG PRGAPGPPGL PGSVDLLRGE PGDCGLPGPP GPPGPPGPPG YKGFPGCDGK 1320
DGQKGPVGFP GPQGPHGFPG PPGEKGLPGP PGRKGPTGLP GPRGEPGPPA DVDDCPRIPG 1380
LPGAPGMRGP EGAMGLPGMR GPSGPGCKGE PGLDGRRGVD GVPGSPGPPG RKGDTGEDGY 1440
PGGPGPPGPI GDPGPKGFGP GYLGGFLLVL HSQTDQEPTC PLGMPRLWTG YSLLYLEGQE 1500
KAHNQDLGLA GSCLPVFSTL PFAYCNIHQV CHYAQRNDRS YWLASAAPLP MMPLSEEAIR 1560
PYVSRCAVCE APAQAVAVHS QDQSIPPCPQ TWRSLWIGYS FLMHTGAGDQ GGGQALMSPG 1620
SCLEDFRAAP FLECQGRQGT CHFFANKYSF WLTTVKADLQ FSSAPAPDTL KESQAQRQKI 1680
SRCQVCVKYS
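
The sequence above is laid out in groups of ten residues, with a running position count at the end of each full line. To use it in other tools it usually has to be joined into one continuous string. The following is a minimal sketch of one way to do that, not part of the database itself: the function name `parse_sequence_block` and the sanity check on the trailing counters are assumptions made for illustration, and the example input is only the first two lines of the record.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-numbered sequence block into one string.

    Hypothetical helper: assumes residue groups are separated by spaces
    and that a purely numeric token at the end of a line is the running
    residue count for that line.
    """
    residues = []
    total = 0
    for line_no, line in enumerate(text.splitlines(), start=1):
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        counter = None
        if tokens[-1].isdigit():
            counter = int(tokens.pop())  # trailing position counter
        for token in tokens:
            if not re.fullmatch(r"[A-Z]+", token):
                raise ValueError(f"line {line_no}: unexpected token {token!r}")
            residues.append(token)
            total += len(token)
        # If a counter is present, it should match the residues read so far.
        if counter is not None and counter != total:
            raise ValueError(
                f"line {line_no}: counter says {counter}, parsed {total}"
            )
    return "".join(residues)


# First two lines of the COL4A4 sequence block, copied from the record.
example = """\
MWSLHIVLMR CSFRLTKSLA TGPWSLILIL FSVQYVYGSG KKYIGPCGGR DCSVCHCVPE 60
KGSRGPPGPP GPQGPIGPLG APGPIGLSGE KGMRGDRGPP GAAGDKGDKG PTGVPGFPGL 120
"""

seq = parse_sequence_block(example)
print(len(seq))
print(seq[:10])
```

Checking the counters while parsing is a design choice: it catches a dropped or duplicated group early, which is a common failure when a sequence block is copied out of a web page.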

Abbreviations:
CLO: cleft lip only. CPO: cleft palate only. CLP: cleft lip and palate. CL/P: cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Gene expression information

Gene expression in different tissues (GTEx V7)

  

Gene expression in different tissues (ENCODE)

  

Protein structural annotations

3D structure in PDB database


Protein disorder information

Orthologous information

Other genetic variants/mutations

Disease or phenotype associated information

Gene Ontology (GO)/biological pathways

GO:Molecular Function


GO:Biological Process


GO:Cellular Component


Reactome Pathway

Drugs and compounds information

Functional annotations

Keywords

Interpro

PROSITE

Pfam

Protein-protein interaction

Protein-miRNA interaction