Gene: COL4A4

Basic information

Tag Content
Uniprot ID P53420; A8MTZ1; Q53RW9; Q53S42; Q53WR1;
Entrez ID 1286
Genbank protein ID BAA25065.1; BAA04214.1; CAA56943.1; AAY24061.1; CAA76763.1; AAY14670.1;
Genbank nucleotide ID NM_000092.4; XM_005246281.3;
Ensembl protein ID ENSP00000379866
Ensembl nucleotide ID ENSG00000081052
Gene name Collagen alpha-4(IV) chain
Gene symbol COL4A4
Organism Homo sapiens
NCBI taxa ID 9606
Cleft type
Developmental stage
Data sources Manually collected
Reference 16953426
Functional description Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
Sequence
MWSLHIVLMR CSFRLTKSLA TGPWSLILIL FSVQYVYGSG KKYIGPCGGR DCSVCHCVPE 60
KGSRGPPGPP GPQGPIGPLG APGPIGLSGE KGMRGDRGPP GAAGDKGDKG PTGVPGFPGL 120
DGIPGHPGPP GPRGKPGMSG HNGSRGDPGF PGGRGALGPG GPLGHPGEKG EKGNSVFILG 180
AVKGIQGDRG DPGLPGLPGS WGAGGPAGPT GYPGEPGLVG PPGQPGRPGL KGNPGVGVKG 240
QMGDPGEVGQ QGSPGPTLLV EPPDFCLYKG EKGIKGIPGM VGLPGPPGRK GESGIGAKGE 300
KGIPGFPGPR GDPGSYGSPG FPGLKGELGL VGDPGLFGLI GPKGDPGNRG HPGPPGVLVT 360
PPLPLKGPPG DPGFPGRYGE TGDVGPPGPP GLLGRPGEAC AGMIGPPGPQ GFPGLPGLPG 420
EAGIPGRPDS APGKPGKPGS PGLPGAPGLQ GLPGSSVIYC SVGNPGPQGI KGKVGPPGGR 480
GPKGEKGNEG LCACEPGPMG PPGPPGLPGR QGSKGDLGLP GWLGTKGDPG PPGAEGPPGL 540
PGKHGASGPP GNKGAKGDMV VSRVKGHKGE RGPDGPPGFP GQPGSHGRDG HAGEKGDPGP 600
PGDHEDATPG GKGFPGPLGP PGKAGPVGPP GLGFPGPPGE RGHPGVPGHP GVRGPDGLKG 660
QKGDTISCNV TYPGRHGPPG FDGPPGPKGF PGPQGAPGLS GSDGHKGRPG TPGTAEIPGP 720
PGFRGDMGDP GFGGEKGSSP VGPPGPPGSP GVNGQKGIPG DPAFGHLGPP GKRGLSGVPG 780
IKGPRGDPGC PGAEGPAGIP GFLGLKGPKG REGHAGFPGV PGPPGHSCER GAPGIPGQPG 840
LPGYPGSPGA PGGKGQPGDV GPPGPAGMKG LPGLPGRPGA HGPPGLPGIP GPFGDDGLPG 900
PPGPKGPRGL PGFPGFPGER GKPGAEGCPG AKGEPGEKGM SGLPGDRGLR GAKGAIGPPG 960
DEGEMAIISQ KGTPGEPGPP GDDGFPGERG DKGTPGMQGR RGEPGRYGPP GFHRGEPGEK 1020
GQPGPPGPPG PPGSTGLRGF IGFPGLPGDQ GEPGSPGPPG FSGIDGARGP KGNKGDPASH 1080
FGPPGPKGEP GSPGCPGHFG ASGEQGLPGI QGPRGSPGRP GPPGSSGPPG CPGDHGMPGL 1140
RGQPGEMGDP GPRGLQGDPG IPGPPGIKGP SGSPGLNGLH GLKGQKGTKG ASGLHDVGPP 1200
GPVGIPGLKG ERGDPGSPGI SPPGPRGKKG PPGPPGSSGP PGPAGATGRA PKDIPDPGPP 1260
GDQGPPGPDG PRGAPGPPGL PGSVDLLRGE PGDCGLPGPP GPPGPPGPPG YKGFPGCDGK 1320
DGQKGPVGFP GPQGPHGFPG PPGEKGLPGP PGRKGPTGLP GPRGEPGPPA DVDDCPRIPG 1380
LPGAPGMRGP EGAMGLPGMR GPSGPGCKGE PGLDGRRGVD GVPGSPGPPG RKGDTGEDGY 1440
PGGPGPPGPI GDPGPKGFGP GYLGGFLLVL HSQTDQEPTC PLGMPRLWTG YSLLYLEGQE 1500
KAHNQDLGLA GSCLPVFSTL PFAYCNIHQV CHYAQRNDRS YWLASAAPLP MMPLSEEAIR 1560
PYVSRCAVCE APAQAVAVHS QDQSIPPCPQ TWRSLWIGYS FLMHTGAGDQ GGGQALMSPG 1620
SCLEDFRAAP FLECQGRQGT CHFFANKYSF WLTTVKADLQ FSSAPAPDTL KESQAQRQKI 1680
SRCQVCVKYS

Abbreviation :
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Gene expression information

Gene expression in different tissues (GTEx V7)

  

Gene expression in different tissues (ENCODE)

  

Protein structural annotations

3D structure in PDB database


loading...

Protein disorder information

Orthologous information

Relation Gene symbol Entrez ID UniProt ID Cleft type Developmental stage Species Evidence Details
1:1 orthologCOL4A4A0A452G3N1Capra hircusPredictionMore>>
1:1 orthologCOL4A41286P53420Homo sapiensPublicationMore>>
1:1 orthologCol4a412829Q9QZR9Mus musculusPredictionMore>>
1:1 orthologCOL4A4459986H2R630Pan troglodytesPredictionMore>>
1:1 orthologCol4a4A0A0G2K742Rattus norvegicusPredictionMore>>

Other genetic variants/mutations

loading...

Disease or phenotype associated information

loading...

Gene Ontology (GO)/biological pathways

GO:Molecular Function

GO ID GO Term Evidence
GO:0005201 extracellular matrix structural constituentIMP
GO:0005201 extracellular matrix structural constituentIBA
GO:0030020 extracellular matrix structural constituent conferring tensile strengthRCA

GO:Biological Process

GO ID GO Term Evidence
GO:0030198 extracellular matrix organizationIBA
GO:0030198 extracellular matrix organizationTAS
GO:0032836 glomerular basement membrane developmentIMP
GO:0032836 glomerular basement membrane developmentIBA
GO:0038063 collagen-activated tyrosine kinase receptor signaling pathwayIBA

GO:Cellular Component

GO ID GO Term Evidence
GO:0005576 extracellular regionTAS
GO:0005587 collagen type IV trimerIBA
GO:0005587 collagen type IV trimerIDA
GO:0005604 basement membraneIDA
GO:0005615 extracellular spaceIBA
GO:0005788 endoplasmic reticulum lumenTAS
GO:0031012 extracellular matrixIBA
GO:0062023 collagen-containing extracellular matrixHDA

Reactome Pathway

Reactome ID Reactome Term Evidence
R-HSA-1266738 Developmental BiologyTAS
R-HSA-1442490 Collagen degradationTAS
R-HSA-1442490 Collagen degradationIEA
R-HSA-1474228 Degradation of the extracellular matrixTAS
R-HSA-1474228 Degradation of the extracellular matrixIEA
R-HSA-1474244 Extracellular matrix organizationTAS
R-HSA-1474244 Extracellular matrix organizationIEA
R-HSA-1474290 Collagen formationTAS
R-HSA-162582 Signal TransductionTAS
R-HSA-1650814 Collagen biosynthesis and modifying enzymesTAS
R-HSA-186797 Signaling by PDGFTAS
R-HSA-2022090 Assembly of collagen fibrils and other multimeric structuresTAS
R-HSA-216083 Integrin cell surface interactionsTAS
R-HSA-216083 Integrin cell surface interactionsIEA
R-HSA-2214320 Anchoring fibril formationTAS
R-HSA-2243919 Crosslinking of collagen fibrilsTAS
R-HSA-3000157 Laminin interactionsIEA
R-HSA-3000157 Laminin interactionsTAS
R-HSA-3000171 Non-integrin membrane-ECM interactionsTAS
R-HSA-3000178 ECM proteoglycansIEA
R-HSA-375165 NCAM signaling for neurite out-growthTAS
R-HSA-419037 NCAM1 interactionsTAS
R-HSA-422475 Axon guidanceTAS
R-HSA-8948216 Collagen chain trimerizationTAS
R-HSA-9006934 Signaling by Receptor Tyrosine KinasesTAS

Drugs and compounds information

loading...

Functional annotations

Keywords

Keyword ID Keyword Term
KW-0002 3D-structure
KW-0023 Alport syndrome
KW-0084 Basement membrane
KW-0176 Collagen
KW-0209 Deafness
KW-0225 Disease mutation
KW-1015 Disulfide bond
KW-0272 Extracellular matrix
KW-0325 Glycoprotein
KW-0379 Hydroxylation
KW-0621 Polymorphism
KW-1185 Reference proteome
KW-0677 Repeat
KW-0964 Secreted
KW-0732 Signal

Interpro

InterPro ID InterPro Term
IPR008160 Collagen
IPR001442 Collagen_IV_NC
IPR036954 Collagen_IV_NC_sf
IPR016187 CTDL_fold

PROSITE

PROSITE ID PROSITE Term
PS51403 NC1_IV

Pfam

Pfam ID Pfam Term
PF01413 C4
PF01391 Collagen

Protein-protein interaction

Protein-miRNA interaction