Gene: COL4A2

Basic information

Tag Content
Uniprot ID P08572; Q14052; Q548C3; Q5VZA9; Q66K23;
Entrez ID 1284
Genbank protein ID AAA52043.1; AAF72631.1; AAR20245.1; AAK92479.1; AAH80644.1; AAA58422.1; AAR18250.1; AAA53097.1; CAA31275.1; CAA29076.1; AAA53099.1; CAA29098.1;
Genbank nucleotide ID NM_001846.3
Ensembl protein ID ENSP00000353654
Ensembl nucleotide ID ENSG00000134871
Gene name Collagen alpha-2(IV) chain
Gene symbol COL4A2
Organism Homo sapiens
NCBI taxa ID 9606
Cleft type
Developmental stage
Data sources Manually collected
Reference 26449438
Functional description Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
Sequence
MGRDQRAVAG PALRRWLLLG TVTVGFLAQS VLAGVKKFDV PCGGRDCSGG CQCYPEKGGR 60
GQPGPVGPQG YNGPPGLQGF PGLQGRKGDK GERGAPGVTG PKGDVGARGV SGFPGADGIP 120
GHPGQGGPRG RPGYDGCNGT QGDSGPQGPP GSEGFTGPPG PQGPKGQKGE PYALPKEERD 180
RYRGEPGEPG LVGFQGPPGR PGHVGQMGPV GAPGRPGPPG PPGPKGQQGN RGLGFYGVKG 240
EKGDVGQPGP NGIPSDTLHP IIAPTGVTFH PDQYKGEKGS EGEPGIRGIS LKGEEGIMGF 300
PGLRGYPGLS GEKGSPGQKG SRGLDGYQGP DGPRGPKGEA GDPGPPGLPA YSPHPSLAKG 360
ARGDPGFPGA QGEPGSQGEP GDPGLPGPPG LSIGDGDQRR GLPGEMGPKG FIGDPGIPAL 420
YGGPPGPDGK RGPPGPPGLP GPPGPDGFLF GLKGAKGRAG FPGLPGSPGA RGPKGWKGDA 480
GECRCTEGDE AIKGLPGLPG PKGFAGINGE PGRKGDRGDP GQHGLPGFPG LKGVPGNIGA 540
PGPKGAKGDS RTITTKGERG QPGVPGVPGM KGDDGSPGRD GLDGFPGLPG PPGDGIKGPP 600
GDPGYPGIPG TKGTPGEMGP PGLGLPGLKG QRGFPGDAGL PGPPGFLGPP GPAGTPGQID 660
CDTDVKRAVG GDRQEAIQPG CIGGPKGLPG LPGPPGPTGA KGLRGIPGFA GADGGPGPRG 720
LPGDAGREGF PGPPGFIGPR GSKGAVGLPG PDGSPGPIGL PGPDGPPGER GLPGEVLGAQ 780
PGPRGDAGVP GQPGLKGLPG DRGPPGFRGS QGMPGMPGLK GQPGLPGPSG QPGLYGPPGL 840
HGFPGAPGQE GPLGLPGIPG REGLPGDRGD PGDTGAPGPV GMKGLSGDRG DAGFTGEQGH 900
PGSPGFKGID GMPGTPGLKG DRGSPGMDGF QGMPGLKGRP GFPGSKGEAG FFGIPGLKGL 960
AGEPGFKGSR GDPGPPGPPP VILPGMKDIK GEKGDEGPMG LKGYLGAKGI QGMPGIPGLS 1020
GIPGLPGRPG HIKGVKGDIG VPGIPGLPGF PGVAGPPGIT GFPGFIGSRG DKGAPGRAGL 1080
YGEIGATGDF GDIGDTINLP GRPGLKGERG TTGIPGLKGF FGEKGTEGDI GFPGITGVTG 1140
VQGPPGLKGQ TGFPGLTGPP GSQGELGRIG LPGGKGDDGW PGAPGLPGFP GLRGIRGLHG 1200
LPGTKGFPGS PGSDIHGDPG FPGPPGERGD PGEANTLPGP VGVPGQKGDQ GAPGERGPPG 1260
SPGLQGFPGI TPPSNISGAP GDKGAPGIFG LKGYRGPPGP PGSAALPGSK GDTGNPGAPG 1320
TPGTKGWAGD SGPQGRPGVF GLPGEKGPRG EQGFMGNTGP TGAVGDRGPK GPKGDPGFPG 1380
APGTVGAPGI AGIPQKIAVQ PGTVGPQGRR GPPGAPGEMG PQGPPGEPGF RGAPGKAGPQ 1440
GRGGVSAVPG FRGDEGPIGH QGPIGQEGAP GRPGSPGLPG MPGRSVSIGY LLVKHSQTDQ 1500
EPMCPVGMNK LWSGYSLLYF EGQEKAHNQD LGLAGSCLAR FSTMPFLYCN PGDVCYYASR 1560
NDKSYWLSTT APLPMMPVAE DEIKPYISRC SVCEAPAIAI AVHSQDVSIP HCPAGWRSLW 1620
IGYSFLMHTA AGDEGGGQSL VSPGSCLEDF RATPFIECNG GRGTCHYYAN KYSFWLTTIP 1680
EQSFQGSPSA DTLKAGLIRT HISRCQVCMK NL 1712

Abbreviation :
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Gene expression information

Gene expression in different tissues (GTEx V7)

  

Gene expression in different tissues (ENCODE)

  

Protein structural annotations

3D structure in PDB database

Protein disorder information

Orthologous information

Relation Gene symbol Entrez ID UniProt ID Cleft type Developmental stage Species Evidence Details
1:1 orthologCOL4A2A0A452G6J0Capra hircusPredictionMore>>
1:1 orthologCOL4A21284P08572Homo sapiensPublicationMore>>
1:1 orthologCol4a212827P08122Mus musculusPredictionMore>>
1:1 orthologCOL4A2452661K7C8W0Pan troglodytesPredictionMore>>
1:1 orthologCOL4A2F1RLL9Sus scrofaPredictionMore>>
1:1 orthologCol4a2F1M6Q3Rattus norvegicusPredictionMore>>

Identified variants/mutations related to cleft phenotype

Gene symbol Significant Variants/SNPS Methods PubMed ID
COL4A2c.4096G>A; p.D1366NWES and Sanger sequencing26449438

Other genetic variants/mutations

loading...

Disease or phenotype associated information

loading...

Gene Ontology (GO)/biological pathways

GO:Molecular Function

GO ID GO Term Evidence
GO:0005201 extracellular matrix structural constituentIBA
GO:0005201 extracellular matrix structural constituentTAS
GO:0005515 protein bindingIPI
GO:0030020 extracellular matrix structural constituent conferring tensile strengthISS
GO:0030020 extracellular matrix structural constituent conferring tensile strengthRCA
GO:0030020 extracellular matrix structural constituent conferring tensile strengthHDA

GO:Biological Process

GO ID GO Term Evidence
GO:0001525 angiogenesisIEA
GO:0006351 transcription, DNA-templatedIEA
GO:0007568 agingIEA
GO:0014823 response to activityIEA
GO:0016525 negative regulation of angiogenesisIDA
GO:0030198 extracellular matrix organizationIBA
GO:0030198 extracellular matrix organizationNAS
GO:0030198 extracellular matrix organizationTAS
GO:0035987 endodermal cell differentiationIEP
GO:0038063 collagen-activated tyrosine kinase receptor signaling pathwayIBA
GO:0071560 cellular response to transforming growth factor beta stimulusIEA

GO:Cellular Component

GO ID GO Term Evidence
GO:0005576 extracellular regionTAS
GO:0005587 collagen type IV trimerIBA
GO:0005587 collagen type IV trimerTAS
GO:0005615 extracellular spaceIBA
GO:0005788 endoplasmic reticulum lumenTAS
GO:0031012 extracellular matrixIBA
GO:0062023 collagen-containing extracellular matrixHDA
GO:0070062 extracellular exosomeHDA
GO:0062023 collagen-containing extracellular matrixISS
GO:0062023 collagen-containing extracellular matrixTAS

Reactome Pathway

Reactome ID Reactome Term Evidence
R-HSA-1266738 Developmental BiologyTAS
R-HSA-1442490 Collagen degradationTAS
R-HSA-1442490 Collagen degradationIEA
R-HSA-1474228 Degradation of the extracellular matrixTAS
R-HSA-1474228 Degradation of the extracellular matrixIEA
R-HSA-1474244 Extracellular matrix organizationTAS
R-HSA-1474244 Extracellular matrix organizationIEA
R-HSA-1474290 Collagen formationTAS
R-HSA-162582 Signal TransductionTAS
R-HSA-1650814 Collagen biosynthesis and modifying enzymesTAS
R-HSA-186797 Signaling by PDGFTAS
R-HSA-2022090 Assembly of collagen fibrils and other multimeric structuresTAS
R-HSA-216083 Integrin cell surface interactionsTAS
R-HSA-216083 Integrin cell surface interactionsIEA
R-HSA-2173782 Binding and Uptake of Ligands by Scavenger ReceptorsIEA
R-HSA-2214320 Anchoring fibril formationTAS
R-HSA-2243919 Crosslinking of collagen fibrilsTAS
R-HSA-3000157 Laminin interactionsIEA
R-HSA-3000157 Laminin interactionsTAS
R-HSA-3000171 Non-integrin membrane-ECM interactionsTAS
R-HSA-3000178 ECM proteoglycansIEA
R-HSA-3000480 Scavenging by Class A ReceptorsIEA
R-HSA-375165 NCAM signaling for neurite out-growthTAS
R-HSA-419037 NCAM1 interactionsTAS
R-HSA-422475 Axon guidanceTAS
R-HSA-5653656 Vesicle-mediated transportIEA
R-HSA-8948216 Collagen chain trimerizationTAS
R-HSA-9006934 Signaling by Receptor Tyrosine KinasesTAS

Drugs and compounds information

loading...

Functional annotations

Keywords

Keyword ID Keyword Term
KW-0002 3D-structure
KW-0037 Angiogenesis
KW-0084 Basement membrane
KW-0176 Collagen
KW-0903 Direct protein sequencing
KW-0225 Disease mutation
KW-1015 Disulfide bond
KW-0272 Extracellular matrix
KW-0325 Glycoprotein
KW-0379 Hydroxylation
KW-0621 Polymorphism
KW-1185 Reference proteome
KW-0677 Repeat
KW-0964 Secreted
KW-0732 Signal

Interpro

InterPro ID InterPro Term
IPR008160 Collagen
IPR001442 Collagen_IV_NC
IPR036954 Collagen_IV_NC_sf
IPR016187 CTDL_fold

PROSITE

PROSITE ID PROSITE Term
PS51403 NC1_IV

Pfam

Pfam ID Pfam Term
PF01413 C4
PF01391 Collagen

Protein-protein interaction

Protein-miRNA interaction