Gene: COL4A2

Basic information

Tag Content
Uniprot ID K7C8W0; A0A2J8KTB0; H2RD60;
Entrez ID 452661
Genbank protein ID JAA28938.1; JAA38281.1;
Genbank nucleotide ID XM_001136859.3
Ensembl protein ID ENSPTRP00000058373
Ensembl nucleotide ID ENSPTRG00000006032
Gene name Collagen type IV alpha 2 chain
Gene symbol COL4A2
Organism Pan troglodytes
NCBI taxa ID 9598
Cleft type
Developmental stage
Data sources Homology search
Reference
Functional description
Sequence
MGRDQRAVAG PALRRWLLGT VTVGFLAQSV LAGVKKFDVP CGGRDCSGGC QCYPEKGGRG 60
QPGPVGPQGY NGPPGLQGFP GLQGRKGDKG ERGAPGITGP KGDVGARGVS GFPGADGIPG 120
HPGQGGPRGR PGYDGCNGTQ GDSGPQGPPG SEGFTGPPGP QGPKGQKGEP YALPKEERDR 180
YRGEPGEPGL VGFQGPPGRP GHVGQMGPVG APGRPGPPGP PGPKGQQGNR GLGFYGVKGE 240
KGDVGQPGPN GIPSDLLHPI IAPTGVTFHP DQYKGEKGSE GEPGIRGISL KGEEGIMGFP 300
GLRGYPGLSG EKGSPGQKGS RGLDGYQGPD GPRGPKGEAG DPGPPGLPAY SPHPSLAKGA 360
RGDPGFPGAQ GEPGSQGEPG DPGLPGAPGL SIGDGDQRRG LPGEMGPKGF IGDPGIPALY 420
GGPPGPDGKR GPPGPPGLPG PPGPDGFLFG LKGAKGRAGF PGLPGSPGAR GPKGWKGDAG 480
ECRCTEGDEA IKGLPGLPGP KGFAGINGEP GRKGDKGDPG QHGLPGFPGL KGVPGNVGAP 540
GPKGAKGDSR TITTKGERGQ PGVPGVPGMK GDDGSPGRDG LDGFPGLPGP PGDGIKGPPG 600
DPGYPGIPGT KGTPGEMGPP GLGLPGLKGQ RGFPGDAGLP GPPGFLGPPG PAGTPGQIDC 660
DTDVKRAIGG DRQEAIQPGC VGGPKGLPGL PGPPGPTGAK GLRGIPGFSG ADGGPGPKGL 720
PGDAGREGFP GPPGFIGPRG SKGAVGLPGP DGSPGPIGLP GPDGPPGERG LPGEVLGAQP 780
GPRGDAGVPG QPGLKGLPGD RGPPGFRGSQ GMPGMPGLKG QPGLPGPSGQ PGLYGPPGLH 840
GFPGAPGQEG PLGLPGIPGR EGLPGDRGDP GDTGAPGPVG MKGLSGDRGD AGFTGERGHP 900
GSPGFKGIDG MPGTPGLKGD RGSPGMDGFQ GMPGLKGRPG FPGSKGEAGF FGIPGLKGLA 960
GEPGFKGSRG DPGPPGPPPV ILPGMKDIKG EKGDEGPMGL KGYLGAKGIQ GMPGIPGLSG 1020
IPGLPGRPGH IKGVKGDIGA PGIPGLPGFP GVAGPPGITG FPGFIGSRGD KGAPGRAGLY 1080
GEIGATGDFG DIGDTINLPG RPGLKGERGT TGIPGLKGFF GEKGTEGDIG FPGITGVTGV 1140
QGPPGLKGQT GFPGLTGPPG SQGEPGRIGL PGGKGDDGWP GAPGLPGFPG LRGIRGLHGL 1200
PGTKGFPGSP GSDIHGDPGF PGPPGERGDP GEANTLPGPV GVPGQKGDQG APGERGPPGS 1260
PGLQGFPGIT PPSNISGAPG DKGAPGIFGL KGYRGPPGPP GSAALPGSKG DTGNPGAPGT 1320
PGTKGWAGDS GPQGRPGVFG LPGEKGPRGE QGFMGNTGPT GAVGDRGPKG PKGDPGFPGA 1380
PGTVGAPGIA GIPQKIAVQP GTVGPQGRRG PPGAPGEMGP QGPPGEPGFR GAPGKAGPQG 1440
RGGVSAVPGF RGDEGPIGHQ GPIGQEGAPG RPGSPGLPGM PGRSVSIGYL LVKHSQTDQE 1500
PMCPVGMNKL WSGYSLLYFE GQEKAHNQDL GLAGSCLARF STMPFLYCNP GDVCYYASRN 1560
DKSYWLSTTA PLPMMPVAED EIKPYISRCS VCEAPAVAIA VHSQDVSIPH CPAGWRSLWI 1620
GYSFLMHTAA GDEGGGQSLV SPGSCLEDFR ATPFIECNGG RGTCHYYANK YSFWLTTIPE 1680
QSFQGSPSAD TLKAGLIRTH ISRCQVCMKN L 1711

Abbreviation :
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Protein disorder information

Orthologous information

Relation Gene symbol Entrez ID UniProt ID Cleft type Developmental stage Species Evidence Details
1:1 orthologCOL4A2A0A452G6J0Capra hircusPredictionMore>>
1:1 orthologCOL4A21284P08572Homo sapiensPublicationMore>>
1:1 orthologCol4a212827P08122Mus musculusPredictionMore>>
1:1 orthologCOL4A2452661K7C8W0Pan troglodytesPredictionMore>>
1:1 orthologCOL4A2F1RLL9Sus scrofaPredictionMore>>
1:1 orthologCol4a2F1M6Q3Rattus norvegicusPredictionMore>>

Gene ontology

GO ID GO Term
GO:0005587 collagen type IV trimer
GO:0031012 extracellular matrix
GO:0005615 extracellular space
GO:0005201 extracellular matrix structural constituent
GO:0071560 cellular response to transforming growth factor beta stimulus
GO:0038063 collagen-activated tyrosine kinase receptor signaling pathway
GO:0035987 endodermal cell differentiation
GO:0030198 extracellular matrix organization
GO:0016525 negative regulation of angiogenesis
GO:0006351 transcription, DNA-templated

Functional annotations

Keywords

Keyword ID Keyword Term
KW-0176 Collagen
KW-1015 Disulfide bond
KW-1185 Reference proteome
KW-0732 Signal

Interpro

InterPro ID InterPro Term
IPR008160 Collagen
IPR001442 Collagen_IV_NC
IPR036954 Collagen_IV_NC_sf
IPR016187 CTDL_fold

PROSITE

PROSITE ID PROSITE Term
PS51403 NC1_IV

Pfam

Pfam ID Pfam Term
PF01413 C4
PF01391 Collagen