Gene: COL4A2

Basic information

Tag Content
Uniprot ID F1RLL9
Entrez ID
Genbank protein ID
Genbank nucleotide ID
Ensembl protein ID ENSSSCP00000010191
Ensembl nucleotide ID ENSSSCG00000009545
Gene name Collagen type IV alpha 2 chain
Gene symbol COL4A2
Organism Sus scrofa
NCBI taxa ID 9823
Cleft type
Developmental stage
Data sources Homology search
Reference
Functional description
Sequence
MDRDPRAASG PALRRWLLLG TVMVGLLAQS VLAGVKKLDV PCGGRDCSGG CQCYPEKGGR 60
GQPGPVGPQG YTGPPGLQGF PGLQGRKGDK GERGAPGITG PKGDVGARGV SGFPGADGIP 120
GHPGQGGPRG RPGSDGCNGT VGDTGYAGPV GPDGFLGPPG PQGPKGQKGE PYALSREDRD 180
KYRGEPGEPG LVGFQGPPGR PGPVGQMGPV GAPGRPGPPG PPGPKGQPGN RGLGFYGEKG 240
EKGDVGQPGP NGIPSDNHHP IIGPTRETIY LDQYKGEKGS EGEPGRKGIS LKGEEGIMGF 300
SGSRGVPGFD GEKGSPGQKG SRGLDGYEGP DGYPGPKGER GDPGPPGAPA YSPHPSLAKG 360
ARGEPGFPGA LGEPGARGEP GDPGPPGLPG TSVRDEDEKR GLPGEMGPKG YAGEPGAPAL 420
YPGPPGADGK PGLRGPPGPP GPPGPDDFLF GLKGAKGSMG YPGPSGFPGA RGQKGWKGDA 480
GDCKCAEDDQ FVGGLPGPPG PKGFPGINGE PGRKGSQGDP GQHGIPGFPG FKGAPGDAGP 540
PGPKGMKGDS RTITTKGERG QPGVPGVPGL RGDDGAPGRD GLDGFPGLPG PPGDGIKGPP 600
GDAGHPGVPG TKGLAGDRGP PGLGLPGPKG ERGFPGDDGL PGPPGFPGPP GPPGPPGQID 660
CDSGVKRPIG ADGQEVIQPG CVGGPKGSPG QPGPPGPPGA KGLRGIPGPS GADGAPGLKG 720
FPGDPGREGF PGPPGFVGPR GSKGAVGPPG LDGLPGPSGL PGPVGPPGDK GLPGEVLGAQ 780
PGSRGDPGLP GHPGLKGPPG ERGAPGFRGS EGMPGMPGLK GQPGFPGPSG QPGLPGPPGQ 840
HGFPGAPGRE GPLGPPGAPG FGGLPGDRGD PGDTGVPGPV GMKGLSGDRG DPGLLGERGH 900
PGSPGFKGVA GMPGAPGPKG TRGSPGMHGF QGMLGLKGSP GLPGSKGEAG FFGVPGLKGL 960
PGEPGVKGSR GDPGLPGPPP TILPGMKDIK GEKGDEGPMG LKGYLGLKGI PGMPGIPGLS 1020
GVPGLPGKPG HVKGAKGDTG APGVPGSPGF PGLPGPPGII GFPGFTGSRG DKGSPGRAGL 1080
YGEIGATGDF GDIGDTIDLP GSPGLKGERG TAGVPGLKGF FGEKGTVGDV GFPGITGLAG 1140
VQGPPGLKGQ TGFPGLTGLQ GPQGDPGRAG VPGAKGELGW PGNVGLPGLP GIRGISGLHG 1200
LPGTKGFPGS PGADVHGDPG FPGPAGDRGD PGEANTLPGP AGAPGQKGER GAPGERGPVG 1260
NPGLQGFPGI TPPSNVSGLP GDTGAPGIFG PEGYRGPPGP PGPAALPGSK GDEGLPGIPG 1320
NPGGKGWVGD PGPQGRPGVF GLPGEKGPRG EQGFMGNTGA TGSVGDRGPK GPKGDRGLPG 1380
PPGAVGAPGI VGIPQRIAVQ RGPVGPQGRR GPPGAQGEMG PQGPPGEPGF RGAPGKAGPQ 1440
GRGGVSAVPG FRGDQGPVGQ QGPVGQEGEP GRPGSPGLPG MPGRSVSIGY LLVKHSQTDQ 1500
EPMCPVGMNK LWSGYSLLYF EGQEKAHNQD LGLAGSCLAR FSTMPFLYCN PGDICHYASR 1560
NDKSYWLSTT APLPMMPVAE EEIRPYISRC SVCEAPAVAI AVHSQDVSIP HCPAGWRSLW 1620
IGYSFLMHTA AGDEGGGQSL VSPGSCLEDF RATPFIECNG ARGTCHYYAN KYSFWLTTIP 1680
EQSFQSSPSA DTLKAGLIRT HISRCQVCMK NL 1712

Abbreviation :
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Protein disorder information

Orthologous information

Relation Gene symbol Entrez ID UniProt ID Cleft type Developmental stage Species Evidence Details
1:1 orthologCOL4A2A0A452G6J0Capra hircusPredictionMore>>
1:1 orthologCOL4A21284P08572Homo sapiensPublicationMore>>
1:1 orthologCol4a212827P08122Mus musculusPredictionMore>>
1:1 orthologCOL4A2452661K7C8W0Pan troglodytesPredictionMore>>
1:1 orthologCOL4A2F1RLL9Sus scrofaPredictionMore>>
1:1 orthologCol4a2F1M6Q3Rattus norvegicusPredictionMore>>

Gene ontology

GO ID GO Term
GO:0005587 collagen type IV trimer
GO:0031012 extracellular matrix
GO:0005615 extracellular space
GO:0005201 extracellular matrix structural constituent
GO:0071560 cellular response to transforming growth factor beta stimulus
GO:0038063 collagen-activated tyrosine kinase receptor signaling pathway
GO:0035987 endodermal cell differentiation
GO:0030198 extracellular matrix organization
GO:0016525 negative regulation of angiogenesis
GO:0006351 transcription, DNA-templated

Functional annotations

Keywords

Keyword ID Keyword Term
KW-0176 Collagen
KW-1015 Disulfide bond
KW-1267 Proteomics identification
KW-1185 Reference proteome
KW-0732 Signal

Interpro

InterPro ID InterPro Term
IPR008160 Collagen
IPR001442 Collagen_IV_NC
IPR036954 Collagen_IV_NC_sf
IPR016187 CTDL_fold

PROSITE

PROSITE ID PROSITE Term
PS51403 NC1_IV

Pfam

Pfam ID Pfam Term
PF01413 C4
PF01391 Collagen