Gene: Col4a4

Basic information

Tag Content
Uniprot ID A0A0G2K742
Entrez ID
Genbank protein ID
Genbank nucleotide ID
Ensembl protein ID ENSRNOP00000074055
Ensembl nucleotide ID ENSRNOG00000014851
Gene name Collagen type IV alpha 4 chain
Gene symbol Col4a4
Organism Rattus norvegicus
NCBI taxa ID 10116
Cleft type
Developmental stage
Data sources Homology search
Reference
Functional description
Sequence
MRCFFRWTES YGTGPWSLIF ILFTIQYEYG SSKKYGSPCG GRNCSVCQCF PEKGSRGHPG 60
PLGPQGPIGP LGPPGPIGIP GEKGLRGDSG LPGPPGEKGD KGPTGVPGFP GVDGVPGHPG 120
PPGPRGKPGM DGYNGSRGDP GYPGERGAPG PGGPPGQPGE NGEKGRSVYI SGGVKGTQGD 180
RGDPGPLGLP GSRGAQGSTG PMGHAGTPGL AGPIGHPGSP GMKGDPAMGL KGQKGEPGEV 240
GQHGPLGPTL LVQPPDLGVY KGEKGVKGVP GRIGSPGPPG RKGEPGIGVK GEKGIPGFPG 300
PRGEPGSHGS PGFPGFKGIQ GAAGDPGLFG LRGPKGDPGD RGNPGPPGIL VTPAPPLKGV 360
PGDPGPPGHY GEIGDVGLPG PPGLPGRPGE TCPGIVGPPG PPGVPGPPGF PGDAGIPGRL 420
DCAPGKPGKP GLPGLPGAPG PEGPPGSNVI YCRPGYPGPM GEKGKMGPPG RRGAKGAKGN 480
EGLCDCPPGP MGPPGPPGPP GRQGGKGDLG LPGWHGEKGD PGQPGAEGPP GPPGRPGAVG 540
PPGLKGAKGD MVISRAKGQK GERGLDGPPG FPGPHGQDGR DGRAGERGDP GPRGDHKDAA 600
PGERGLPGLP GPPGKAGPEG PPGLGFPGPP GERGLPGEPG RPGMRGFDGM KGQKGDSIPC 660
NVTYPGKPGP PGFDGPPGLK GFPGPPGAPG MRCPVGQKGQ RGKPGMPGIP GPPGFRGVVG 720
DPGIKGERGT SPFGPPGPPG PPGMDGQKGM PGDSAFGDPG PPGERGLPGA PGMKGQKGYP 780
GCPGAGGPPG IPGSPGLKGP KGREGSPGLP GTPGSPGHSC ERGAPGIPGQ PGLPGTPGDP 840
GPPGWKGQPG DMGPSGPAGM KGLPGLPGLP GADGLRGPPG IPGLTGEEGP PALPALKGAP 900
GLPGFPGFPG ERGKSGPDGE PGRKGEAGEK GWPGLQGAPG ERGAKGDRGP PGDVGETAVS 960
RKGEPGDAGP PGDGGFSGER GDKGIPGIQG GRGDPGRDGP PGLHRGQPGM DGPPGPPGPP 1020
GPPGSPGLRG VIGFPGFPGD QGDPGSPGPP GFSGDDGARG PKGNKGDPAS QYGLPGPKGE 1080
PGSPGYQGHT GDSGEKGFPG DEGPRGPPGR PGQPGSLGPP GCPGDPGMPG QKGHPGEVGD 1140
PGPRGYSGDL GRPGPAGVKG PPGSPGLNGL HGLKGEKGAK GASGLLEMGP PGPMGTPGLK 1200
GEKGDPGSPG ISPPGLPGEK GFPGPPGRPG APGSAGTPGR AAKGDIPDPG PPGDWGPPGP 1260
DGPRGVPGPP GPRGNVSLLK GDPGDHGLPG PPGSRGPPGP PGCQGPPGCD GKDGQKGPMG 1320
LPGLPGPPGL PGAPGEKGLP GPPGRKGPVG PPGCRGEPGP PVDVDSCVPI PGLPGVPGPR 1380
GPEGAMGDPG QRGLPGPGCK GEAGLDGRRG QDGIPGSPGP PGRNGDTGEA GRTGAPGPPG 1440
MTGDPGPKGF GPGYLSGFLL VLHSQTDQEP ACPVGMPRLW TGYSLLYLEG QEKAHNQDLG 1500
LAGSCLPVFS TLPFAYCNIH QVCHYAQRND RSYWLSSAAP LPMMPLSEEE IRPYVSRCAV 1560
CEAPAQAVAV HSQDQSIPPC PQTWRSLWIG YSFLMHTGAG DQGGGQALMS PGSCLEDFRA 1620
APFLECQGRQ GTCHFFANKY SFWLTTVNPD LQFSSGPAPD TLKEVQAQRR KTSRCQVCMK 1680
HS 1682

Abbreviation :
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Protein disorder information

Orthologous information

Relation Gene symbol Entrez ID UniProt ID Cleft type Developmental stage Species Evidence Details
1:1 orthologCOL4A4A0A452G3N1Capra hircusPredictionMore>>
1:1 orthologCOL4A41286P53420Homo sapiensPublicationMore>>
1:1 orthologCol4a412829Q9QZR9Mus musculusPredictionMore>>
1:1 orthologCOL4A4459986H2R630Pan troglodytesPredictionMore>>
1:1 orthologCol4a4A0A0G2K742Rattus norvegicusPredictionMore>>

Gene ontology

GO ID GO Term
GO:0005604 basement membrane
GO:0005587 collagen type IV trimer
GO:0031012 extracellular matrix
GO:0005615 extracellular space
GO:0005201 extracellular matrix structural constituent
GO:0038063 collagen-activated tyrosine kinase receptor signaling pathway
GO:0030198 extracellular matrix organization
GO:0032836 glomerular basement membrane development

Functional annotations

Keywords

Keyword ID Keyword Term
KW-0176 Collagen
KW-1015 Disulfide bond
KW-1185 Reference proteome

Interpro

InterPro ID InterPro Term
IPR008160 Collagen
IPR001442 Collagen_IV_NC
IPR036954 Collagen_IV_NC_sf
IPR016187 CTDL_fold

PROSITE

PROSITE ID PROSITE Term
PS51403 NC1_IV

Pfam

Pfam ID Pfam Term
PF01413 C4
PF01391 Collagen