Gene: COL2A1

Basic information

Tag Content
Uniprot ID A0A5F5PIB7
Entrez ID 791241
Genbank protein ID
Genbank nucleotide ID XM_005611082.2
Ensembl protein ID ENSECAP00000048092
Ensembl nucleotide ID ENSECAG00000022313
Gene name Collagen type II alpha 1 chain
Gene symbol COL2A1
Organism Equus caballus
NCBI taxa ID 9796
Cleft type
Developmental stage
Data sources Homology search
Reference
Functional description
Sequence
MIRLGAPQTL VLLTLLVAAV LRCHGQDVQK AGSCVQDGQR YNDKDVWKPE PCRICVCDTG 60
TVLCDDIICE DMKDCLSPET PFGECCPICS TDLATTSGQP GPKGQKGEPG DIKDIVGPKG 120
PPGPQGPAGE QGPRGDRGDK GEKGAPGPRG RDGEPGTPGN PGPPGPPGPP GPPGLGGNFA 180
AQMAGGFDEK AGGAQMGVMQ GPMGPMGPRG PPGPAGAPGP QGFQGNPGEP GEPGVSGPMG 240
PRGPPGPPGK PGDDGEAGKP GKSGERGPPG PQGARGFPGT PGLPGVKGHR GYPGLDGAKG 300
EAGAPGVKGE SGSPGENGSP GPMGPRGLPG ERGRTGPAGA AGARGNDGQP GPAGPPGPVG 360
PAGGPGFPGA PGAKGEAGPT GARGPEGAQG PRGEPGTPGS PGPAGAAGNP GTDGIPGAKG 420
SAGAPGIAGA PGFPGPRGPP GPQGATGPLG PKGQTGEPGI AGFKGEQGPK GEPGPAGPQG 480
APGPAGEEGK RGARGEPGGA GPVGPPGERG APGNRGFPGQ DGLAGPKGAP GERGPSGLAG 540
PKGANGDPGR PGEPGLPGAR GLTGRPGDAG PQGKVGPSGA PGEDGRPGPP GPQGARGQPG 600
VMGFPGPKGA NGEPGKAGEK GLPGAPGLRG LPGKDGETGA AGPPGPAGPA GERGEQGAPG 660
PSGFQGLPGP PGPPGEGGKP GDQGVPGEAG APGLVGPRGE RGFPGERGSP GAQGLQGARG 720
LPGTPGTDGP KGASGPAGPP GAQGPPGLQG MPGERGAAGI AGPKGDRGDV GEKGPEGAPG 780
KDGGRGLTGP IGPPGPAGAN GEKGEVGPPG PAGTAGARGA PGERGETGPP GPAGFAGPPG 840
ADGQPGAKGE QGEAGQKGDA GAPGPQGPSG APGPQGPTGV TGPKGARGAQ GPPGATGFPG 900
AAGRVGPPGS NGNPGPPGPP GPSGKDGPKG ARGDSGPPGR AGDPGLQGPA GPPGEKGEPG 960
DDGPSGPDGP PGPQGLAGQR GIVGLPGQRG ERGFPGLPGP SGEPGKQGAP GASGDRGPPG 1020
PVGPPGLTGP AGEPGREGTP GADGPPGRDG AAGVKGDRGE AGALGAPGAP GPPGSPGPAG 1080
PTGKQGDRGE AGAQGPMGPA GPAGARGLPG PQGPRGDKGE AGEAGERGLK GHRGFTGLQG 1140
LPGPPGPSGD QGASGPAGPS GPRGPPGPVG PSGKDGANGI PGPIGPPGPR GRSGETGPAG 1200
PPGNPGPPGP PGPPGPGIDM SAFAGLGPRE KGPDPLQYMR ADEAAGGLRP HDEEVEATLK 1260
SLNNQIESIR SPEGSRKNPA RTCRDLKLCH PEWKSGDYWI DPNQGCTLDA MKVFCNMETG 1320
ETCVYPNPAN VPKKNWWSSK SKDKKHIWFG ETINGGFHFS YGDDNLAPNT ANVQMTFLRL 1380
LSTEGSQNIT YHCKNSIAYL DEAAGNLKKA LLIQGSNDVE IRAEGNSRFT YTVLKDGCTK 1440
HTGKWGKTTI EYRSQKTSRL PIIDIAPMDI GGPEQEFGVD IGPVCFL 1487
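
The sequence above is laid out in groups of 10 residues, with the cumulative residue count at the end of each line (the last line ends at 1487). As a minimal sketch, assuming that layout holds for every line, the block can be joined into one plain string while checking each running count. The function name `parse_numbered_sequence` and the variable names are hypothetical and not part of this record; the example input is the first two lines of the sequence.

```python
def parse_numbered_sequence(block: str) -> str:
    """Join a numbered sequence block into a single residue string.

    Assumes each line holds whitespace-separated residue groups,
    optionally followed by the cumulative residue count.
    """
    residues = []
    total = 0
    for line in block.strip().splitlines():
        tokens = line.split()
        if not tokens:
            continue
        # A trailing all-digit token is treated as the running count.
        expected = int(tokens.pop()) if tokens[-1].isdigit() else None
        chunk = "".join(tokens)
        total += len(chunk)
        if expected is not None and expected != total:
            raise ValueError(
                f"count mismatch: line says {expected}, parsed {total}"
            )
        residues.append(chunk)
    return "".join(residues)


# First two lines of the sequence shown in this record.
example = """\
MIRLGAPQTL VLLTLLVAAV LRCHGQDVQK AGSCVQDGQR YNDKDVWKPE PCRICVCDTG 60
TVLCDDIICE DMKDCLSPET PFGECCPICS TDLATTSGQP GPKGQKGEPG DIKDIVGPKG 120
"""
sequence = parse_numbered_sequence(example)
print(len(sequence))
```

Checking the running count on every line catches a dropped or duplicated residue group early, instead of only noticing a wrong total at the end.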

Abbreviations:
CLO: cleft lip only. CPO: cleft palate only. CLP: cleft lip and palate. CL/P: cleft lip with or without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Protein disorder information

Orthologous information

Relation      Gene symbol  Entrez ID  UniProt ID  Cleft type  Developmental stage  Species                Evidence
1:1 ortholog  COL2A1       407142     P02459      -           -                    Bos taurus             Prediction
1:1 ortholog  COL2A1       -          A0A452E0F9  -           -                    Capra hircus           Prediction
1:1 ortholog  COL2A1       791241     A0A5F5PIB7  -           -                    Equus caballus         Prediction
1:1 ortholog  COL2A1       1280       P02458      -           -                    Homo sapiens           Prediction
1:1 ortholog  Col2a1       12824      P28481      CPO         -                    Mus musculus           Publication
1:1 ortholog  COL2A1       451860     H2Q5S8      -           -                    Pan troglodytes        Prediction
1:1 ortholog  COL2A1       100009005  G1T5V9      -           -                    Oryctolagus cuniculus  Prediction
1:1 ortholog  Col2a1       -          F1LRM7      -           -                    Rattus norvegicus      Prediction

Gene ontology

GO ID GO Term
GO:0005201 extracellular matrix structural constituent

Functional annotations

Keywords

Keyword ID Keyword Term
KW-1185 Reference proteome
KW-0732 Signal

Interpro

InterPro ID InterPro Term
IPR008160 Collagen
IPR000885 Fib_collagen_C
IPR001007 VWF_dom

PROSITE

PROSITE ID PROSITE Term
PS51461 NC1_FIB
PS01208 VWFC_1
PS50184 VWFC_2

Pfam

Pfam ID Pfam Term
PF01410 COLFI
PF01391 Collagen
PF00093 VWC