Gene: COL2A1

Basic information

Tag Content
Uniprot ID H2Q5S8; A0A2J8LQI4;
Entrez ID 451860
Genbank protein ID
Genbank nucleotide ID XM_001161825.5
Ensembl protein ID ENSPTRP00000008327
Ensembl nucleotide ID ENSPTRG00000004871
Gene name Collagen type II alpha 1 chain
Gene symbol COL2A1
Organism Pan troglodytes
NCBI taxa ID 9598
Cleft type
Developmental stage
Data sources Homology search
Reference
Functional description
Sequence
MIRLGALQTL VLLTLLVAAV LRCQGQDVRQ PGPKGQKGEP GDIKDIVGPK GPPGPQGPAG 60
EQGPRGDRGD KGEKGAPGPR GRDGEPGTPG NPGPPGPPGP PGPPGLGGNF AAQMAGGFDE 120
KAGGAQLGVM QGPMGPMGPR GPPGPAGAPG PQGFQGNPGE PGEPGVSGPM GPRGPPGPPG 180
KPGDDGEAGK PGKAGERGPP GPQGARGFPG TPGLPGVKGH RGYPGLDGAK GEAGAPGVKG 240
ESGSPGENGS PGPMGPRGLP GERGRTGPAG AAGARGNDGQ PGPAGPPGPV GPAGGPGFPG 300
APGAKGEAGP TGARGPEGAQ GPRGEPGTPG SPGPAGASGN PGTDGIPGAK GSAGAPGIAG 360
APGFPGPRGP PGPQGATGPL GPKGQTGEPG IAGFKGEQGP KGEPGPAGPQ GAPGPAGEEG 420
KRGARGEPGG VGPIGPPGER GAPGNRGFPG QDGLAGPKGA PGERGPSGLA GPKGANGDPG 480
RPGEPGLPGA RGLTGRPGDA GPQGKVGPSG APGEDGRPGP PGPQGARGQP GVMGFPGPKG 540
ANGEPGKAGE KGLPGAPGLR GLPGKDGETG AAGPPGPAGP AGERGEQGAP GPSGFQGLPG 600
PPGPPGEGGK PGDQGVPGEA GAPGLVGPRG ERGFPGERGS PGAQGLQGPR GLPGTPGTDG 660
PKGASGPAGP PGAQGPPGLQ GMPGERGAAG IAGPKGDRGD VGEKGPEGAP GKDGGRGLTG 720
PIGPPGPAGA NGEKGEVGPP GPAGSAGARG APGERGETGP PGPAGFAGPP GADGQPGAKG 780
EQGEAGQKGD AGAPGPQGPS GAPGPQGPTG VTGPKGARGA QGPPGATGFP GAAGRVGPPG 840
SNGNPGPPGP PGPSGKDGPK GARGDSGPPG RAGEPGLQGP AGPPGEKGEP GDDGPSGAEG 900
PPGPQGLAGQ RGIVGLPGQR GERGFPGLPG PSGEPGKQGA PGASGDRGPP GPVGPPGLTG 960
PAGEPGREGS PGADGPPGRD GAAGVKGDRG ETGAVGAPGA PGPPGSPGPA GPTGKQGDRG 1020
EAGAQGPMGP SGPAGARGIQ GPQGPRGDKG EAGEPGERGL KGHRGFTGLQ GLPGPPGPSG 1080
DQGASGPAGP SGPRGPPGPV GPSGKDGANG IPGPIGPPGP RGRSGETGPA GPPGNPGPPG 1140
PPGPPGPGID MSAFAGLGPR EKGPDPLQYM RADQAAGGLR QHDAEVDATL KSLNNQIESI 1200
RSPEGSRKNP ARTCRDLKLC HPEWKSGDYW IDPNQGCTLD AMKVFCNMET GETCVYPNPA 1260
NVPKKNWWSS KSKEKKHIWF GETINGGFHF SYGDDNLAPN TANVQMTFLR LLSTEGSQNI 1320
TYHCKNSIAY LDEAAGNLKK ALLIQGSNDV EIRAEGNSRF TYTALKDGCT KHTGKWGKTV 1380
IEYRSQKTSR LPIIDIAPMD IGGPEQEFGV DIGPVCFL 1418

Abbreviation :
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Protein disorder information

Orthologous information

Relation Gene symbol Entrez ID UniProt ID Cleft type Developmental stage Species Evidence Details
1:1 orthologCOL2A1407142P02459Bos taurusPredictionMore>>
1:1 orthologCOL2A1A0A452E0F9Capra hircusPredictionMore>>
1:1 orthologCOL2A1791241A0A5F5PIB7Equus caballusPredictionMore>>
1:1 orthologCOL2A11280P02458Homo sapiensPredictionMore>>
1:1 orthologCol2a112824P28481CPOMus musculusPublicationMore>>
1:1 orthologCOL2A1451860H2Q5S8Pan troglodytesPredictionMore>>
1:1 orthologCOL2A1100009005G1T5V9Oryctolagus cuniculusPredictionMore>>
1:1 orthologCol2a1F1LRM7Rattus norvegicusPredictionMore>>

Gene ontology

GO ID GO Term
GO:0005201 extracellular matrix structural constituent

Functional annotations

Keywords

Keyword ID Keyword Term
KW-1185 Reference proteome

Interpro

InterPro ID InterPro Term
IPR008160 Collagen
IPR000885 Fib_collagen_C

PROSITE

PROSITE ID PROSITE Term
PS51461 NC1_FIB

Pfam

Pfam ID Pfam Term
PF01410 COLFI
PF01391 Collagen