Gene: ACAN

Basic information

Tag Content
Uniprot ID P16112; B9EK55; E7ENV9; E7EX88; H0YM81; Q13650; Q9UCD3; Q9UCP4; Q9UCP5; Q9UDE0;
Entrez ID 176
Genbank protein ID AAA62824.1; AAA35726.1; CAA35463.1; AAC60643.2; AAI50625.1;
Genbank nucleotide ID NM_013227.3; XM_011521314.1; NM_001135.3;
Ensembl protein ID ENSP00000453499; ENSP00000387356; ENSP00000341615;
Ensembl nucleotide ID ENSG00000157766
Gene name Aggrecan core protein
Gene symbol ACAN
Organism Homo sapiens
NCBI taxa ID 9606
Cleft type
Developmental stage
Data sources Homology search
Reference
Functional description This proteoglycan is a major component of the extracellular matrix of cartilaginous tissues. A major function of this protein is to resist compression in cartilage. It binds avidly to hyaluronic acid via an N-terminal globular region.
Sequence
MTTLLWVFVT LRVITAAVTV ETSDHDNSLS VSIPQPSPLR VLLGTSLTIP CYFIDPMHPV 60
TTAPSTAPLA PRIKWSRVSK EKEVVLLVAT EGRVRVNSAY QDKVSLPNYP AIPSDATLEV 120
QSLRSNDSGV YRCEVMHGIE DSEATLEVVV KGIVFHYRAI STRYTLDFDR AQRACLQNSA 180
IIATPEQLQA AYEDGFHQCD AGWLADQTVR YPIHTPREGC YGDKDEFPGV RTYGIRDTNE 240
TYDVYCFAEE MEGEVFYATS PEKFTFQEAA NECRRLGARL ATTGQLYLAW QAGMDMCSAG 300
WLADRSVRYP ISKARPNCGG NLLGVRTVYV HANQTGYPDP SSRYDAICYT GEDFVDIPEN 360
FFGVGGEEDI TVQTVTWPDM ELPLPRNITE GEARGSVILT VKPIFEVSPS PLEPEEPFTF 420
APEIGATAFA EVENETGEAT RPWGFPTPGL GPATAFTSED LVVQVTAVPG QPHLPGGVVF 480
HYRPGPTRYS LTFEEAQQAC LRTGAVIASP EQLQAAYEAG YEQCDAGWLR DQTVRYPIVS 540
PRTPCVGDKD SSPGVRTYGV RPSTETYDVY CFVDRLEGEV FFATRLEQFT FQEALEFCES 600
HNATLATTGQ LYAAWSRGLD KCYAGWLADG SLRYPIVTPR PACGGDKPGV RTVYLYPNQT 660
GLPDPLSRHH AFCFRGISAV PSPGEEEGGT PTSPSGVEEW IVTQVVPGVA AVPVEEETTA 720
VPSGETTAIL EFTTEPENQT EWEPAYTPVG TSPLPGILPT WPPTGAATEE STEGPSATEV 780
PSASEEPSPS EVPFPSEEPS PSEEPFPSVR PFPSVELFPS EEPFPSKEPS PSEEPSASEE 840
PYTPSPPVPS WTELPSSGEE SGAPDVSGDF TGSGDVSGHL DFSGQLSGDR ASGLPSGDLD 900
SSGLTSTVGS GLPVESGLPS GDEERIEWPS TPTVGELPSG AEILEGSASG VGDLSGLPSG 960
EVLETSASGV GDLSGLPSGE VLETTAPGVE DISGLPSGEV LETTAPGVED ISGLPSGEVL 1020
ETTAPGVEDI SGLPSGEVLE TTAPGVEDIS GLPSGEVLET TAPGVEDISG LPSGEVLETT 1080
APGVEDISGL PSGEVLETAA PGVEDISGLP SGEVLETAAP GVEDISGLPS GEVLETAAPG 1140
VEDISGLPSG EVLETAAPGV EDISGLPSGE VLETAAPGVE DISGLPSGEV LETAAPGVED 1200
ISGLPSGEVL ETAAPGVEDI SGLPSGEVLE TAAPGVEDIS GLPSGEVLET AAPGVEDISG 1260
LPSGEVLETA APGVEDISGL PSGEVLETAA PGVEDISGLP SGEVLETAAP GVEDISGLPS 1320
GEVLETAAPG VEDISGLPSG EVLETAAPGV EDISGLPSGE VLETAAPGVE DISGLPSGEV 1380
LETAAPGVED ISGLPSGEVL ETTAPGVEEI SGLPSGEVLE TTAPGVDEIS GLPSGEVLET 1440
TAPGVEEISG LPSGEVLETS TSAVGDLSGL PSGGEVLEIS VSGVEDISGL PSGEVVETSA 1500
SGIEDVSELP SGEGLETSAS GVEDLSRLPS GEEVLEISAS GFGDLSGLPS GGEGLETSAS 1560
EVGTDLSGLP SGREGLETSA SGAEDLSGLP SGKEDLVGSA SGDLDLGKLP SGTLGSGQAP 1620
ETSGLPSGFS GEYSGVDLGS GPPSGLPDFS GLPSGFPTVS LVDSTLVEVV TASTASELEG 1680
RGTIGISGAG EISGLPSSEL DISGRASGLP SGTELSGQAS GSPDVSGEIP GLFGVSGQPS 1740
GFPDTSGETS GVTELSGLSS GQPGISGEAS GVLYGTSQPF GITDLSGETS GVPDLSGQPS 1800
GLPGFSGATS GVPDLVSGTT SGSGESSGIT FVDTSLVEVA PTTFKEEEGL GSVELSGLPS 1860
GEADLSGKSG MVDVSGQFSG TVDSSGFTSQ TPEFSGLPSG IAEVSGESSR AEIGSSLPSG 1920
AYYGSGTPSS FPTVSLVDRT LVESVTQAPT AQEAGEGPSG ILELSGAHSG APDMSGEHSG 1980
FLDLSGLQSG LIEPSGEPPG TPYFSGDFAS TTNVSGESSV AMGTSGEASG LPEVTLITSE 2040
FVEGVTEPTI SQELGQRPPV THTPQLFESS GKVSTAGDIS GATPVLPGSG VEVSSVPESS 2100
SETSAYPEAG FGASAAPEAS REDSGSPDLS ETTSAFHEAN LERSSGLGVS GSTLTFQEGE 2160
ASAAPEVSGE STTTSDVGTE APGLPSATPT ASGDRTEISG DLSGHTSQLG VVISTSIPES 2220
EWTQQTQRPA ETHLEIESSS LLYSGEETHT VETATSPTDA SIPASPEWKR ESESTAAAPA 2280
RSCAEEPCGA GTCKETEGHV ICLCPPGYTG EHCNIDQEVC EEGWNKYQGH CYRHFPDRET 2340
WVDAERRCRE QQSHLSSIVT PEEQEFVNNN AQDYQWIGLN DRTIEGDFRW SDGHPMQFEN 2400
WRPNQPDNFF AAGEDCVVMI WHEKGEWNDV PCNYHLPFTC KKGTVACGEP PVVEHARTFG 2460
QKKDRYEINS LVRYQCTEGF VQRHMPTIRC QPSGHWEEPQ ITCTDPTTYK RRLQKRSSRH 2520
PRRSRPSTAH
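
The sequence above is laid out in groups of ten residues, with a running residue count at the end of each full line. As a minimal sketch for readers who need the sequence as one contiguous string, the following strips the counts and joins the groups. The function name `parse_sequence_block` is hypothetical and not part of any database API, and the sketch assumes that every purely numeric token is a position count and every other token consists of uppercase residue letters.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a grouped, position-numbered sequence block into one string.

    Assumption: purely numeric tokens are running residue counts and are
    dropped; all other tokens must be uppercase residue letters.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                # Running residue count at the end of a full line.
                continue
            if not re.fullmatch(r"[A-Z]+", token):
                raise ValueError(f"unexpected token: {token!r}")
            residues.append(token)
    return "".join(residues)


# First two lines of the sequence block, copied from the record above.
example = """\
MTTLLWVFVT LRVITAAVTV ETSDHDNSLS VSIPQPSPLR VLLGTSLTIP CYFIDPMHPV 60
TTAPSTAPLA PRIKWSRVSK EKEVVLLVAT EGRVRVNSAY QDKVSLPNYP AIPSDATLEV 120
"""
sequence = parse_sequence_block(example)
print(len(sequence))
```

Comparing the length of the joined string with the last running count is a quick check that no group was lost when copying the block.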

Abbreviations:
CLO : cleft lip only. CPO : cleft palate only. CLP : cleft lip and palate. CL/P : cleft lip with/without cleft palate.
For humans: CL/P, CLO, CPO, and CLP. For mice: CLO, CLP, and CPO.

Gene expression information

Gene expression in different tissues (GTEx V7)

  

Gene expression in different tissues (ENCODE)

  

Protein structural annotations

3D structure in PDB database



Protein disorder information

Orthologous information

Relation | Gene symbol | Entrez ID | UniProt ID | Cleft type | Developmental stage | Species | Evidence
1:1 ortholog | ACAN | - | F1N368 | - | - | Bos taurus | Prediction
1:1 ortholog | ACAN | 403828 | A0A5F4DFW2 | - | - | Canis lupus familiaris | Prediction
1:1 ortholog | ACAN | - | A0A452FGP3 | - | - | Capra hircus | Prediction
1:1 ortholog | ACAN | 176 | P16112 | - | - | Homo sapiens | Prediction
1:1 ortholog | Acan | 11595 | Q61282 | CPO | - | Mus musculus | Publication
1:1 ortholog | ACAN | - | A0A2I3SPH2 | - | - | Pan troglodytes | Prediction
1:1 ortholog | ACAN | - | A0A287BRK5 | - | - | Sus scrofa | Prediction
1:1 ortholog | ACAN | 100009079 | G1U677 | - | - | Oryctolagus cuniculus | Prediction
1:1 ortholog | Acan | - | D4A7Y1 | - | - | Rattus norvegicus | Prediction

Other genetic variants/mutations


Disease or phenotype associated information


Gene Ontology (GO)/biological pathways

GO:Molecular Function

GO ID GO Term Evidence
GO:0005201 extracellular matrix structural constituent TAS
GO:0005515 protein binding IPI
GO:0005540 hyaluronic acid binding IEA
GO:0030021 extracellular matrix structural constituent conferring compression resistance RCA
GO:0030246 carbohydrate binding IEA
GO:0046872 metal ion binding IEA

GO:Biological Process

GO ID GO Term Evidence
GO:0001501 skeletal system development NAS
GO:0001501 skeletal system development IBA
GO:0001502 cartilage condensation IEA
GO:0002063 chondrocyte development IEA
GO:0006508 proteolysis NAS
GO:0007155 cell adhesion IEA
GO:0007417 central nervous system development IBA
GO:0007507 heart development IEA
GO:0018146 keratan sulfate biosynthetic process TAS
GO:0030166 proteoglycan biosynthetic process IEA
GO:0030198 extracellular matrix organization TAS
GO:0030199 collagen fibril organization IEA
GO:0042340 keratan sulfate catabolic process TAS

GO:Cellular Component

GO ID GO Term Evidence
GO:0005576 extracellular region TAS
GO:0005604 basement membrane IEA
GO:0005796 Golgi lumen TAS
GO:0031012 extracellular matrix IBA
GO:0043202 lysosomal lumen TAS
GO:0062023 collagen-containing extracellular matrix HDA
GO:0098966 perisynaptic extracellular matrix IEA
GO:0098978 glutamatergic synapse IEA
GO:0098982 GABA-ergic synapse IEA

Reactome Pathway

Reactome ID Reactome Term Evidence
R-HSA-1430728 Metabolism TAS
R-HSA-1474228 Degradation of the extracellular matrix IEA
R-HSA-1474228 Degradation of the extracellular matrix TAS
R-HSA-1474244 Extracellular matrix organization IEA
R-HSA-1474244 Extracellular matrix organization TAS
R-HSA-1630316 Glycosaminoglycan metabolism TAS
R-HSA-1638074 Keratan sulfate/keratin metabolism TAS
R-HSA-1643685 Disease TAS
R-HSA-2022854 Keratan sulfate biosynthesis TAS
R-HSA-2022857 Keratan sulfate degradation TAS
R-HSA-3000178 ECM proteoglycans TAS
R-HSA-3560782 Diseases associated with glycosaminoglycan metabolism TAS
R-HSA-3656225 Defective CHST6 causes MCDC1 TAS
R-HSA-3656243 Defective ST3GAL3 causes MCT12 and EIEE15 TAS
R-HSA-3656244 Defective B4GALT1 causes B4GALT1-CDG (CDG-2d) TAS
R-HSA-3781865 Diseases of glycosylation TAS
R-HSA-71387 Metabolism of carbohydrates TAS

Drugs and compounds information


Functional annotations

Keywords

Keyword ID Keyword Term
KW-0002 3D-structure
KW-0025 Alternative splicing
KW-0106 Calcium
KW-0903 Direct protein sequencing
KW-0225 Disease mutation
KW-1015 Disulfide bond
KW-0242 Dwarfism
KW-0245 EGF-like domain
KW-0272 Extracellular matrix
KW-0325 Glycoprotein
KW-0393 Immunoglobulin domain
KW-0430 Lectin
KW-0479 Metal-binding
KW-0621 Polymorphism
KW-0654 Proteoglycan
KW-1185 Reference proteome
KW-0677 Repeat
KW-0964 Secreted
KW-0732 Signal
KW-0768 Sushi

Interpro

InterPro ID InterPro Term
IPR001304 C-type_lectin-like
IPR016186 C-type_lectin-like/link_sf
IPR018378 C-type_lectin_CS
IPR033987 CSPG_CTLD
IPR016187 CTDL_fold
IPR013032 EGF-like_CS
IPR000742 EGF-like_dom
IPR007110 Ig-like_dom
IPR036179 Ig-like_dom_sf
IPR013783 Ig-like_fold
IPR003006 Ig/MHC_CS
IPR003599 Ig_sub
IPR013106 Ig_V-set
IPR000538 Link_dom
IPR035976 Sushi/SCR/CCP_sf
IPR000436 Sushi_SCR_CCP_dom

PROSITE

PROSITE ID PROSITE Term
PS00615 C_TYPE_LECTIN_1
PS50041 C_TYPE_LECTIN_2
PS00022 EGF_1
PS01186 EGF_2
PS50026 EGF_3
PS50835 IG_LIKE
PS00290 IG_MHC
PS01241 LINK_1
PS50963 LINK_2
PS50923 SUSHI

Pfam

Pfam ID Pfam Term
PF00059 Lectin_C
PF00084 Sushi
PF07686 V-set
PF00193 Xlink

Protein-protein interaction

Protein-miRNA interaction