Pulmonary Arterial Hypertension KnowledgeBase (bioinfom_tsdb)

Basic Information

Gene ID

4436

Name

MSH2

Synonym

COCA1|FCC1|HNPCC|HNPCC1|LCFS2;mutS homolog 2;MSH2

Definition

DNA mismatch repair protein Msh2|hMSH2|mutS homolog 2, colon cancer, nonpolyposis type 1

Position

2p21

Gene Type

protein-coding

TSG scores

TUSON ranking

354

TUSON P-value

0.004457398

Pathways and Diseases

Pathway

Mismatch repair;KEGG PATHWAY;hsa03430

Pathway

Pathways in cancer;KEGG PATHWAY;hsa05200

Pathway

Direct p53 effectors;PID Curated;200101

Pathway

Colorectal cancer;KEGG PATHWAY;hsa05210

Disease

Colorectal cancer;KEGG DISEASE;H00020

Disease

Cancers;KEGG DISEASE

Disease

endometrial cancer;GAD

Disease

Mismatch repair cancer syndrome;OMIM

Disease

leukemia;GAD

Disease

Female reproductive cancer;FunDO

Disease

Muir-Torre syndrome;OMIM

Disease

Colorectal cancer, hereditary nonpolyposis, type 1;OMIM

Disease

CANCER;GAD

Disease

Ovarian cancer;KEGG DISEASE;H00027

Disease

Testicular dysfunction;FunDO

Disease

Gastritis;FunDO

Disease

Cancer;FunDO

Disease

Cancers of the digestive system;KEGG DISEASE

Disease

Yersinia infection;FunDO

Disease

Cancers of the breast and female genital organs;KEGG DISEASE

Disease

Lymphoma, Non-Hodgkin;GAD

Disease

colorectal cancer;GAD
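
Each Pathway and Disease value above is a semicolon-separated string of the form name;source, with an optional third field that looks like an accession (for example hsa03430 or H00020). A minimal sketch of splitting such a value into fields follows. The `Annotation` type and the `parse_annotation` function are hypothetical names introduced only for illustration, and the field layout is inferred from the entries listed here rather than from any documented schema.

```python
from typing import NamedTuple, Optional


class Annotation(NamedTuple):
    """One Pathway or Disease entry (hypothetical container, not part of the database)."""
    kind: str                 # the label line, e.g. "Pathway" or "Disease"
    name: str                 # e.g. "Mismatch repair"
    source: str               # e.g. "KEGG PATHWAY", "OMIM", "GAD", "FunDO"
    accession: Optional[str]  # e.g. "hsa03430"; None when the entry has no third field


def parse_annotation(kind: str, value: str) -> Annotation:
    """Split a 'name;source[;accession]' value into its fields.

    Assumes ';' never occurs inside a name. Commas are allowed, as in
    'Lymphoma, Non-Hodgkin;GAD'.
    """
    parts = [p.strip() for p in value.split(";")]
    if len(parts) < 2 or not parts[0] or not parts[1]:
        raise ValueError(f"expected 'name;source[;accession]', got {value!r}")
    accession = parts[2] if len(parts) > 2 and parts[2] else None
    return Annotation(kind, parts[0], parts[1], accession)


if __name__ == "__main__":
    print(parse_annotation("Pathway", "Mismatch repair;KEGG PATHWAY;hsa03430"))
    print(parse_annotation("Disease", "Muir-Torre syndrome;OMIM"))
```

Keeping the accession optional reflects that only some entries in this record (the KEGG and PID ones) carry a third field.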

External Links

Links to Entrez Gene

4436

Links to all GeneRIF Items

4436

Links to iHOP

4436

Sequence Information

Only the longest representative nucleotide and protein sequences are shown here; other isoforms are not included.

Nucleotide Sequence

>4436 : length: 2805
atggcggtgcagccgaaggagacgctgcagttggagagcgcggccgaggtcggcttcgtg
cgcttctttcagggcatgccggagaagccgaccaccacagtgcgccttttcgaccggggc
gacttctatacggcgcacggcgaggacgcgctgctggccgcccgggaggtgttcaagacc
cagggggtgatcaagtacatggggccggcaggagcaaagaatctgcagagtgttgtgctt
agtaaaatgaattttgaatcttttgtaaaagatcttcttctggttcgtcagtatagagtt
gaagtttataagaatagagctggaaataaggcatccaaggagaatgattggtatttggca
tataaggcttctcctggcaatctctctcagtttgaagacattctctttggtaacaatgat
atgtcagcttccattggtgttgtgggtgttaaaatgtccgcagttgatggccagagacag
gttggagttgggtatgtggattccatacagaggaaactaggactgtgtgaattccctgat
aatgatcagttctccaatcttgaggctctcctcatccagattggaccaaaggaatgtgtt
ttacccggaggagagactgctggagacatggggaaactgagacagataattcaaagagga
ggaattctgatcacagaaagaaaaaaagctgacttttccacaaaagacatttatcaggac
ctcaaccggttgttgaaaggcaaaaagggagagcagatgaatagtgctgtattgccagaa
atggagaatcaggttgcagtttcatcactgtctgcggtaatcaagtttttagaactctta
tcagatgattccaactttggacagtttgaactgactacttttgacttcagccagtatatg
aaattggatattgcagcagtcagagcccttaacctttttcagggttctgttgaagatacc
actggctctcagtctctggctgccttgctgaataagtgtaaaacccctcaaggacaaaga
cttgttaaccagtggattaagcagcctctcatggataagaacagaatagaggagagattg
aatttagtggaagcttttgtagaagatgcagaattgaggcagactttacaagaagattta
cttcgtcgattcccagatcttaaccgacttgccaagaagtttcaaagacaagcagcaaac
ttacaagattgttaccgactctatcagggtataaatcaactacctaatgttatacaggct
ctggaaaaacatgaaggaaaacaccagaaattattgttggcagtttttgtgactcctctt
actgatcttcgttctgacttctccaagtttcaggaaatgatagaaacaactttagatatg
gatcaggtggaaaaccatgaattccttgtaaaaccttcatttgatcctaatctcagtgaa
ttaagagaaataatgaatgacttggaaaagaagatgcagtcaacattaataagtgcagcc
agagatcttggcttggaccctggcaaacagattaaactggattccagtgcacagtttgga
tattactttcgtgtaacctgtaaggaagaaaaagtccttcgtaacaataaaaactttagt
actgtagatatccagaagaatggtgttaaatttaccaacagcaaattgacttctttaaat
gaagagtataccaaaaataaaacagaatatgaagaagcccaggatgccattgttaaagaa
attgtcaatatttcttcaggctatgtagaaccaatgcagacactcaatgatgtgttagct
cagctagatgctgttgtcagctttgctcacgtgtcaaatggagcacctgttccatatgta
cgaccagccattttggagaaaggacaaggaagaattatattaaaagcatccaggcatgct
tgtgttgaagttcaagatgaaattgcatttattcctaatgacgtatactttgaaaaagat
aaacagatgttccacatcattactggccccaatatgggaggtaaatcaacatatattcga
caaactggggtgatagtactcatggcccaaattgggtgttttgtgccatgtgagtcagca
gaagtgtccattgtggactgcatcttagcccgagtaggggctggtgacagtcaattgaaa
ggagtctccacgttcatggctgaaatgttggaaactgcttctatcctcaggtctgcaacc
aaagattcattaataatcatagatgaattgggaagaggaacttctacctacgatggattt
gggttagcatgggctatatcagaatacattgcaacaaagattggtgctttttgcatgttt
gcaacccattttcatgaacttactgccttggccaatcagataccaactgttaataatcta
catgtcacagcactcaccactgaagagaccttaactatgctttatcaggtgaagaaaggt
gtctgtgatcaaagttttgggattcatgttgcagagcttgctaatttccctaagcatgta
atagagtgtgctaaacagaaagccctggaacttgaggagtttcagtatattggagaatcg
caaggatatgatatcatggaaccagcagcaaagaagtgctatctggaaagagagcaaggt
gaaaaaattattcaggagttcctgtccaaggtgaaacaaatgccctttactgaaatgtca
gaagaaaacatcacaataaagttaaaacagctaaaagctgaagtaatagcaaagaataat
agctttgtaaatgaaatcatttcacgaataaaagttactacgtga

Protein Sequence

>4436 : length: 934
MAVQPKETLQLESAAEVGFVRFFQGMPEKPTTTVRLFDRGDFYTAHGEDALLAAREVFKT
QGVIKYMGPAGAKNLQSVVLSKMNFESFVKDLLLVRQYRVEVYKNRAGNKASKENDWYLA
YKASPGNLSQFEDILFGNNDMSASIGVVGVKMSAVDGQRQVGVGYVDSIQRKLGLCEFPD
NDQFSNLEALLIQIGPKECVLPGGETAGDMGKLRQIIQRGGILITERKKADFSTKDIYQD
LNRLLKGKKGEQMNSAVLPEMENQVAVSSLSAVIKFLELLSDDSNFGQFELTTFDFSQYM
KLDIAAVRALNLFQGSVEDTTGSQSLAALLNKCKTPQGQRLVNQWIKQPLMDKNRIEERL
NLVEAFVEDAELRQTLQEDLLRRFPDLNRLAKKFQRQAANLQDCYRLYQGINQLPNVIQA
LEKHEGKHQKLLLAVFVTPLTDLRSDFSKFQEMIETTLDMDQVENHEFLVKPSFDPNLSE
LREIMNDLEKKMQSTLISAARDLGLDPGKQIKLDSSAQFGYYFRVTCKEEKVLRNNKNFS
TVDIQKNGVKFTNSKLTSLNEEYTKNKTEYEEAQDAIVKEIVNISSGYVEPMQTLNDVLA
QLDAVVSFAHVSNGAPVPYVRPAILEKGQGRIILKASRHACVEVQDEIAFIPNDVYFEKD
KQMFHIITGPNMGGKSTYIRQTGVIVLMAQIGCFVPCESAEVSIVDCILARVGAGDSQLK
GVSTFMAEMLETASILRSATKDSLIIIDELGRGTSTYDGFGLAWAISEYIATKIGAFCMF
ATHFHELTALANQIPTVNNLHVTALTTEETLTMLYQVKKGVCDQSFGIHVAELANFPKHV
IECAKQKALELEEFQYIGESQGYDIMEPAAKKCYLEREQGEKIIQEFLSKVKQMPFTEMS
EENITIKLKQLKAEVIAKNNSFVNEIISRIKVTT
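
Both sequence blocks above use a FASTA-like header of the form `>4436 : length: N`, followed by sequence lines. As a rough sanity check, the declared length can be compared with the number of residues actually present, and the coding sequence length can be compared with the protein length (one codon per amino acid plus one stop codon, so 2805 = 3 × (934 + 1) for this record). The sketch below assumes that header layout. The function names `parse_record`, `length_matches` and `cds_matches_protein` are hypothetical, and the short example record is a toy, not the full MSH2 sequence.

```python
import re

# Header layout as it appears in this record: ">4436 : length: 2805"
HEADER = re.compile(r"^>(\S+)\s*:\s*length:\s*(\d+)\s*$")


def parse_record(text: str) -> tuple[str, int, str]:
    """Return (gene_id, declared_length, sequence) from one FASTA-like block."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines:
        raise ValueError("empty record")
    match = HEADER.match(lines[0])
    if match is None:
        raise ValueError(f"unrecognised header: {lines[0]!r}")
    gene_id, declared = match.group(1), int(match.group(2))
    sequence = "".join(lines[1:])  # sequence lines are simply concatenated
    return gene_id, declared, sequence


def length_matches(text: str) -> bool:
    """True when the header's declared length equals the sequence length."""
    _, declared, sequence = parse_record(text)
    return declared == len(sequence)


def cds_matches_protein(cds_length: int, protein_length: int) -> bool:
    """True when a CDS has exactly one codon per residue plus a stop codon."""
    return cds_length == 3 * (protein_length + 1)


if __name__ == "__main__":
    toy = ">4436 : length: 9\natggcg\ngtg\n"  # toy record, first 9 bases only
    print(length_matches(toy))
    print(cds_matches_protein(2805, 934))
```

A mismatch in either check would suggest a truncated copy of the sequence or a header that refers to a different isoform.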


