Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_81012 |
Symbol | MSH6 |
ID | 4851846 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2994912 |
End bp | 2998707 |
Gene Length | 3796 bp |
Protein Length | 1212 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393554 |
Product | Mismatch repair ATPase MSH6 (MutS family) |
Protein accession | XP_001387139 |
Protein GI | 126275773 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0833152 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAGG TTCGCAAACC CTCCACACCA TTGAGATCTG GTTCAGCTGC TGTGAAGAAT GGCTCTGCCA GCTCTTCATC TAAACTTAAG CAGCTGAGCT TGATGTCTTT CTTCAAGCCA GCTTCCAAAC CAGATTCTGA AAAGACAAAA GATAGTGCCA ACCCGACCAT GACTCAACCT CCGGCTTCAT CCTCGCCCTT GAGAGCCAAA TCGGAACAAT CAAGACAGAA TCTGAACGAA GCTTCAGATA CGTTGAATAC TTCTGTATTG GCCAGTGAAG CTGATTCTCA TTCAGATAAG GAAAACGAAA ACCAAATCAT GTCCGAACAC AACGCTGTTG AAACAGACAC CCCGTTGTCT TCTGAAACAG GAACTACACC GATTCGTGTA TCCAAGAAAG TTTCATTGCC CAAATCGTCT CCAATCCAGC CGAAAAAGGG AGAATTCAAG CGAAAGCCCA AACCGTTGGC CGAGAATCAC CTCAATTCAA GTCCGTTAGT CAAACGTCGT TCTGCATCTC GTAGCGTCAG CTACGCTGAG TCAGATAGCG AAGACGAAGC TGTCAATCAA ACATCAAGGA AGAGGAGAAA GGTTATAGAA AGTGATGATG ACGAAGAGGA TGATTTCAAA CCGGCTGAAG AAGATGACGA TGACGATGAT ATGAGCGACT TCATCGTTGA TGATGATAAG GAAAGTGAGC CTGAGATAGA AGAAGATGAT GAAGATGACT TTGCAGAAGA GACGCCAAGA TCTAAGAAGT CCAAGTCCAA GTCAAGTTCG TCCTCATCCA GAAGCTCTCC TTCAAAGGAA TCTTCTATTT CCAGTAACGT TCTAGGCGAT AAATTCAAGG CTGGTTCCTC GTATAAAGCC ACTCAACCTG CTACTAAGCC AAAATCTATT ACTCCAGTTA AAACAACTCC AAAGAAGAAC TTCTCCAAAG AAAACGAAGA GAGATACCAA TGGCTCGTAG ATGTCAGAGA TGCAGAAAAG AGAACTACAG ATGACCCCAA CTACGATCCT CGAACATTGC ACGTACCCCA ATCTGCTTGG TCGAAATTTA CTGCGTTTGA AAAACAGTAC TGGGAAATCA AGTCTAAGAT GTACAACACT GTAGTTTTCT TCAAGAAAGG TAAGTTCTAC GAATTATACG AAAACGATGC TACGATTGCC AACACTGAAT TTGATTTGAA AATAGCTGGC GGAGGACGGG CCAACATGAA GTTGGCGGGC ATTCCTGAGA TGTCGTTTGA GTACTGGGCA AAAGAGTTTA TTAGCCATGG ATACAAAGTC GCTAAAGTTG ATCAAGTAGA AAGTCTTTTA GCAAAAGAGA TGAGAGGTGG CGGTACTAAA GAAGAAAAGA TTATCAAAAG AGAGTTGACT GGTGTCTTGA CCGGGGGCAC CTTAACTGAC ATGGATATGA TCAGTGATGA TATGGCAGTA TACTGCTTGA GTGTCAAGGA AGAAATCTTG GATGACGGAA GCAAAATCTT TGGTGTTGTG TTTGTAGATA CTGCTACTTC TGAAGTGAAT TTCATTGAGT TCCCAGACGA TGCCGAATGC ACCAAGTTGG AAACCTTGAT TACGCAAATC AAGCCCAAAG AGATCTTGTG TATGAAAGGA AACTTGTGTT CAATTGCAGT GAAGATATTG AAGTTCAATG CACAGGGACA TCAAATCTGG AACCAATTGA ACCCAATTTC TGAGTTCTGG GACTATGATA CCACCTGTGA GAACTTAGTT TCAGCCAAAT ATTATGATGC CGAGGACTTA GATGATTATT CTAACTATCC TCCGACATTA ATTGATTACA AAGACAACCA TAAGGTTGCA TTCGGTGCAT TTGGTGGTTT GCTTTTTTAT TTGAGGTCAT TGAAATTAGA TAGCAGTATC ATGACTTTGG GTCATATTTC GGAATATCAG ATTTCTAAGA ATTCAAGTAC TCATATGTTA TTGGATGGTA TCACCCTCAA CAATTTGGAG ATATTAAGCA ACTCTTTCGA TGGCGGAGAC AAGGGTACGT TGTTCAAATT GATCAACAAG GCTTCCACAC CATTTGGGAA AAGAGCAATG AAGTCATTGG TATTACACCC ACTTATGAAA ATCAATGAAA TTAATGAACG ATATGATGCC ATAGAATACT TGATGAACGA GGGCCTTGAA TTGAGAAGTA AATTGGAACA AACATTGACT TCCTTGCCAG ATTTGGAGAG GCTCTTGGCT AGAATTCATA GTAAAACTTT GAAATTCAAG GATTTCTTGA AAGTAGTAGA AAGTTTTGAA GGTATTTCTA AATCATTAGG GCCATTGCAT GAGTTTATTC CTGAGGAATC AGGAGCTTTG TTCAAACACT TGAAGAGCTT TCCAAGGGAA CTTCCAGAAC TTGTTTCTCA GTGGGACGAT GCATTTGACA GAGAAGAAGC AAAGAAAGAC GTTGTTGTCC CAACTGAGGG AGTGGATGCT GAATTTGACG ACTCACAATG TAAAATGAAG ATTTTAGAAG ATAAGCTCGA GCAGTACTTG AAGGAATACA AGAGGACCTA CAAATCTCAT GAAGTGGTCT ACAGAGATTC CGGTAAGGAA ATCTACTTGA TTGAACTTCC AAACAAGTTG GTCAAGCAAG TTCCAAATGA CTGGCAACAG ATGGGATCAA CTTCTAAGGT GAAGCGATAC TGGTCGCCAG AAGTTAAGAG AACTGCAAGA GAATTGATGG AACAGCGTGA ATTGCACAAG ATGGTATGTG AATCATTGAA AAGTAGAATG TACGAGAGAT TTGACGCACA TTATAAGACG TGGTTGAAAG CAGTTCATTC ATTAGGTAAG ATTGATTGCA TACTTGCATT GACTAGAACT TCTGAAACCA TTGGGTATCC ATCATGCAGA CCAGAGTTTG TTGATTCGGA AAAAGGTCAA ATTGAATTTA GAGAACTCAG ACATCCTTGT TTCCTCGCAA GCTCTGATTT TATTCCTAAT GATGTTATCC TTGGAGGATC AGAGGCAAAT TTTGGATTAT TGACAGGAGC AAATGCTGCT GGTAAATCAA CCTTGATGAG AACAACAGCT TTGGCAGTGA TATTGGCCCA GATTGGTTGT TTTGTCCCTG CGTCGAGTGC CAAATTAAGC ACTGTTGACA AGATTATGAC TCGTTTAGGG GCTAATGACA ACATCATGCA AGGTAAATCT ACTTTCTTTG TAGAATTATC AGAAACTAAG AAGATCATCA GCAACGCGAC TACAAGATCG TTGGTCATTT TAGATGAATT GGGAAGAGGT GGGTCTAGTA GTGATGGCTT TGCTATCGCG GAATCAACTT TGCACCATTT GGCAACGCAC ATTCAACCGC TAGGCTTCTT TGCAATACAC TATGGTACGT TGGGATTGTC GTTCCAAAAT CATCCCCAAA TTAAGCCACT CAGAATGGCA ATCATAATTG ACAACAACTC TAGAAATATC ACATTTTTGT ACAAACTTGA AGAAGGTACA GCTCCAGGCT CATTTGGTAT GAATGTAGCT TCCATGTGTG GTATAGCGAA TACCATTGTG GATCTGGCAG AAGTGGCAGC CAAAGAATAC GAACAGACGT CGAAGTTGAA GAAGACTCAT AAGAATAACA GCCTTGGTTT AGGATTGCAG AGTGACTTTT CTTGGTTTGC TCAAGGTCGG ACCTCCATTT TGAGTCTGGA TATTTTGAAC TACAGCGAGG ATGTCAAGCA AGGAGCTCTC TCAAGTATTT TTGGAATGAT CGAAAAGTTA TAGCACAACA TTGTATAGTA GAATAAATTC GATTATAATC AGTTTT
|
Protein sequence | MSKVRKPSTP LRSGSAAVKN GSASSSSKLK QLSLMSFFKP ASKPDSEKTK DSANPTMTQP PASSSPLRAK SEQSRQNLNE ASDTLNTSVL ASEADSHSDK ENENQIMSEH NARKPKPLAE NHLNSSPLVK RRSASRSVSY AESDSEDEAV NQTSRKRRKV IESDDDEEDD FKPAEEDDDD DDMSDFIVDD DKESEPEIEE DDEDDFAEET PRSKKSKSKS SSSSSRSSPS KESSISSNVL GDKFKAGSSY KATQPATKPK SITPVKTTPK KNFSKENEER YQWLVDVRDA EKRTTDDPNY DPRTLHVPQS AWSKFTAFEK QYWEIKSKMY NTVVFFKKGK FYELYENDAT IANTEFDLKI AGGGRANMKL AGIPEMSFEY WAKEFISHGY KVAKVDQVES LLAKEMRGGG TKEEKIIKRE LTGVLTGGTL TDMDMISDDM AVYCLSVKEE ILDDGSKIFG VVFVDTATSE VNFIEFPDDA ECTKLETLIT QIKPKEILCM KGNLCSIAVK ILKFNAQGHQ IWNQLNPISE FWDYDTTCEN LVSAKYYDAE DLDDYSNYPP TLIDYKDNHK VAFGAFGGLL FYLRSLKLDS SIMTLGHISE YQISKNSSTH MLLDGITLNN LEILSNSFDG GDKGTLFKLI NKASTPFGKR AMKSLVLHPL MKINEINERY DAIEYLMNEG LELRSKLEQT LTSLPDLERL LARIHSKTLK FKDFLKVVES FEGISKSLGP LHEFIPEESG ALFKHLKSFP RELPELVSQW DDAFDREEAK KDVVVPTEGV DAEFDDSQCK MKILEDKLEQ YLKEYKRTYK SHEVVYRDSG KEIYLIELPN KLVKQVPNDW QQMGSTSKVK RYWSPEVKRT ARELMEQREL HKMVCESLKS RMYERFDAHY KTWLKAVHSL GKIDCILALT RTSETIGYPS CRPEFVDSEK GQIEFRELRH PCFLASSDFI PNDVILGGSE ANFGLLTGAN AAGKSTLMRT TALAVILAQI GCFVPASSAK LSTVDKIMTR LGANDNIMQG KSTFFVELSE TKKIISNATT RSLVILDELG RGGSSSDGFA IAESTLHHLA THIQPLGFFA IHYGTLGLSF QNHPQIKPLR MAIIIDNNSR NITFLYKLEE GTAPGSFGMN VASMCGIANT IVDLAEVAAK EYEQTSKLKK THKNNSLGLG LQSDFSWFAQ GRTSILSLDI LNYSEDVKQG ALSSIFGMIE KL
|
| |