Gene PICST_81012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81012 
SymbolMSH6 
ID4851846 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2994912 
End bp2998707 
Gene Length3796 bp 
Protein Length1212 aa 
Translation table 
GC content41% 
IMG OID640393554 
ProductMismatch repair ATPase MSH6 (MutS family) 
Protein accessionXP_001387139 
Protein GI126275773 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0833152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAAGG TTCGCAAACC CTCCACACCA TTGAGATCTG GTTCAGCTGC TGTGAAGAAT 
GGCTCTGCCA GCTCTTCATC TAAACTTAAG CAGCTGAGCT TGATGTCTTT CTTCAAGCCA
GCTTCCAAAC CAGATTCTGA AAAGACAAAA GATAGTGCCA ACCCGACCAT GACTCAACCT
CCGGCTTCAT CCTCGCCCTT GAGAGCCAAA TCGGAACAAT CAAGACAGAA TCTGAACGAA
GCTTCAGATA CGTTGAATAC TTCTGTATTG GCCAGTGAAG CTGATTCTCA TTCAGATAAG
GAAAACGAAA ACCAAATCAT GTCCGAACAC AACGCTGTTG AAACAGACAC CCCGTTGTCT
TCTGAAACAG GAACTACACC GATTCGTGTA TCCAAGAAAG TTTCATTGCC CAAATCGTCT
CCAATCCAGC CGAAAAAGGG AGAATTCAAG CGAAAGCCCA AACCGTTGGC CGAGAATCAC
CTCAATTCAA GTCCGTTAGT CAAACGTCGT TCTGCATCTC GTAGCGTCAG CTACGCTGAG
TCAGATAGCG AAGACGAAGC TGTCAATCAA ACATCAAGGA AGAGGAGAAA GGTTATAGAA
AGTGATGATG ACGAAGAGGA TGATTTCAAA CCGGCTGAAG AAGATGACGA TGACGATGAT
ATGAGCGACT TCATCGTTGA TGATGATAAG GAAAGTGAGC CTGAGATAGA AGAAGATGAT
GAAGATGACT TTGCAGAAGA GACGCCAAGA TCTAAGAAGT CCAAGTCCAA GTCAAGTTCG
TCCTCATCCA GAAGCTCTCC TTCAAAGGAA TCTTCTATTT CCAGTAACGT TCTAGGCGAT
AAATTCAAGG CTGGTTCCTC GTATAAAGCC ACTCAACCTG CTACTAAGCC AAAATCTATT
ACTCCAGTTA AAACAACTCC AAAGAAGAAC TTCTCCAAAG AAAACGAAGA GAGATACCAA
TGGCTCGTAG ATGTCAGAGA TGCAGAAAAG AGAACTACAG ATGACCCCAA CTACGATCCT
CGAACATTGC ACGTACCCCA ATCTGCTTGG TCGAAATTTA CTGCGTTTGA AAAACAGTAC
TGGGAAATCA AGTCTAAGAT GTACAACACT GTAGTTTTCT TCAAGAAAGG TAAGTTCTAC
GAATTATACG AAAACGATGC TACGATTGCC AACACTGAAT TTGATTTGAA AATAGCTGGC
GGAGGACGGG CCAACATGAA GTTGGCGGGC ATTCCTGAGA TGTCGTTTGA GTACTGGGCA
AAAGAGTTTA TTAGCCATGG ATACAAAGTC GCTAAAGTTG ATCAAGTAGA AAGTCTTTTA
GCAAAAGAGA TGAGAGGTGG CGGTACTAAA GAAGAAAAGA TTATCAAAAG AGAGTTGACT
GGTGTCTTGA CCGGGGGCAC CTTAACTGAC ATGGATATGA TCAGTGATGA TATGGCAGTA
TACTGCTTGA GTGTCAAGGA AGAAATCTTG GATGACGGAA GCAAAATCTT TGGTGTTGTG
TTTGTAGATA CTGCTACTTC TGAAGTGAAT TTCATTGAGT TCCCAGACGA TGCCGAATGC
ACCAAGTTGG AAACCTTGAT TACGCAAATC AAGCCCAAAG AGATCTTGTG TATGAAAGGA
AACTTGTGTT CAATTGCAGT GAAGATATTG AAGTTCAATG CACAGGGACA TCAAATCTGG
AACCAATTGA ACCCAATTTC TGAGTTCTGG GACTATGATA CCACCTGTGA GAACTTAGTT
TCAGCCAAAT ATTATGATGC CGAGGACTTA GATGATTATT CTAACTATCC TCCGACATTA
ATTGATTACA AAGACAACCA TAAGGTTGCA TTCGGTGCAT TTGGTGGTTT GCTTTTTTAT
TTGAGGTCAT TGAAATTAGA TAGCAGTATC ATGACTTTGG GTCATATTTC GGAATATCAG
ATTTCTAAGA ATTCAAGTAC TCATATGTTA TTGGATGGTA TCACCCTCAA CAATTTGGAG
ATATTAAGCA ACTCTTTCGA TGGCGGAGAC AAGGGTACGT TGTTCAAATT GATCAACAAG
GCTTCCACAC CATTTGGGAA AAGAGCAATG AAGTCATTGG TATTACACCC ACTTATGAAA
ATCAATGAAA TTAATGAACG ATATGATGCC ATAGAATACT TGATGAACGA GGGCCTTGAA
TTGAGAAGTA AATTGGAACA AACATTGACT TCCTTGCCAG ATTTGGAGAG GCTCTTGGCT
AGAATTCATA GTAAAACTTT GAAATTCAAG GATTTCTTGA AAGTAGTAGA AAGTTTTGAA
GGTATTTCTA AATCATTAGG GCCATTGCAT GAGTTTATTC CTGAGGAATC AGGAGCTTTG
TTCAAACACT TGAAGAGCTT TCCAAGGGAA CTTCCAGAAC TTGTTTCTCA GTGGGACGAT
GCATTTGACA GAGAAGAAGC AAAGAAAGAC GTTGTTGTCC CAACTGAGGG AGTGGATGCT
GAATTTGACG ACTCACAATG TAAAATGAAG ATTTTAGAAG ATAAGCTCGA GCAGTACTTG
AAGGAATACA AGAGGACCTA CAAATCTCAT GAAGTGGTCT ACAGAGATTC CGGTAAGGAA
ATCTACTTGA TTGAACTTCC AAACAAGTTG GTCAAGCAAG TTCCAAATGA CTGGCAACAG
ATGGGATCAA CTTCTAAGGT GAAGCGATAC TGGTCGCCAG AAGTTAAGAG AACTGCAAGA
GAATTGATGG AACAGCGTGA ATTGCACAAG ATGGTATGTG AATCATTGAA AAGTAGAATG
TACGAGAGAT TTGACGCACA TTATAAGACG TGGTTGAAAG CAGTTCATTC ATTAGGTAAG
ATTGATTGCA TACTTGCATT GACTAGAACT TCTGAAACCA TTGGGTATCC ATCATGCAGA
CCAGAGTTTG TTGATTCGGA AAAAGGTCAA ATTGAATTTA GAGAACTCAG ACATCCTTGT
TTCCTCGCAA GCTCTGATTT TATTCCTAAT GATGTTATCC TTGGAGGATC AGAGGCAAAT
TTTGGATTAT TGACAGGAGC AAATGCTGCT GGTAAATCAA CCTTGATGAG AACAACAGCT
TTGGCAGTGA TATTGGCCCA GATTGGTTGT TTTGTCCCTG CGTCGAGTGC CAAATTAAGC
ACTGTTGACA AGATTATGAC TCGTTTAGGG GCTAATGACA ACATCATGCA AGGTAAATCT
ACTTTCTTTG TAGAATTATC AGAAACTAAG AAGATCATCA GCAACGCGAC TACAAGATCG
TTGGTCATTT TAGATGAATT GGGAAGAGGT GGGTCTAGTA GTGATGGCTT TGCTATCGCG
GAATCAACTT TGCACCATTT GGCAACGCAC ATTCAACCGC TAGGCTTCTT TGCAATACAC
TATGGTACGT TGGGATTGTC GTTCCAAAAT CATCCCCAAA TTAAGCCACT CAGAATGGCA
ATCATAATTG ACAACAACTC TAGAAATATC ACATTTTTGT ACAAACTTGA AGAAGGTACA
GCTCCAGGCT CATTTGGTAT GAATGTAGCT TCCATGTGTG GTATAGCGAA TACCATTGTG
GATCTGGCAG AAGTGGCAGC CAAAGAATAC GAACAGACGT CGAAGTTGAA GAAGACTCAT
AAGAATAACA GCCTTGGTTT AGGATTGCAG AGTGACTTTT CTTGGTTTGC TCAAGGTCGG
ACCTCCATTT TGAGTCTGGA TATTTTGAAC TACAGCGAGG ATGTCAAGCA AGGAGCTCTC
TCAAGTATTT TTGGAATGAT CGAAAAGTTA TAGCACAACA TTGTATAGTA GAATAAATTC
GATTATAATC AGTTTT
 
Protein sequence
MSKVRKPSTP LRSGSAAVKN GSASSSSKLK QLSLMSFFKP ASKPDSEKTK DSANPTMTQP 
PASSSPLRAK SEQSRQNLNE ASDTLNTSVL ASEADSHSDK ENENQIMSEH NARKPKPLAE
NHLNSSPLVK RRSASRSVSY AESDSEDEAV NQTSRKRRKV IESDDDEEDD FKPAEEDDDD
DDMSDFIVDD DKESEPEIEE DDEDDFAEET PRSKKSKSKS SSSSSRSSPS KESSISSNVL
GDKFKAGSSY KATQPATKPK SITPVKTTPK KNFSKENEER YQWLVDVRDA EKRTTDDPNY
DPRTLHVPQS AWSKFTAFEK QYWEIKSKMY NTVVFFKKGK FYELYENDAT IANTEFDLKI
AGGGRANMKL AGIPEMSFEY WAKEFISHGY KVAKVDQVES LLAKEMRGGG TKEEKIIKRE
LTGVLTGGTL TDMDMISDDM AVYCLSVKEE ILDDGSKIFG VVFVDTATSE VNFIEFPDDA
ECTKLETLIT QIKPKEILCM KGNLCSIAVK ILKFNAQGHQ IWNQLNPISE FWDYDTTCEN
LVSAKYYDAE DLDDYSNYPP TLIDYKDNHK VAFGAFGGLL FYLRSLKLDS SIMTLGHISE
YQISKNSSTH MLLDGITLNN LEILSNSFDG GDKGTLFKLI NKASTPFGKR AMKSLVLHPL
MKINEINERY DAIEYLMNEG LELRSKLEQT LTSLPDLERL LARIHSKTLK FKDFLKVVES
FEGISKSLGP LHEFIPEESG ALFKHLKSFP RELPELVSQW DDAFDREEAK KDVVVPTEGV
DAEFDDSQCK MKILEDKLEQ YLKEYKRTYK SHEVVYRDSG KEIYLIELPN KLVKQVPNDW
QQMGSTSKVK RYWSPEVKRT ARELMEQREL HKMVCESLKS RMYERFDAHY KTWLKAVHSL
GKIDCILALT RTSETIGYPS CRPEFVDSEK GQIEFRELRH PCFLASSDFI PNDVILGGSE
ANFGLLTGAN AAGKSTLMRT TALAVILAQI GCFVPASSAK LSTVDKIMTR LGANDNIMQG
KSTFFVELSE TKKIISNATT RSLVILDELG RGGSSSDGFA IAESTLHHLA THIQPLGFFA
IHYGTLGLSF QNHPQIKPLR MAIIIDNNSR NITFLYKLEE GTAPGSFGMN VASMCGIANT
IVDLAEVAAK EYEQTSKLKK THKNNSLGLG LQSDFSWFAQ GRTSILSLDI LNYSEDVKQG
ALSSIFGMIE KL