Gene Msil_2326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2326 
Symbol 
ID7090310 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2522297 
End bp2525074 
Gene Length2778 bp 
Protein Length925 aa 
Translation table11 
GC content69% 
IMG OID643465649 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_002362619 
Protein GI217978472 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.160487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCG AACCCGAAGC AAGAAAATCG CGCGCGCAGG CGGCGGCGCC CGAGAGCGCG 
GCCCGACGCC CCAGTGAGCC GCAGGACGGT CCCGCCAAAA TCTCGCCGAT GATGGCGCAG
TTCATCGCGA TCAAGGCGGC CCATCCCGGC TCGCTCCTGT TCTACCGCAT GGGCGATTTC
TACGAACTCT TCTTCGAGGA CGCGGAGATC GCCGCCAAGG CGCTCGGCAT CGTGCTCACG
CGGCGCGGCA AATATCAGGG CGAAGACATT CCGATGTGCG GCGTGCCCGT CGAGCGCGCG
CAGGAATATC TGCACCGGCT GATCTCGCTC GGCCATCGCG TTTCCGTCTG CGAACAGATC
GAAGATCCAG CCGAGGCGAA AAAGCGCGGC GCGAAATCGG TAGTGCGGCG CGAGGTGCGC
CGGCTCGTCA CCCCCGGCAC GATTACCGAG GAGACCCTGC TCGATCCCGC CAGAGCCAAC
AGGCTGCTCG CGATCGCGCG CGCGCGCCAG GCTGACGGGC AATGGTCTTA CGGCCTCGCC
GCGCTCGATA TTTCGACCGG CGAATTTCTT CTCAGCGAAG CGCCCGAGGC CCAGATCGAG
ACCGAGATCG CGCGCATCGA GCCTGCCGAG ATCGTCATTC CCGAGGGGCT GATGGATCAG
CCCCTGTTCG TCCGCCTCGC CCGCGAAGCG CGGGCGCCGC TGACGCCGCT CGGGCGCCTC
GCCGCCGAGG GCCCGGCAGC CGAACGGCGC ATTTGCGATT TCTTCGGCCT CGCGACGCTC
GATGGGCTTG GCGCGCTGTC GCCTGCGGAA ATCGCGGCGG CGGCGGCGGC GCTATTTTAC
GTCGATCGCA CGCAATTCTC TGCGCGCCCG GCCTTGAGTC TGCCGACGCG GGTCGCGCGT
GAGGCGCATA TGTCGATCGA CGCCGCGACG CGGGCCAATC TGGAGCTGAC CCGGACGCTG
GGCGGCGCGC GTGAAGGATC GCTGATCGGC GCGATCGACC GCAGCGTGAC CGCCGCCGGC
GGCCGGCTGC TGGCCGAGCG GTTGGCGGCA CCCTTGACCG ATCCGGACGA GATCACGCGG
CGCCAGGAGG CGGTCGCCTT CTTCTTCGAC GAGCCGGCGC TGCGCGAGGC GGCGCGGCGG
GCGCTGAAAG CCGCGCCCGA TCTCATGCGC GCAATCGCGC GGCTTGCGCT CGAGCGCGGC
GGTCCGCGCG ATCTCGCCGC CCTGCGCGAC GGATTCTTCG CCGCCCGCGC GCTCGTGGAG
ACGCTCCGCG CCGCGGAAAA TCTTCCCGGC GAACTTGCGC AGGCATGCGC GGCGGCTGGC
GCGCTCGATC CGCTGGTCCC GCAGAAGCTT CAATCGGCGC TCGCCGACGC CCTGCCGCTC
AACCGGCGCG ACGGCGGCTT CGTCGCCCCA GATTTCGACG CCGGCCTCGA CGAGTTGCGG
GCGCTGCGCG ACGATACGCG CAAGGTCGTC GCGGCTCTGC AGGCGCGCTA TTGCGACCTC
GCCGACATGC GCCAGCTCAA GCTGAAGCAT AATAATTTTC TCGGGTTCTA TCTGGAAGCG
CCGCAGGCGC AGGGCGAGAA ACTGCTGAAG CCGCCGTTTG ACGCCATGTT CATCCATCGT
CAAACGATGG CTGGCGCAAT GCGCTTCTCG ACGCCGGAGC TCTCCGAGCT CGACGTGAAA
ATCTCAACGG CCGCCGATCA GGCGCTTGTC CGGGAACTGT CGATTTTCGA TGAGCTCGCA
GCGACCCTGC TGGCGCAGGG CGAGGCGATC AAGCGCGCCG CAGCGGCGCT CGCCTGCATC
GACGCGGCGG CGGGCCTGGC GGAACTCGCC GCGGATTGCG GCTGGACCCG GCCGGAGGTC
GACGGCTCGC TGAAATTTCA CATCGAAGGC GGCCGCCATC CGGTGGTTGA GGCGGCGCTG
CGCCGCGGCG GCGCGCCCTT CGTCGCCAAT GATTGCGATC TGTCCGGGCT CGCGGAGGAG
GGCGGCCGCA TCGCCGTCGT CACCGGGCCG AATATGGCCG GCAAATCGAC CTATCTGCGC
CAGAACGCGC TGATCGCCGT GCTGGCGCAG ATCGGCTCCT TCGTGCCCGC GCGGCGCGCC
CATATCGGCG TCGTCGACCG GCTGTTTTCG CGCGTCGGCG CGTCGGACGA TCTCGCGCGC
GGGCGCTCGA CCTTCATGGT CGAGATGGTC GAGACAGCGG CGATCCTCAA TTCCGCCAGA
GCCCGATCGC TGGTCATTCT CGACGAGATT GGCCGCGGCA CGGCGACCTT CGACGGCCTC
TCGATCGCCT GGGCGGTGAT GGAGCATCTG CATGAGGTCA ATCGCAGCCG GGCGCTGTTC
GCCACCCATT TCCACGAGCT GACCCAGCTT GGCAAACGGC TGGCGCGGAT CGACAATCTG
ACCGTGCGCG TCAGCGAGTG GAAGGGCGAT GTCATATTTC TGCACGAGAT CATCGCTGGC
GCCGCCGACC AATCCTATGG CGTGCAGGTG GCGAAGCTGG CCGGGCTGCC GGCGCAGGTG
GTCGCGCGGG CAAGGCTGCT CCTGTCCGAA TTCGAGGCGG CGGAGCGCAT GAGCAGCCGC
GACCGGCTCG TCGCCGATCT GCCGCTGTTT TCCGCGTCTT ACGTCAAGGC CCCGGAGCCG
GCTGAGATCG CCCATGGCGG CGGGTCCGCC GACGCCCTTG GAGCTGCGCT CGACGCGGTG
AACCCGGACG AGCTCTCGCC GCGGGCCGCG CTGGAGGAGC TCTATCGCCT GAAACGTTTG
CGCGCCGGCG AGGCGTAA
 
Protein sequence
MSIEPEARKS RAQAAAPESA ARRPSEPQDG PAKISPMMAQ FIAIKAAHPG SLLFYRMGDF 
YELFFEDAEI AAKALGIVLT RRGKYQGEDI PMCGVPVERA QEYLHRLISL GHRVSVCEQI
EDPAEAKKRG AKSVVRREVR RLVTPGTITE ETLLDPARAN RLLAIARARQ ADGQWSYGLA
ALDISTGEFL LSEAPEAQIE TEIARIEPAE IVIPEGLMDQ PLFVRLAREA RAPLTPLGRL
AAEGPAAERR ICDFFGLATL DGLGALSPAE IAAAAAALFY VDRTQFSARP ALSLPTRVAR
EAHMSIDAAT RANLELTRTL GGAREGSLIG AIDRSVTAAG GRLLAERLAA PLTDPDEITR
RQEAVAFFFD EPALREAARR ALKAAPDLMR AIARLALERG GPRDLAALRD GFFAARALVE
TLRAAENLPG ELAQACAAAG ALDPLVPQKL QSALADALPL NRRDGGFVAP DFDAGLDELR
ALRDDTRKVV AALQARYCDL ADMRQLKLKH NNFLGFYLEA PQAQGEKLLK PPFDAMFIHR
QTMAGAMRFS TPELSELDVK ISTAADQALV RELSIFDELA ATLLAQGEAI KRAAAALACI
DAAAGLAELA ADCGWTRPEV DGSLKFHIEG GRHPVVEAAL RRGGAPFVAN DCDLSGLAEE
GGRIAVVTGP NMAGKSTYLR QNALIAVLAQ IGSFVPARRA HIGVVDRLFS RVGASDDLAR
GRSTFMVEMV ETAAILNSAR ARSLVILDEI GRGTATFDGL SIAWAVMEHL HEVNRSRALF
ATHFHELTQL GKRLARIDNL TVRVSEWKGD VIFLHEIIAG AADQSYGVQV AKLAGLPAQV
VARARLLLSE FEAAERMSSR DRLVADLPLF SASYVKAPEP AEIAHGGGSA DALGAALDAV
NPDELSPRAA LEELYRLKRL RAGEA