Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2326 |
Symbol | |
ID | 7090310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2522297 |
End bp | 2525074 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643465649 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_002362619 |
Protein GI | 217978472 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.160487 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATCG AACCCGAAGC AAGAAAATCG CGCGCGCAGG CGGCGGCGCC CGAGAGCGCG GCCCGACGCC CCAGTGAGCC GCAGGACGGT CCCGCCAAAA TCTCGCCGAT GATGGCGCAG TTCATCGCGA TCAAGGCGGC CCATCCCGGC TCGCTCCTGT TCTACCGCAT GGGCGATTTC TACGAACTCT TCTTCGAGGA CGCGGAGATC GCCGCCAAGG CGCTCGGCAT CGTGCTCACG CGGCGCGGCA AATATCAGGG CGAAGACATT CCGATGTGCG GCGTGCCCGT CGAGCGCGCG CAGGAATATC TGCACCGGCT GATCTCGCTC GGCCATCGCG TTTCCGTCTG CGAACAGATC GAAGATCCAG CCGAGGCGAA AAAGCGCGGC GCGAAATCGG TAGTGCGGCG CGAGGTGCGC CGGCTCGTCA CCCCCGGCAC GATTACCGAG GAGACCCTGC TCGATCCCGC CAGAGCCAAC AGGCTGCTCG CGATCGCGCG CGCGCGCCAG GCTGACGGGC AATGGTCTTA CGGCCTCGCC GCGCTCGATA TTTCGACCGG CGAATTTCTT CTCAGCGAAG CGCCCGAGGC CCAGATCGAG ACCGAGATCG CGCGCATCGA GCCTGCCGAG ATCGTCATTC CCGAGGGGCT GATGGATCAG CCCCTGTTCG TCCGCCTCGC CCGCGAAGCG CGGGCGCCGC TGACGCCGCT CGGGCGCCTC GCCGCCGAGG GCCCGGCAGC CGAACGGCGC ATTTGCGATT TCTTCGGCCT CGCGACGCTC GATGGGCTTG GCGCGCTGTC GCCTGCGGAA ATCGCGGCGG CGGCGGCGGC GCTATTTTAC GTCGATCGCA CGCAATTCTC TGCGCGCCCG GCCTTGAGTC TGCCGACGCG GGTCGCGCGT GAGGCGCATA TGTCGATCGA CGCCGCGACG CGGGCCAATC TGGAGCTGAC CCGGACGCTG GGCGGCGCGC GTGAAGGATC GCTGATCGGC GCGATCGACC GCAGCGTGAC CGCCGCCGGC GGCCGGCTGC TGGCCGAGCG GTTGGCGGCA CCCTTGACCG ATCCGGACGA GATCACGCGG CGCCAGGAGG CGGTCGCCTT CTTCTTCGAC GAGCCGGCGC TGCGCGAGGC GGCGCGGCGG GCGCTGAAAG CCGCGCCCGA TCTCATGCGC GCAATCGCGC GGCTTGCGCT CGAGCGCGGC GGTCCGCGCG ATCTCGCCGC CCTGCGCGAC GGATTCTTCG CCGCCCGCGC GCTCGTGGAG ACGCTCCGCG CCGCGGAAAA TCTTCCCGGC GAACTTGCGC AGGCATGCGC GGCGGCTGGC GCGCTCGATC CGCTGGTCCC GCAGAAGCTT CAATCGGCGC TCGCCGACGC CCTGCCGCTC AACCGGCGCG ACGGCGGCTT CGTCGCCCCA GATTTCGACG CCGGCCTCGA CGAGTTGCGG GCGCTGCGCG ACGATACGCG CAAGGTCGTC GCGGCTCTGC AGGCGCGCTA TTGCGACCTC GCCGACATGC GCCAGCTCAA GCTGAAGCAT AATAATTTTC TCGGGTTCTA TCTGGAAGCG CCGCAGGCGC AGGGCGAGAA ACTGCTGAAG CCGCCGTTTG ACGCCATGTT CATCCATCGT CAAACGATGG CTGGCGCAAT GCGCTTCTCG ACGCCGGAGC TCTCCGAGCT CGACGTGAAA ATCTCAACGG CCGCCGATCA GGCGCTTGTC CGGGAACTGT CGATTTTCGA TGAGCTCGCA GCGACCCTGC TGGCGCAGGG CGAGGCGATC AAGCGCGCCG CAGCGGCGCT CGCCTGCATC GACGCGGCGG CGGGCCTGGC GGAACTCGCC GCGGATTGCG GCTGGACCCG GCCGGAGGTC GACGGCTCGC TGAAATTTCA CATCGAAGGC GGCCGCCATC CGGTGGTTGA GGCGGCGCTG CGCCGCGGCG GCGCGCCCTT CGTCGCCAAT GATTGCGATC TGTCCGGGCT CGCGGAGGAG GGCGGCCGCA TCGCCGTCGT CACCGGGCCG AATATGGCCG GCAAATCGAC CTATCTGCGC CAGAACGCGC TGATCGCCGT GCTGGCGCAG ATCGGCTCCT TCGTGCCCGC GCGGCGCGCC CATATCGGCG TCGTCGACCG GCTGTTTTCG CGCGTCGGCG CGTCGGACGA TCTCGCGCGC GGGCGCTCGA CCTTCATGGT CGAGATGGTC GAGACAGCGG CGATCCTCAA TTCCGCCAGA GCCCGATCGC TGGTCATTCT CGACGAGATT GGCCGCGGCA CGGCGACCTT CGACGGCCTC TCGATCGCCT GGGCGGTGAT GGAGCATCTG CATGAGGTCA ATCGCAGCCG GGCGCTGTTC GCCACCCATT TCCACGAGCT GACCCAGCTT GGCAAACGGC TGGCGCGGAT CGACAATCTG ACCGTGCGCG TCAGCGAGTG GAAGGGCGAT GTCATATTTC TGCACGAGAT CATCGCTGGC GCCGCCGACC AATCCTATGG CGTGCAGGTG GCGAAGCTGG CCGGGCTGCC GGCGCAGGTG GTCGCGCGGG CAAGGCTGCT CCTGTCCGAA TTCGAGGCGG CGGAGCGCAT GAGCAGCCGC GACCGGCTCG TCGCCGATCT GCCGCTGTTT TCCGCGTCTT ACGTCAAGGC CCCGGAGCCG GCTGAGATCG CCCATGGCGG CGGGTCCGCC GACGCCCTTG GAGCTGCGCT CGACGCGGTG AACCCGGACG AGCTCTCGCC GCGGGCCGCG CTGGAGGAGC TCTATCGCCT GAAACGTTTG CGCGCCGGCG AGGCGTAA
|
Protein sequence | MSIEPEARKS RAQAAAPESA ARRPSEPQDG PAKISPMMAQ FIAIKAAHPG SLLFYRMGDF YELFFEDAEI AAKALGIVLT RRGKYQGEDI PMCGVPVERA QEYLHRLISL GHRVSVCEQI EDPAEAKKRG AKSVVRREVR RLVTPGTITE ETLLDPARAN RLLAIARARQ ADGQWSYGLA ALDISTGEFL LSEAPEAQIE TEIARIEPAE IVIPEGLMDQ PLFVRLAREA RAPLTPLGRL AAEGPAAERR ICDFFGLATL DGLGALSPAE IAAAAAALFY VDRTQFSARP ALSLPTRVAR EAHMSIDAAT RANLELTRTL GGAREGSLIG AIDRSVTAAG GRLLAERLAA PLTDPDEITR RQEAVAFFFD EPALREAARR ALKAAPDLMR AIARLALERG GPRDLAALRD GFFAARALVE TLRAAENLPG ELAQACAAAG ALDPLVPQKL QSALADALPL NRRDGGFVAP DFDAGLDELR ALRDDTRKVV AALQARYCDL ADMRQLKLKH NNFLGFYLEA PQAQGEKLLK PPFDAMFIHR QTMAGAMRFS TPELSELDVK ISTAADQALV RELSIFDELA ATLLAQGEAI KRAAAALACI DAAAGLAELA ADCGWTRPEV DGSLKFHIEG GRHPVVEAAL RRGGAPFVAN DCDLSGLAEE GGRIAVVTGP NMAGKSTYLR QNALIAVLAQ IGSFVPARRA HIGVVDRLFS RVGASDDLAR GRSTFMVEMV ETAAILNSAR ARSLVILDEI GRGTATFDGL SIAWAVMEHL HEVNRSRALF ATHFHELTQL GKRLARIDNL TVRVSEWKGD VIFLHEIIAG AADQSYGVQV AKLAGLPAQV VARARLLLSE FEAAERMSSR DRLVADLPLF SASYVKAPEP AEIAHGGGSA DALGAALDAV NPDELSPRAA LEELYRLKRL RAGEA
|
| |