Gene Information |
Locus tag | Mpe_A2842 |
Symbol | |
ID | 4785536 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3021912 |
End bp | 3024524 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640091413 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001022031 |
Protein GI | 124268027 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
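The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the stop codon is counted in Gene Length but not in Protein Length; the record does not state either convention explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3021912
end_bp = 3024524
gene_length_bp = 2613
protein_length_aa = 870

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is the stop codon and encodes no residue.
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)
```

Under these assumptions the span matches the reported Gene Length, and the codon count minus the stop codon matches the reported Protein Length.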
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.147304 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGCCG ACCCCGCCCG CGCCGAGTCC GGTCACACCC CCATGATGCA GCAGTACCTG CGCATCAAGG CCGAGCATCC CGACACGCTC GTGCTCTACC GCATGGGCGA TTTCTACGAG CTGTTCTTCG ACGACGCCCG CAAGGCGCAC CGCCTGCTCG ACATCACGCT CACCAGCCGC GGCCAGAGCG CCGGCGAGCC CGTCGTGATG GCCGGCGTGC CGGTGCACGC GCTCGAGAAC TATCTCGGCA AGCTGGTCAA GCTTGGCGAG TCGGTCGCGA TCGCCGAGCA GGTGGGCGAC GTGGCGACTG CCAAGGGGCC GGTCGAGCGC AAGGTGGTGC GCGTCGTCAC GCCGGGCACC GTGACCGACA CCGAGCTGCT GTCGGAGCGC GTCGACACCC TGCTGCTGGC GGTGACGCGC CAGCGTGCGA GTTACGGCCT GGCCTGGCTG GGCCTGAGCA GCGGCACGCT GGGTCTCAGC GAATGCGGCG AGCGCGAACT CGCGGGCTGG CTGGCGCGCC TGGGCCCGGC CGAGGTGCTG GTCGACGACC GCGACGACGC GGCGCTGACG GCCGCGCTGT CCACCACTCG CACGGCGCTC ACGCGGCGAC CCGCGTGGCA GTTCGACACG GCGCTGGGCC GGCGCAAGCT GTGCGAACAA CTGCGCGTTG CCAGTCTCGC CGGCTTCAAT GCCGAGGACG TGTCCGCAGC CCACGCCGCC GCCGCGGCGC TGCTCAGCTA CGCCGAGCAC ACCCAGGGCC GGGCGCTGGC CCATGTGGGT TCGCTGGCGG TGGAGCGCGC GAGCGATCTG CTGGAACTGC CGCCGGCCAC GCACCGCAAC CTCGAGCTGA CGCAGACGCT GCGCGGCGAG GACGCGCCGA CGCTGCTGTC GCTGCTCGAC GTGTGTCGCA CCGGCATGGG CTCGCGCGCC TTGCGCCACT GGCTCACGCA CCCGGCGCGC GTGCGCACGG CGGCCCGCGC GCGCCATGAA GCGATCGGCG CGCTGATGGA ACGTGGCGGC GAGGTGTTGC GCGAGGCGCT GCGCGGCGTC AGCGACGTCG AGCGCATCAC CGCGCGCATC GCGCTGCGCC AGGTGCGTCC GCGCGAACTG ACCGGACTGC GCGCCACGCT GCTGGCGCTG CCCGCGCTGC GCGACGGCGT GCCGCGCGAC AGCGCGCTGC TGGCCGATCT GGCACAGCAG CTCTCGCCGC CCCCGGAGAT CGCCGACCTG CTGCAGCGCG CCATCGCCGA CGAGCCGGCG GTGCTGTTGC GCGACGGCGG CGTCATCGCA ACGGGCCACG ACGCCGCGCT CGACGAGCTG CGCGGCATTG CGCAGAACTG CGAAGCCTTC CTGCTGGACC TGGAGGCGCG CGAGCGCAAC CGCACGGGCA TCGCCAACCT GCGCGTGCAG TTCAATCGCG TGCACGGCTT CTACATCGAG GTCACGCAGG GCCAGGTCGA CAAGGTGCCG GCCGACTACC AGCGCCGCCA GACGCTGAAG AACGCCGAGC GCTACATCAC GCCGGAGCTG AAGGCCTTCG AAGACAAGGC GCTGTCGGCA CAGGAGCGTG CTCTGGCGCG CGAGAAGCTG CTGTACGACG GCGTGCTCGA TGCGCTGCAG CCGCAGCTGG CCGCGCTGGG CGCCGTGGCG CGCGCGCTGG CCAGTCTGGA CGCGCTGGCC GCGCTGGCCG AGCGCGCGGC GGTGCTGAAC TGGTGCTGCC CGGAGTTCGT GAGCCAGCCT TGCATCGAGA TCGAGCAGGG CCGCCATCCG GTCGTCGAGG CGCGGCTGGC CGAGACCGGC 
GGCGGTAGCT TCATCCCCAA CGACTGCCGG CTCGACGCCA ACCGCCGCAT GCTGGTGATC ACCGGTCCCA ACATGGGCGG CAAGTCGACT TTCATGCGCC AGGTGGCGCT GGTGGTGCTG CTCGCCGCGA TGGGCTCCTA CGTGCCGGCC GCAGCCTGCC GGCTTGGCCC GGTCGACGCC ATCCACACCC GCATCGGCGC GGCCGACGAC CTGGCGAACG CACAGTCGAC CTTCATGCTC GAGATGACCG AGGCGGCGGC CATCGTCCAC GGCGCCACCG AGCACTCGCT GGTGCTGATG GACGAGATCG GCCGCGGCAC CTCCACGTTC GACGGCCTGG CGCTGGCCGG GGCGATCGCC AGCCAGCTGC ACGACCGCAA CCGCGCCTTC ACGCTGTTCG CCACCCACTA TTTCGAGCTG ACCGCCTTCC CCGAGAAGCA CCCGCGTGCG CTGAACGTGC ACGTCAGCGC GGCAGAGTCG CACACCGGCC ACGGCGACGA CATCGTGTTC CTGCACGAGA TCCAGCCCGG CCCGGCCAGC CGCAGCTACG GCGTGCAGGT GGCGCGGCTG GCCGGCATGC CGGCGCCGCT GTTGCGCCAG GCGCGCGCGA CGCTGGAGGC GCTGGAGGCC CAGCAGGCGG CCAGCGCCTC GCAGATCGAC CTGTTCGCCG CGCCCCCGCC CTCGCCGCCG CGCGAAGCCA CCGAGCTCGA GCGGGCCCTG GCCGGCGTCG AACCCGATAC ATTGACCCCG CGGGAAGCGC TCGATGCGCT CTACCGCCTC AAATCGCTCC ACGAGCGTTC CAAGGAAAAC TGA
|
Protein sequence | MSADPARAES GHTPMMQQYL RIKAEHPDTL VLYRMGDFYE LFFDDARKAH RLLDITLTSR GQSAGEPVVM AGVPVHALEN YLGKLVKLGE SVAIAEQVGD VATAKGPVER KVVRVVTPGT VTDTELLSER VDTLLLAVTR QRASYGLAWL GLSSGTLGLS ECGERELAGW LARLGPAEVL VDDRDDAALT AALSTTRTAL TRRPAWQFDT ALGRRKLCEQ LRVASLAGFN AEDVSAAHAA AAALLSYAEH TQGRALAHVG SLAVERASDL LELPPATHRN LELTQTLRGE DAPTLLSLLD VCRTGMGSRA LRHWLTHPAR VRTAARARHE AIGALMERGG EVLREALRGV SDVERITARI ALRQVRPREL TGLRATLLAL PALRDGVPRD SALLADLAQQ LSPPPEIADL LQRAIADEPA VLLRDGGVIA TGHDAALDEL RGIAQNCEAF LLDLEARERN RTGIANLRVQ FNRVHGFYIE VTQGQVDKVP ADYQRRQTLK NAERYITPEL KAFEDKALSA QERALAREKL LYDGVLDALQ PQLAALGAVA RALASLDALA ALAERAAVLN WCCPEFVSQP CIEIEQGRHP VVEARLAETG GGSFIPNDCR LDANRRMLVI TGPNMGGKST FMRQVALVVL LAAMGSYVPA AACRLGPVDA IHTRIGAADD LANAQSTFML EMTEAAAIVH GATEHSLVLM DEIGRGTSTF DGLALAGAIA SQLHDRNRAF TLFATHYFEL TAFPEKHPRA LNVHVSAAES HTGHGDDIVF LHEIQPGPAS RSYGVQVARL AGMPAPLLRQ ARATLEALEA QQAASASQID LFAAPPPSPP REATELERAL AGVEPDTLTP REALDALYRL KSLHERSKEN
|
| |
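As a rough consistency check between the Gene sequence and Protein sequence fields, the sketch below strips the 10-base block spacing and translates the first 30 and last 12 nucleotides of the gene. The codon table is deliberately partial: it lists only the codons that occur in these two fragments, with assignments that are the same in the standard code and in translation table 11. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical and not part of any database tooling.

```python
# Fragments copied from the Gene sequence field above (block spacing kept).
GENE_START = "ATGAGTGCCG ACCCCGCCCG CGCCGAGTCC"
GENE_END = "AAGGAAAAC TGA"

# Partial codon table: only the codons present in the two fragments.
CODON_TABLE = {
    "ATG": "M", "AGT": "S", "GCC": "A", "GAC": "D", "CCC": "P",
    "CGC": "R", "GAG": "E", "TCC": "S", "AAG": "K", "GAA": "E",
    "AAC": "N", "TGA": "*",  # '*' marks a stop codon
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1; raises KeyError for codons not in the partial table."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


n_terminus = translate(GENE_START)
c_terminus = translate(GENE_END)
fragment_gc = gc_fraction(GENE_START)

print(n_terminus)
print(c_terminus)
```

The first fragment should reproduce the opening block of the Protein sequence field, and the second should end in a stop after the final residues. The GC fraction computed here covers only the 30-base fragment, so it is not expected to equal the gene-wide GC content reported in the record.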