Gene Mpe_A2842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2842 
Symbol 
ID4785536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3021912 
End bp3024524 
Gene Length2613 bp 
Protein Length870 aa 
Translation table11 
GC content72% 
IMG OID640091413 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001022031 
Protein GI124268027 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.147304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGCCG ACCCCGCCCG CGCCGAGTCC GGTCACACCC CCATGATGCA GCAGTACCTG 
CGCATCAAGG CCGAGCATCC CGACACGCTC GTGCTCTACC GCATGGGCGA TTTCTACGAG
CTGTTCTTCG ACGACGCCCG CAAGGCGCAC CGCCTGCTCG ACATCACGCT CACCAGCCGC
GGCCAGAGCG CCGGCGAGCC CGTCGTGATG GCCGGCGTGC CGGTGCACGC GCTCGAGAAC
TATCTCGGCA AGCTGGTCAA GCTTGGCGAG TCGGTCGCGA TCGCCGAGCA GGTGGGCGAC
GTGGCGACTG CCAAGGGGCC GGTCGAGCGC AAGGTGGTGC GCGTCGTCAC GCCGGGCACC
GTGACCGACA CCGAGCTGCT GTCGGAGCGC GTCGACACCC TGCTGCTGGC GGTGACGCGC
CAGCGTGCGA GTTACGGCCT GGCCTGGCTG GGCCTGAGCA GCGGCACGCT GGGTCTCAGC
GAATGCGGCG AGCGCGAACT CGCGGGCTGG CTGGCGCGCC TGGGCCCGGC CGAGGTGCTG
GTCGACGACC GCGACGACGC GGCGCTGACG GCCGCGCTGT CCACCACTCG CACGGCGCTC
ACGCGGCGAC CCGCGTGGCA GTTCGACACG GCGCTGGGCC GGCGCAAGCT GTGCGAACAA
CTGCGCGTTG CCAGTCTCGC CGGCTTCAAT GCCGAGGACG TGTCCGCAGC CCACGCCGCC
GCCGCGGCGC TGCTCAGCTA CGCCGAGCAC ACCCAGGGCC GGGCGCTGGC CCATGTGGGT
TCGCTGGCGG TGGAGCGCGC GAGCGATCTG CTGGAACTGC CGCCGGCCAC GCACCGCAAC
CTCGAGCTGA CGCAGACGCT GCGCGGCGAG GACGCGCCGA CGCTGCTGTC GCTGCTCGAC
GTGTGTCGCA CCGGCATGGG CTCGCGCGCC TTGCGCCACT GGCTCACGCA CCCGGCGCGC
GTGCGCACGG CGGCCCGCGC GCGCCATGAA GCGATCGGCG CGCTGATGGA ACGTGGCGGC
GAGGTGTTGC GCGAGGCGCT GCGCGGCGTC AGCGACGTCG AGCGCATCAC CGCGCGCATC
GCGCTGCGCC AGGTGCGTCC GCGCGAACTG ACCGGACTGC GCGCCACGCT GCTGGCGCTG
CCCGCGCTGC GCGACGGCGT GCCGCGCGAC AGCGCGCTGC TGGCCGATCT GGCACAGCAG
CTCTCGCCGC CCCCGGAGAT CGCCGACCTG CTGCAGCGCG CCATCGCCGA CGAGCCGGCG
GTGCTGTTGC GCGACGGCGG CGTCATCGCA ACGGGCCACG ACGCCGCGCT CGACGAGCTG
CGCGGCATTG CGCAGAACTG CGAAGCCTTC CTGCTGGACC TGGAGGCGCG CGAGCGCAAC
CGCACGGGCA TCGCCAACCT GCGCGTGCAG TTCAATCGCG TGCACGGCTT CTACATCGAG
GTCACGCAGG GCCAGGTCGA CAAGGTGCCG GCCGACTACC AGCGCCGCCA GACGCTGAAG
AACGCCGAGC GCTACATCAC GCCGGAGCTG AAGGCCTTCG AAGACAAGGC GCTGTCGGCA
CAGGAGCGTG CTCTGGCGCG CGAGAAGCTG CTGTACGACG GCGTGCTCGA TGCGCTGCAG
CCGCAGCTGG CCGCGCTGGG CGCCGTGGCG CGCGCGCTGG CCAGTCTGGA CGCGCTGGCC
GCGCTGGCCG AGCGCGCGGC GGTGCTGAAC TGGTGCTGCC CGGAGTTCGT GAGCCAGCCT
TGCATCGAGA TCGAGCAGGG CCGCCATCCG GTCGTCGAGG CGCGGCTGGC CGAGACCGGC
GGCGGTAGCT TCATCCCCAA CGACTGCCGG CTCGACGCCA ACCGCCGCAT GCTGGTGATC
ACCGGTCCCA ACATGGGCGG CAAGTCGACT TTCATGCGCC AGGTGGCGCT GGTGGTGCTG
CTCGCCGCGA TGGGCTCCTA CGTGCCGGCC GCAGCCTGCC GGCTTGGCCC GGTCGACGCC
ATCCACACCC GCATCGGCGC GGCCGACGAC CTGGCGAACG CACAGTCGAC CTTCATGCTC
GAGATGACCG AGGCGGCGGC CATCGTCCAC GGCGCCACCG AGCACTCGCT GGTGCTGATG
GACGAGATCG GCCGCGGCAC CTCCACGTTC GACGGCCTGG CGCTGGCCGG GGCGATCGCC
AGCCAGCTGC ACGACCGCAA CCGCGCCTTC ACGCTGTTCG CCACCCACTA TTTCGAGCTG
ACCGCCTTCC CCGAGAAGCA CCCGCGTGCG CTGAACGTGC ACGTCAGCGC GGCAGAGTCG
CACACCGGCC ACGGCGACGA CATCGTGTTC CTGCACGAGA TCCAGCCCGG CCCGGCCAGC
CGCAGCTACG GCGTGCAGGT GGCGCGGCTG GCCGGCATGC CGGCGCCGCT GTTGCGCCAG
GCGCGCGCGA CGCTGGAGGC GCTGGAGGCC CAGCAGGCGG CCAGCGCCTC GCAGATCGAC
CTGTTCGCCG CGCCCCCGCC CTCGCCGCCG CGCGAAGCCA CCGAGCTCGA GCGGGCCCTG
GCCGGCGTCG AACCCGATAC ATTGACCCCG CGGGAAGCGC TCGATGCGCT CTACCGCCTC
AAATCGCTCC ACGAGCGTTC CAAGGAAAAC TGA
 
Protein sequence
MSADPARAES GHTPMMQQYL RIKAEHPDTL VLYRMGDFYE LFFDDARKAH RLLDITLTSR 
GQSAGEPVVM AGVPVHALEN YLGKLVKLGE SVAIAEQVGD VATAKGPVER KVVRVVTPGT
VTDTELLSER VDTLLLAVTR QRASYGLAWL GLSSGTLGLS ECGERELAGW LARLGPAEVL
VDDRDDAALT AALSTTRTAL TRRPAWQFDT ALGRRKLCEQ LRVASLAGFN AEDVSAAHAA
AAALLSYAEH TQGRALAHVG SLAVERASDL LELPPATHRN LELTQTLRGE DAPTLLSLLD
VCRTGMGSRA LRHWLTHPAR VRTAARARHE AIGALMERGG EVLREALRGV SDVERITARI
ALRQVRPREL TGLRATLLAL PALRDGVPRD SALLADLAQQ LSPPPEIADL LQRAIADEPA
VLLRDGGVIA TGHDAALDEL RGIAQNCEAF LLDLEARERN RTGIANLRVQ FNRVHGFYIE
VTQGQVDKVP ADYQRRQTLK NAERYITPEL KAFEDKALSA QERALAREKL LYDGVLDALQ
PQLAALGAVA RALASLDALA ALAERAAVLN WCCPEFVSQP CIEIEQGRHP VVEARLAETG
GGSFIPNDCR LDANRRMLVI TGPNMGGKST FMRQVALVVL LAAMGSYVPA AACRLGPVDA
IHTRIGAADD LANAQSTFML EMTEAAAIVH GATEHSLVLM DEIGRGTSTF DGLALAGAIA
SQLHDRNRAF TLFATHYFEL TAFPEKHPRA LNVHVSAAES HTGHGDDIVF LHEIQPGPAS
RSYGVQVARL AGMPAPLLRQ ARATLEALEA QQAASASQID LFAAPPPSPP REATELERAL
AGVEPDTLTP REALDALYRL KSLHERSKEN