Gene Mpe_A0326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0326 
Symbol 
ID4786876 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp350084 
End bp351919 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content53% 
IMG OID640088878 
Productphosphoenolpyruvate--protein phosphotransferase 
Protein accessionYP_001019523 
Protein GI124265519 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00077396 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0251738 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GGTCAAGTGA CTAAGTGCAT GTGGTGGATG CCTTGGCGAT TACAGGCGAT GAAGGACGTG 
ATAGCCTGCG ATAAGCTTCG GGGAGCTGGC AAATTAGCTT TGATCCGGAG ATTTCCGAAT
GGGGAAACCC ACCCGCAAGG GTATCGCATG ATGAATACAT AGTCATGCGA GGCGAACCGG
GTGAACTGAA ACATCTCAGT AGCTCGAGGA ATAGACATCA ACCGAGATTC CGAAAGTAGT
GGCGAGCGAA ATCGGACCAG CCTGCACGTT TTAGCAGTCG AATTATCAGA ACAGTCTGGA
AAGGCTGGCC ATAGCGGGTG ATAGCCCCGT ATGAAAAAAT TCGGCTGTGG AACTGGGCGT
GCGACAAGTA GGGCGGGACA CGAGAAATCC TGTCTGAAGA TGGGGGGACC ATCCTCCAAG
GCTAAATACT CGTAATCGAC CGATAGTGAA CTAGTACCGT GAGGGAAAGG CGAAAAGAAC
CCCGGGAGGG GAGTGAAATA GATCCTGAAA CCGCATGCAT ACAAAAAGTA GGAGCCCGCA
AGGGTGACTG CGTACCTTTT GTATAATGGG TCAGCGACTT ACATTCAGTG GCAAGCTTAA
CCGAATAGGG AAGGCGTAGA GAAATCGAGT CCGAATAGGG CGTTCAGTCG CTGGGTGTAG
ACCCGAAACC AAGTGATCTA TCCATGGCCA GGATGAAGGT GCGGTAACAC GCACTGGAGG
TCCGAACCGA CTAGTGTTGC AAAACTAGCG GATGAGCTGT GGATAGGGGT GAAAGGCTAA
ACAAACTTGG AAATAGCTGG TTCTCTCCGA AAACTATTTA GGTAGTGCCT CAAGTATTAC
CATCGGGGGT AGAGCACTGT TATGGCTAGG GGGTCATGGC GACTTACCAA ACCATTGCAA
ACTCCGAATA CCGATGAGTA CAGCTTGGGA GACAGTGCAC CGGGTGCTAA CGTCCGGACA
CAAGAGGGAA ACAACCCAGA CCGCCAGCTA AGGTCCCTAA TATTGGCTAA GTGGGAAACG
AAGTGGGAAG GCTAAAACAG TCAGGATGTT GGCTTAGAAG CAGCCATCAT TTAAAGAAAG
CGTAATAGCT CACTGATCGA GTCGTCCTGC GCGGAAGATG TAACGGGGCT AAGCCAGTAA
CCGAAGCTGC GGATGTGCGC GTAAGCGTAC GTGGTAGGAG AGCGTTCCGT AAGCCTGTGA
AGGTGGGTTG TGAAGCCTGC TGGAGGTATC GGAAGTGCGA ATGCTGACAT GAGTAGCGTT
AAAGGGGGTG AAAAGCCCCC TCGCCGAAAG CGCAAGGTTT TCTACGCAAC GTTCATCGAC
GTAGAGTGAG TCGGCCCCTA AGGCGAGGCA GAGATGCGTA GCTGATGGGA AACAGGTCAA
TATTCCTGTA CCGATGTGTA GTGCGATGTG GGGACGGAGA AGGTTAGCTC AGCCGGGTGT
TGGATGTCCC GGTTCAAGCG TGTAGTCGTG GTCTCTAGGC AAATCCGGAG ATCTTAGATG
AGGCGTGATA ACGAGGCGGC TTGCCGCTGA AGTGAGTGAT ACCCTGCTTC CAGGAAAAGC
CACTAAGCTC CAGCTACACA CGACCGTACC GCAAACCGAC ACTGGTGCGC GAGATGAGTA
TTCTAAGGCG CTTGAGAGAA CTCTGGAGAA GGAACTCGGC AAATTGACAC CGTAACTTCG
GAAGAAGGTG TGCCTTTAGT AGGTGATCCC GTACAGGGGG AGCCCAATGA GGCCGCAGAG
AATCGGTGGC TGCGACTGTT TATTAAAAAC ACAGCACTCT GCAAAGACGA AAGTCGACGT
ATAGGGTGTG ACGCCTGCCC GGTGCTGGAA GATTAAATGA TGGGGTGCAA GCTCTTGATT
GAAGTCCCAG TAAACGGCGG CCGTAACTAT AACGGTCCTA AGGTAGCGAA ATTCCTTGTC
GGGTAAGTTC CGACCTGCAC GAATGGCGTA ACGATGGCCA CACTGTCTCC TCCAGAGACT
CAGCGAAGTT GAAATGTTTG TGATGATGCA ATCTCCCCGC GGAAAGACGG AAAGACCCCA
TGAACCTTTA CTGTAGCTTT GTATTGGACT TTGAACAGAT CTGTGTAGGA TAGGTGGGAG
GCTTTGAAGC GGTGCCGCTA GGTGTCGTGG AGCCAACGTT GAAATACCAC CCTGGTGTGT
TTGAGGTTCT AACCTTGGCC CGTTATCCGG GTTGGGGACA GTGCATGGTG GGCAGTTTGA
CTGGGGCGGT CTCCTCCCAA AGCGTAACGG AGGAGTTCGA AGGTACGCTA GGCACGGTCG
GAAATCGTGC TGATAGTGCA TAGGCATAAG CGTGCTTGAC TGCGAGACTG ACAAGTCGAG
CAGGTACGAA AGTAGGACTA AGTGATCCGG TGGTTCTGTA TGGAAGGGCC ATCGCTCAAC
GGATAAAAGG TACTCTGGGG ATAACAGGCT GATACCGCCC AAGAGTTCAT ATCGACGGCG
GTGTTTGGCA CCTCGATGTC GGCTCATCTC ATCCTGGGGC TGTAGCCGGT CCCAAGGGTA
TGGCTGTTCG CCATTTAAAG AGGTACGTGA GCTGGGTTTA AAACGTCGTG AGACAGTTTG
GTCCCTATCT TCCGTGGGCG CTGCAGATTT GAGGAAGCCT GCTCCTAGTA CGAGAGGACC
GGAGTGGACG CACCTCTGGT GTATCGGTTG TCACGCCAGT GGCATTGCCG AGTAGCTAAG
TGCGGAAGAG ATAACCGCTG AAAGCATCTA AGCGGGAAAC TCGTTTCAAG ATGAGATCTG
CCGGGGCCTT GAGCCCCCTG AAGAGTCGTT CGAGACCAGG ACGTTGATAG GCCGGGTGTG
GAAGCGCAGT AATGCGTTAA GCTAACCGGT ACTAATTGCT CGTGAGGCTT GACCCTA
 
Protein sequence
MCAVVTPAGS WTGRCYDRFM SLQMFGIPVS RGVAIGRAVL VASSRVDVAH YFIEPAQVER 
EIARLLQARD AVAAELGGLQ RDLPEDAPAE LSALLDVHLM LLHDEALTGA TSQWVHERHY
NAEWALSAQL EVLARHFDDM ENDYLRERKA DLEQVVERLL RVLMHDSSAV PPSIGVNPRD
FAGEDPLVLV ANDIAPADML QFKRSVFTGF VTDVGGKTSH TAIVARSLDI PAVVGAREAS
RIIRQDDWVV IDGDAGVVIV DPSSIVLEEY RFRQRQSELE RVRLTRLRHT PAVTLDGERV
ELFANIELPG DAAAALEAGA VGVGLFRSEF LFMNRTDDLP GEDEQYQAYC AVVDAMKGLP
VTIRTVDIGA DKPLDRMSAH ELRHEHALNP ALGLRAIRWS LSEPSMFRQQ LRAILRASAH
GQVRLLVPML AHESEIRGTF DALARAKQQL TESGRAFGDV QVGAMIEVPA AALMIDRFLD
AFDFVSLGTN DLIQYTLAID RADEAVAHLY DPWHPAVLEL VARTIRAARA RGRAVSVCGE
MAGDPSFTSV LLAMGLRSFS MHPSQIAAIK QQILRTDTRR LSDLLLGARS DAPTFTPLRN
GGGVAATPPR P