Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0326 |
Symbol | |
ID | 4786876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 350084 |
End bp | 351919 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640088878 |
Product | phosphoenolpyruvate--protein phosphotransferase |
Protein accession | YP_001019523 |
Protein GI | 124265519 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00077396 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0251738 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GGTCAAGTGA CTAAGTGCAT GTGGTGGATG CCTTGGCGAT TACAGGCGAT GAAGGACGTG ATAGCCTGCG ATAAGCTTCG GGGAGCTGGC AAATTAGCTT TGATCCGGAG ATTTCCGAAT GGGGAAACCC ACCCGCAAGG GTATCGCATG ATGAATACAT AGTCATGCGA GGCGAACCGG GTGAACTGAA ACATCTCAGT AGCTCGAGGA ATAGACATCA ACCGAGATTC CGAAAGTAGT GGCGAGCGAA ATCGGACCAG CCTGCACGTT TTAGCAGTCG AATTATCAGA ACAGTCTGGA AAGGCTGGCC ATAGCGGGTG ATAGCCCCGT ATGAAAAAAT TCGGCTGTGG AACTGGGCGT GCGACAAGTA GGGCGGGACA CGAGAAATCC TGTCTGAAGA TGGGGGGACC ATCCTCCAAG GCTAAATACT CGTAATCGAC CGATAGTGAA CTAGTACCGT GAGGGAAAGG CGAAAAGAAC CCCGGGAGGG GAGTGAAATA GATCCTGAAA CCGCATGCAT ACAAAAAGTA GGAGCCCGCA AGGGTGACTG CGTACCTTTT GTATAATGGG TCAGCGACTT ACATTCAGTG GCAAGCTTAA CCGAATAGGG AAGGCGTAGA GAAATCGAGT CCGAATAGGG CGTTCAGTCG CTGGGTGTAG ACCCGAAACC AAGTGATCTA TCCATGGCCA GGATGAAGGT GCGGTAACAC GCACTGGAGG TCCGAACCGA CTAGTGTTGC AAAACTAGCG GATGAGCTGT GGATAGGGGT GAAAGGCTAA ACAAACTTGG AAATAGCTGG TTCTCTCCGA AAACTATTTA GGTAGTGCCT CAAGTATTAC CATCGGGGGT AGAGCACTGT TATGGCTAGG GGGTCATGGC GACTTACCAA ACCATTGCAA ACTCCGAATA CCGATGAGTA CAGCTTGGGA GACAGTGCAC CGGGTGCTAA CGTCCGGACA CAAGAGGGAA ACAACCCAGA CCGCCAGCTA AGGTCCCTAA TATTGGCTAA GTGGGAAACG AAGTGGGAAG GCTAAAACAG TCAGGATGTT GGCTTAGAAG CAGCCATCAT TTAAAGAAAG CGTAATAGCT CACTGATCGA GTCGTCCTGC GCGGAAGATG TAACGGGGCT AAGCCAGTAA CCGAAGCTGC GGATGTGCGC GTAAGCGTAC GTGGTAGGAG AGCGTTCCGT AAGCCTGTGA AGGTGGGTTG TGAAGCCTGC TGGAGGTATC GGAAGTGCGA ATGCTGACAT GAGTAGCGTT AAAGGGGGTG AAAAGCCCCC TCGCCGAAAG CGCAAGGTTT TCTACGCAAC GTTCATCGAC GTAGAGTGAG TCGGCCCCTA AGGCGAGGCA GAGATGCGTA GCTGATGGGA AACAGGTCAA TATTCCTGTA CCGATGTGTA GTGCGATGTG GGGACGGAGA AGGTTAGCTC AGCCGGGTGT TGGATGTCCC GGTTCAAGCG TGTAGTCGTG GTCTCTAGGC AAATCCGGAG ATCTTAGATG AGGCGTGATA ACGAGGCGGC TTGCCGCTGA AGTGAGTGAT ACCCTGCTTC CAGGAAAAGC CACTAAGCTC CAGCTACACA CGACCGTACC GCAAACCGAC ACTGGTGCGC GAGATGAGTA TTCTAAGGCG CTTGAGAGAA CTCTGGAGAA GGAACTCGGC AAATTGACAC CGTAACTTCG GAAGAAGGTG TGCCTTTAGT AGGTGATCCC GTACAGGGGG AGCCCAATGA GGCCGCAGAG AATCGGTGGC TGCGACTGTT TATTAAAAAC ACAGCACTCT GCAAAGACGA AAGTCGACGT ATAGGGTGTG ACGCCTGCCC GGTGCTGGAA GATTAAATGA TGGGGTGCAA GCTCTTGATT GAAGTCCCAG TAAACGGCGG CCGTAACTAT AACGGTCCTA AGGTAGCGAA ATTCCTTGTC GGGTAAGTTC CGACCTGCAC GAATGGCGTA ACGATGGCCA CACTGTCTCC TCCAGAGACT CAGCGAAGTT GAAATGTTTG TGATGATGCA ATCTCCCCGC GGAAAGACGG AAAGACCCCA TGAACCTTTA CTGTAGCTTT GTATTGGACT TTGAACAGAT CTGTGTAGGA TAGGTGGGAG GCTTTGAAGC GGTGCCGCTA GGTGTCGTGG AGCCAACGTT GAAATACCAC CCTGGTGTGT TTGAGGTTCT AACCTTGGCC CGTTATCCGG GTTGGGGACA GTGCATGGTG GGCAGTTTGA CTGGGGCGGT CTCCTCCCAA AGCGTAACGG AGGAGTTCGA AGGTACGCTA GGCACGGTCG GAAATCGTGC TGATAGTGCA TAGGCATAAG CGTGCTTGAC TGCGAGACTG ACAAGTCGAG CAGGTACGAA AGTAGGACTA AGTGATCCGG TGGTTCTGTA TGGAAGGGCC ATCGCTCAAC GGATAAAAGG TACTCTGGGG ATAACAGGCT GATACCGCCC AAGAGTTCAT ATCGACGGCG GTGTTTGGCA CCTCGATGTC GGCTCATCTC ATCCTGGGGC TGTAGCCGGT CCCAAGGGTA TGGCTGTTCG CCATTTAAAG AGGTACGTGA GCTGGGTTTA AAACGTCGTG AGACAGTTTG GTCCCTATCT TCCGTGGGCG CTGCAGATTT GAGGAAGCCT GCTCCTAGTA CGAGAGGACC GGAGTGGACG CACCTCTGGT GTATCGGTTG TCACGCCAGT GGCATTGCCG AGTAGCTAAG TGCGGAAGAG ATAACCGCTG AAAGCATCTA AGCGGGAAAC TCGTTTCAAG ATGAGATCTG CCGGGGCCTT GAGCCCCCTG AAGAGTCGTT CGAGACCAGG ACGTTGATAG GCCGGGTGTG GAAGCGCAGT AATGCGTTAA GCTAACCGGT ACTAATTGCT CGTGAGGCTT GACCCTA
|
Protein sequence | MCAVVTPAGS WTGRCYDRFM SLQMFGIPVS RGVAIGRAVL VASSRVDVAH YFIEPAQVER EIARLLQARD AVAAELGGLQ RDLPEDAPAE LSALLDVHLM LLHDEALTGA TSQWVHERHY NAEWALSAQL EVLARHFDDM ENDYLRERKA DLEQVVERLL RVLMHDSSAV PPSIGVNPRD FAGEDPLVLV ANDIAPADML QFKRSVFTGF VTDVGGKTSH TAIVARSLDI PAVVGAREAS RIIRQDDWVV IDGDAGVVIV DPSSIVLEEY RFRQRQSELE RVRLTRLRHT PAVTLDGERV ELFANIELPG DAAAALEAGA VGVGLFRSEF LFMNRTDDLP GEDEQYQAYC AVVDAMKGLP VTIRTVDIGA DKPLDRMSAH ELRHEHALNP ALGLRAIRWS LSEPSMFRQQ LRAILRASAH GQVRLLVPML AHESEIRGTF DALARAKQQL TESGRAFGDV QVGAMIEVPA AALMIDRFLD AFDFVSLGTN DLIQYTLAID RADEAVAHLY DPWHPAVLEL VARTIRAARA RGRAVSVCGE MAGDPSFTSV LLAMGLRSFS MHPSQIAAIK QQILRTDTRR LSDLLLGARS DAPTFTPLRN GGGVAATPPR P
|
| |