Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3003 |
Symbol | |
ID | 4784692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3192868 |
End bp | 3195672 |
Gene Length | 2805 bp |
Protein Length | 934 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640091574 |
Product | phosphoenolpyruvate carboxylase |
Protein accession | YP_001022191 |
Protein GI | 124268187 |
COG category | [C] Energy production and conversion |
COG ID | [COG2352] Phosphoenolpyruvate carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.371181 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACCG CCAAGACCCC CCGCAGCGCC GACCGCGGCG CCGACAAGAA CCGCCCGCTG ATCGAGGACA TCCGCCTGCT CGGCCGCATC CTGGGCGACG TGATCCGCGA GCAGGAAGGC GCACTGGCCT TCGAGCTGAT CGAGCGCATC CGCCAGCTCG CCGTGGCCTA CCGGCTCAAG CGCGACACCC AGGCCGGCCG CGCACTCGAC CGGCTGTTGA AGAACCTGTC GGTCGAGCAG GCGGTGTCGG TGGTGCGCGC CTTCAGCTAC TTCTCCCACC TCGCCAACCT GGCCGAGGAC CGCCACCACG TGCGGCGTCG CGAGCACCAC GAGCAGCTCG GCCACGTGCA GGAGGGCTCG CTGGCGATGA GCTTCGAGCG GCTCGCCAAG CGCGGCGTGC GCGCCACCGA GATCGCCGAG CTGCTGGGCC ACGCCTATCT GTCGCCGGTG CTCACCGCCC ACCCGACCGA AGTGCAGCGC AAGAGCGTGC TCGACGCCGA GCGCGCGGTG GCCGAGCTGA TCGGCGCGCG CGACACGCTG CCGACGCAGC GTGAGCGCGC CGCCAACGAG GCGATGCTGC GCGCCCGCGT GACGCAGTTG TGGCAGACGC GGCTGCTGCG CACCTCCAAG CTCAGCGTCG CCAACGAGAT CGACAACGCG CTGTCGTACT ACCAGAGCAC CTTCCTGCGC CAGATCCCCA GGCTCTATGC CGAGCTCGAG GCGCTGCTGC CCGGCTTCGA GGTCGCGCCG TTCTTCCGCA TGGGCAACTG GATCGGCGGC GACCGCGACG GCAACCCCAA CGTCACCGCC GAGACGCTGC GACTCGCGCT GGCGCGCCAC AGCGAGACGG TGCTGCGCTT CTACCTGACC GAGGTGCACG AACTCGGCGC CGAGCTGTCG ATCTCGGCGC TGCTGGTGCA GGTCACGCCC GAGCTGCAGG CGCTGGCCGA CCGCTCCGGC GACCACAACG CGCACCGGCT CGACGAGCCC TACCGCCGCG CGCTGATCGG CGTGTACGCA CGCCTCGCCG GCACATTGAC GGCGCTGACC GGCACCGAGG CGCTGCGCCA CGCGGTCGCG CCGTCCACGC CCTATGCGAC CGCCGAGGAG CTGCTGGCCG ACCTGCGCAC CGTCGAGGCC TCGCTGGCCT CGCACCACGG CGCGGCGCTC GCGGCGGCGC GCCTCAAGCC GCTGATCCGC GCCGTGCAGG TGTTCGGCTT CCACCTCGCC ACCGTCGACC TTCGCCAGAG CTCCGACCAG CACGAGGCGG TGCTCGCAGA GCTGATGGCC GGCGCCCGCA TCGAGGCCGA CTACGCGGCG ATGCCCGAGG AGGCCAAGGT CGCGTTGCTG CTGGGCCTGC TGAACGACGC GCGCAGCCTG CAGGTGCGCG GCGCCGTCTA CAGCGAGCGC ACGCGCGGCG AGCTGGCGAT CTTCGAGGCC GCGCGCGAGG GCCGGGCCCT CTATGGCCAC GCCGCGATCC GCCACTGCAT CATCTCCCAC ACCGAGACGG TGAGCGACCT GCTGGAAGTG CTGGTGCTGC AGAAGGAAGC CGGCCTGCTG CAGGGCACGC TCGACACCGA CGCACGCTGC GACCTGATCG TCGTGCCGCT GTTCGAGACC ATCACCGACC TGCGCCAGGC CGCGCCCATC ATGCGCGAGT TCTACGCGCT GCCGGGCGTG CTGCCGCTGG TGCTGCGCAG CGGCGCCGAC CAGTACTGCG AGCAGGACGT GATGCTCGGC TACTCCGACA GCAACAAGGA CGGCGGCTTC TTCACCAGCA ACTGGGAGCT CTACCGCGCC GAAACCGCGC TGGTCGAGCT GTTCGAGCCG CTCAAGCGCG AGCACGGCCT CACGCTGCGC CTGTTCCACG GCCGTGGCGG CACCGTGGGC CGCGGCGGCG GCCCGAGCTA CCAGGCCATC CTGGCGCAGC CTCCGGGTAC CGTGAACGGC CAGATCCGCC TCACCGAGCA GGGCGAGGTG ATCGCATCGA AGTACGCCAA CCCCGAGATC GGCCGGCGCA ATCTCGAAAC GCTGGTAGCG GCGACGCTGG AGGCCACGCT GCTGCCGCCG AAGCGCCACG CGCCGAAGCT GTTCCTCGAC ACCGCCGACA CGCTGTCGCA GCTCAGCATG GCGGCCTACC GCAAGCTGGT CTACGAAACC CCGGGCTTCG CCGACTACTT CTTCGCCGCC ACGCCGATCC GCGAGATCGC CGAGCTCAAC ATCGGCTCGC GGCCGGCCTC GCGCAAGGCC ACGCGCGCCA TCGAAGACCT GCGCGCCATC CCCTGGGGCT TCAGTTGGGG CCAGTGCCGC GTGGCGCTGC CGGGCTGGTA CGGCTTCGGC TCGGCCGTCG AGGGCTTCCT GGGGGATGCG CCCAAGCAGC GCAAGGAGCG CCTCGCGCTG CTGCAGCGCA TGCACGCGCA GTGGCCCTTC TTCGGCACGC TGCTGTCGAA CATCGACATG GTGCTGGCCA AGAGCGACCT GGCGATCGCG ACGCGCTACG TGGAGCTGGT GCCGGACAAG CGGGCCGCGA AGAAGATCTT CGCGGCCGTC CAGGCCGAAT GGCAGCGCAC CGATGCCGTG CTGGCCGCCA TCACCGGCGA GCCGCGCCGG CTGGCCGGCA ACGCCGCGCT GGCGCGCTCG ATCGAGCACC GCCTGCCCTA CATCGACCCG CTGAACCACC TGCAGGTCGA GCTGATGCGC CGCTACCGCG CCCAGCAGGG TCGCGGCGAG CTGCACGAGC GCGTGCAGCG CGGCATCCAC ATGTCGATCA ACGGCGTGGC GGCCGGGCTG CGCAACTCCG GCTGA
|
Protein sequence | MATAKTPRSA DRGADKNRPL IEDIRLLGRI LGDVIREQEG ALAFELIERI RQLAVAYRLK RDTQAGRALD RLLKNLSVEQ AVSVVRAFSY FSHLANLAED RHHVRRREHH EQLGHVQEGS LAMSFERLAK RGVRATEIAE LLGHAYLSPV LTAHPTEVQR KSVLDAERAV AELIGARDTL PTQRERAANE AMLRARVTQL WQTRLLRTSK LSVANEIDNA LSYYQSTFLR QIPRLYAELE ALLPGFEVAP FFRMGNWIGG DRDGNPNVTA ETLRLALARH SETVLRFYLT EVHELGAELS ISALLVQVTP ELQALADRSG DHNAHRLDEP YRRALIGVYA RLAGTLTALT GTEALRHAVA PSTPYATAEE LLADLRTVEA SLASHHGAAL AAARLKPLIR AVQVFGFHLA TVDLRQSSDQ HEAVLAELMA GARIEADYAA MPEEAKVALL LGLLNDARSL QVRGAVYSER TRGELAIFEA AREGRALYGH AAIRHCIISH TETVSDLLEV LVLQKEAGLL QGTLDTDARC DLIVVPLFET ITDLRQAAPI MREFYALPGV LPLVLRSGAD QYCEQDVMLG YSDSNKDGGF FTSNWELYRA ETALVELFEP LKREHGLTLR LFHGRGGTVG RGGGPSYQAI LAQPPGTVNG QIRLTEQGEV IASKYANPEI GRRNLETLVA ATLEATLLPP KRHAPKLFLD TADTLSQLSM AAYRKLVYET PGFADYFFAA TPIREIAELN IGSRPASRKA TRAIEDLRAI PWGFSWGQCR VALPGWYGFG SAVEGFLGDA PKQRKERLAL LQRMHAQWPF FGTLLSNIDM VLAKSDLAIA TRYVELVPDK RAAKKIFAAV QAEWQRTDAV LAAITGEPRR LAGNAALARS IEHRLPYIDP LNHLQVELMR RYRAQQGRGE LHERVQRGIH MSINGVAAGL RNSG
|
| |