Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1594 |
Symbol | |
ID | 4787000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1719973 |
End bp | 1721928 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640090162 |
Product | glucose dehydrogenase |
Protein accession | YP_001020791 |
Protein GI | 124266787 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATCA GAACCACGGC GCTGCTGACG GCGACGGCCG CTGCGTGCAT CGCTGCAAGT GCCGCCGAGA CCGGCAACTG GCGCGAGTAC AACGGCGACA AGGCGGGATC CCGCTTCGAG CCGGGCGTGA ACGTCACGCT GCAGAGCGTG AAGTCGATGA AGGTGGCCTG GCGCGTGCCA CTGCCCGGCA ACCAGTTGCC GATCGACAAC CCGGAGTTGC GCACCTGGGT CAACCAGTCC ACGCCGCTGG CGGTCAACGG CGTGCTGTAC TCCAGTTCGC CGCTGGGCAT CGTGACCGCG CTCGACGGCG CCACCGGCAA GACGCTGTGG ACCTGGGACG GCGGCGGTTG GGCCGACGGC ACACCGCCGA ACCTGGGCTT CATCAGCCGC GGCGTGACCT ACTGGGAAAA GGGCGAGGAC CGGCGGCTCT TCGTCGGCAC GCCCGACGCC TACCTCGTCG CACTGAACGC GAAGACCGGC AAGCCGATCG AGAGCTGGGG TGACAAGGGT CGCATCGATC TGACCAAGGG CCTGCGCCGC CCGGTCGACC GCTCGCTGGT GAGCCCGACC ACGCCGCCGA TCATCTGCGG CGGACAGGTG ATCCCCAGCC TCGCCGTCCT CGACTCCTTC GCGATCGGCC GCGACCCGCT GAAGAACCAC CCGCCCGGCG ACGTGCGCGG CTTCGACCTG ATGACCGGCA AGCAGAGCTG GGTGTTCCAT GCGCCGCCGC AGAAGGGCGA GCCCGGCAAC GAGACGTGGG AAGGCGACTC GGCGCAGTCG ACCGGCGGCA TGAACATGTG GACGCGCGCG AGCTGCGACG ACGAGCAGGG GCTGGTCTAC CTGCCGCTGT CGACGCCGGC GAACGACTTC TACGGCGGCC AGCGCAAGGG CGACGGCCTG TTCGGCGAGT CGCTGGTGGC GCTGAACGCC AAGACCGGCA AGATCGCGTG GTACTACCAG ATCGTGCACC ACGGCATCTG GGACTACGAC CTGCCGGCCG CGCCGAACCT GATGGACCTC ACCGTCGACG GCCGCAAGAT CAAGGCCGCG GTGCAGGTGA CCAAGCAGGG CTTCGTGTTC GCGTTCGACC GAGTCAACGG CAAGCCGATC TGGCCGATCG AGGAGAAGGC GGTGCCGCAG TCCACCGTGC CCGGCGAGAA GTCGTCGCCG ACGCAGCCCT TCCCCAGCAA GCCGGCGCCC TTCGTGCAGC AGGGCGTAAC CGAGGACGAC CTGATCGACC TGACGCCCAA GCTGAAGGAG GAGGCGAAGA AGATCCTGGC GCGCTACAAC TACGGCCCGC TCTACTTCCC GCCGACGCTC GACAAGGCCG GCACGCTGGA AGTGCCGGGC GTGCTCGGCG GTGCCAGCTG GGTGGGCGCC GCGCACAACC CGAAGTCCAA CGTGCTCTAC GTGCCCTCGT TCACGATCCC GTTCGGCATC AAGCTGAAGA AGGGGACGAC CGGCCTGTAC GACTACACCG GCACCTGGGC CGGTGTCGGT GGTCCGGACG GGCTGCCGCT GTTCAAGCCG CCGTTCAGCA CCGTCACCGC GATCGACATG AACACCGGCA AGCACCTGTG GCGCATCCCG GCCGGCCGCG GCCCGGTCGA TCACCCGGCC CTGAAGGACC TGAAGCTCGA CCGCGTGGGT GTGCCGCGCC AGAGCCACAT CGCTCTGACC GAGAACGTGC TGTTCCTCGC ACCGGAGGGC ACCAACAGCG TCATCGGCCT GTCGGCACGC GGCAACGCGC TGATCACGCA GGCCACGAAG GAGGAGCCCG AGCCCTTCCT CTACGCGCAT GACGCGAAGA CCGGCGCGCT GGTTGGCGAG GTCCGTCTGC CCGGCGGCGT GTTCGGCAAC CTGATGACCT ACTCCGCCGG CGGCAAGCAG TTCGTCGTCG CGCCGATCGG GGGAGCCGGC CTCCCGGCCG AACTGGTGGC CGTGCAGGTG AACTGA
|
Protein sequence | MSIRTTALLT ATAAACIAAS AAETGNWREY NGDKAGSRFE PGVNVTLQSV KSMKVAWRVP LPGNQLPIDN PELRTWVNQS TPLAVNGVLY SSSPLGIVTA LDGATGKTLW TWDGGGWADG TPPNLGFISR GVTYWEKGED RRLFVGTPDA YLVALNAKTG KPIESWGDKG RIDLTKGLRR PVDRSLVSPT TPPIICGGQV IPSLAVLDSF AIGRDPLKNH PPGDVRGFDL MTGKQSWVFH APPQKGEPGN ETWEGDSAQS TGGMNMWTRA SCDDEQGLVY LPLSTPANDF YGGQRKGDGL FGESLVALNA KTGKIAWYYQ IVHHGIWDYD LPAAPNLMDL TVDGRKIKAA VQVTKQGFVF AFDRVNGKPI WPIEEKAVPQ STVPGEKSSP TQPFPSKPAP FVQQGVTEDD LIDLTPKLKE EAKKILARYN YGPLYFPPTL DKAGTLEVPG VLGGASWVGA AHNPKSNVLY VPSFTIPFGI KLKKGTTGLY DYTGTWAGVG GPDGLPLFKP PFSTVTAIDM NTGKHLWRIP AGRGPVDHPA LKDLKLDRVG VPRQSHIALT ENVLFLAPEG TNSVIGLSAR GNALITQATK EEPEPFLYAH DAKTGALVGE VRLPGGVFGN LMTYSAGGKQ FVVAPIGGAG LPAELVAVQV N
|
| |