Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3393 |
Symbol | |
ID | 4786380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3607851 |
End bp | 3609671 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640091969 |
Product | putative methanol dehydrogenase protein, large subunit |
Protein accession | YP_001022581 |
Protein GI | 124268577 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4993] Glucose dehydrogenase |
TIGRFAM ID | [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.078697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0119159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGTTT CTAAGCATTC GGGTTGGCGC CTGATGCGCC CCCTCGGATT GGCGCTGCTC GCGATCCCGG CAGTCGTGCA AGCGAATGCG GACGTCGAGA AGAACATTGC CAATTCCAAG AACTGGGCGA TGCAGGCTGG TGACATGTTC AACCAGCGCT ACAGCAAGCT CGACCAGATC AACAAGGGCA ACGTCGGCAA GATGCAGGTC GCGTGGACCT TCTCCACCGG CGTGCTGCGC GGCCACGAGG GCTCGCCGCT GGTCATCGAC GGCACGATGT ACCTGCACTC GCCGTTCCCC AACAAGGTCT TCGCGATCGA CCTGAACACC CAGAAGATCC TCTGGAAGTA CGAGCCGAAG CAGGATCCGG CGGTCATCCC GCAGATGTGC TGCGACACGG TGAACCGTGG CCTGGCGTAC GCCGAAGGCA AGGTCATCCT GCAGCAGGCC GACTCCAACC TCGTGGCCCT GGACGCCAAG TCGGGCAAGG TGGTGTGGAG CGTGAAGAAC GGCGATCCGA AGCTCGGCGC CGTGAACACC AACGCCCCGC ACGTCTTCAA GGACAAGGTC ATCACCGGCA TCTCCGGTGG TGAGTGGGGT GTGCGCGGCT TCATCGCTGC CTACAACCTG AAGGACGGCA AGCCGGCGTG GAAGGGCTAC AGCGTGGGCC CCGACGCCGA GATGCTGATC GACCCGGCCA AGACCACCAC CTGGATCGAC GGCAAGGTCG CTCCGGTGGG CGCCGACTCG TCGCTGAAGA CCTGGAAGGG CGATCAGTGG AAGATCGGTG GCGGCACCAC CTGGGGCTGG TACAGCTACG ACAAGGCGCT GAACGCCATG TACTACGGCA CCGGCAACCC GTCGACCTGG AACCCGAGCC AGCGCCCGGG CGACAACAAG TGGTCGATGT CAATCTGGTC GCGTGACGTC GACACGGGCA AGGTCAACTG GGTCTACCAG ATGACGCCGT TCGACGAGTG GGACTTCGAC GGCATCAACG AGATGATCCT CGCGGACATC AACGTGAAGG GCAAGCCGAC CAAGGCGCTG GTGCACTTCG ACCGCAACGG CTTCGCGTAC ACGATGGACC GCACCAACGG TGCGCTGCTC GTGGCCGAGA AGTACGACCC GAAGGTGAAC TGGGCGACCC ACGTCGACAT GAAGACCGGC CGTCCCCAGG TCGTGAAGCA GTACTCGACG GCGCAAAACG GCCCCGATGT CAACACCAAG GGCATCTGCC CGGCGGCGCT GGGCTCGAAG GACCAGCAGC CGGCCTCGTT CGACCCGAAC ACCAAGCTCT TCTACGTGCC GACCAACCAC GTCTGCATGG ACTACGAGCC GTTCAAGGTC GAGTACACCG CGGGCCAGCC GTACGTGGGC GCGACGCTGT CGATGTTCCC GGCTCCGGGC AGCCATGGTG GCATGGGCAA CTACATCACC TGGGATGCCG GTACCGGCAA GATCGTGCAG AGCAAGGCCG AGAAGTTCTC GGTGTGGAGC GGTTCGCTCA ACACCGCGGG CGGCCTGAGC TGCTACGGCA CGCTGGAGGG CTACTTCAAG TGCGTCGATG CCAAGGACAT CAGCAAGGAA CTGTTCAAGT TCAAGACTCC GTCCGGCATC ATCGGCAACG TGTTCACCTA TGAGCACAAG GGCAAGCAGT ACATGGGCGT GTTCTCGGGC ATCGGCGGCT GGGCCGGCAT CGGCATGGCA GCGGGCCTCG AGAAGGACCA GGACGGCCTG GGTGCTGTGG GCGGCTACAA GGAGCTGAAC CAGTACACGG AACTCGGCGG CTCGCTGACG GTCTTTGCAC TGCCGAACTG A
|
Protein sequence | MKVSKHSGWR LMRPLGLALL AIPAVVQANA DVEKNIANSK NWAMQAGDMF NQRYSKLDQI NKGNVGKMQV AWTFSTGVLR GHEGSPLVID GTMYLHSPFP NKVFAIDLNT QKILWKYEPK QDPAVIPQMC CDTVNRGLAY AEGKVILQQA DSNLVALDAK SGKVVWSVKN GDPKLGAVNT NAPHVFKDKV ITGISGGEWG VRGFIAAYNL KDGKPAWKGY SVGPDAEMLI DPAKTTTWID GKVAPVGADS SLKTWKGDQW KIGGGTTWGW YSYDKALNAM YYGTGNPSTW NPSQRPGDNK WSMSIWSRDV DTGKVNWVYQ MTPFDEWDFD GINEMILADI NVKGKPTKAL VHFDRNGFAY TMDRTNGALL VAEKYDPKVN WATHVDMKTG RPQVVKQYST AQNGPDVNTK GICPAALGSK DQQPASFDPN TKLFYVPTNH VCMDYEPFKV EYTAGQPYVG ATLSMFPAPG SHGGMGNYIT WDAGTGKIVQ SKAEKFSVWS GSLNTAGGLS CYGTLEGYFK CVDAKDISKE LFKFKTPSGI IGNVFTYEHK GKQYMGVFSG IGGWAGIGMA AGLEKDQDGL GAVGGYKELN QYTELGGSLT VFALPN
|
| |