Gene Mpe_A3393 details

Gene Information

Locus tag: Mpe_A3393
Symbol:
ID: 4786380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3607851
End bp: 3609671
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 63%
IMG OID: 640091969
Product: putative methanol dehydrogenase protein, large subunit
Protein accession: YP_001022581
Protein GI: 124268577
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4993] Glucose dehydrogenase
TIGRFAM ID: [TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family
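
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive coordinates on the replicon and that the final codon of the CDS is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3607851
end_bp = 3609671

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1821 bp
print(protein_length, "aa")   # 606 aa
```

Both results match the reported Gene Length and Protein Length.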


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.078697
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0119159
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTTT CTAAGCATTC GGGTTGGCGC CTGATGCGCC CCCTCGGATT GGCGCTGCTC 
GCGATCCCGG CAGTCGTGCA AGCGAATGCG GACGTCGAGA AGAACATTGC CAATTCCAAG
AACTGGGCGA TGCAGGCTGG TGACATGTTC AACCAGCGCT ACAGCAAGCT CGACCAGATC
AACAAGGGCA ACGTCGGCAA GATGCAGGTC GCGTGGACCT TCTCCACCGG CGTGCTGCGC
GGCCACGAGG GCTCGCCGCT GGTCATCGAC GGCACGATGT ACCTGCACTC GCCGTTCCCC
AACAAGGTCT TCGCGATCGA CCTGAACACC CAGAAGATCC TCTGGAAGTA CGAGCCGAAG
CAGGATCCGG CGGTCATCCC GCAGATGTGC TGCGACACGG TGAACCGTGG CCTGGCGTAC
GCCGAAGGCA AGGTCATCCT GCAGCAGGCC GACTCCAACC TCGTGGCCCT GGACGCCAAG
TCGGGCAAGG TGGTGTGGAG CGTGAAGAAC GGCGATCCGA AGCTCGGCGC CGTGAACACC
AACGCCCCGC ACGTCTTCAA GGACAAGGTC ATCACCGGCA TCTCCGGTGG TGAGTGGGGT
GTGCGCGGCT TCATCGCTGC CTACAACCTG AAGGACGGCA AGCCGGCGTG GAAGGGCTAC
AGCGTGGGCC CCGACGCCGA GATGCTGATC GACCCGGCCA AGACCACCAC CTGGATCGAC
GGCAAGGTCG CTCCGGTGGG CGCCGACTCG TCGCTGAAGA CCTGGAAGGG CGATCAGTGG
AAGATCGGTG GCGGCACCAC CTGGGGCTGG TACAGCTACG ACAAGGCGCT GAACGCCATG
TACTACGGCA CCGGCAACCC GTCGACCTGG AACCCGAGCC AGCGCCCGGG CGACAACAAG
TGGTCGATGT CAATCTGGTC GCGTGACGTC GACACGGGCA AGGTCAACTG GGTCTACCAG
ATGACGCCGT TCGACGAGTG GGACTTCGAC GGCATCAACG AGATGATCCT CGCGGACATC
AACGTGAAGG GCAAGCCGAC CAAGGCGCTG GTGCACTTCG ACCGCAACGG CTTCGCGTAC
ACGATGGACC GCACCAACGG TGCGCTGCTC GTGGCCGAGA AGTACGACCC GAAGGTGAAC
TGGGCGACCC ACGTCGACAT GAAGACCGGC CGTCCCCAGG TCGTGAAGCA GTACTCGACG
GCGCAAAACG GCCCCGATGT CAACACCAAG GGCATCTGCC CGGCGGCGCT GGGCTCGAAG
GACCAGCAGC CGGCCTCGTT CGACCCGAAC ACCAAGCTCT TCTACGTGCC GACCAACCAC
GTCTGCATGG ACTACGAGCC GTTCAAGGTC GAGTACACCG CGGGCCAGCC GTACGTGGGC
GCGACGCTGT CGATGTTCCC GGCTCCGGGC AGCCATGGTG GCATGGGCAA CTACATCACC
TGGGATGCCG GTACCGGCAA GATCGTGCAG AGCAAGGCCG AGAAGTTCTC GGTGTGGAGC
GGTTCGCTCA ACACCGCGGG CGGCCTGAGC TGCTACGGCA CGCTGGAGGG CTACTTCAAG
TGCGTCGATG CCAAGGACAT CAGCAAGGAA CTGTTCAAGT TCAAGACTCC GTCCGGCATC
ATCGGCAACG TGTTCACCTA TGAGCACAAG GGCAAGCAGT ACATGGGCGT GTTCTCGGGC
ATCGGCGGCT GGGCCGGCAT CGGCATGGCA GCGGGCCTCG AGAAGGACCA GGACGGCCTG
GGTGCTGTGG GCGGCTACAA GGAGCTGAAC CAGTACACGG AACTCGGCGG CTCGCTGACG
GTCTTTGCAC TGCCGAACTG A
 
Protein sequence
MKVSKHSGWR LMRPLGLALL AIPAVVQANA DVEKNIANSK NWAMQAGDMF NQRYSKLDQI 
NKGNVGKMQV AWTFSTGVLR GHEGSPLVID GTMYLHSPFP NKVFAIDLNT QKILWKYEPK
QDPAVIPQMC CDTVNRGLAY AEGKVILQQA DSNLVALDAK SGKVVWSVKN GDPKLGAVNT
NAPHVFKDKV ITGISGGEWG VRGFIAAYNL KDGKPAWKGY SVGPDAEMLI DPAKTTTWID
GKVAPVGADS SLKTWKGDQW KIGGGTTWGW YSYDKALNAM YYGTGNPSTW NPSQRPGDNK
WSMSIWSRDV DTGKVNWVYQ MTPFDEWDFD GINEMILADI NVKGKPTKAL VHFDRNGFAY
TMDRTNGALL VAEKYDPKVN WATHVDMKTG RPQVVKQYST AQNGPDVNTK GICPAALGSK
DQQPASFDPN TKLFYVPTNH VCMDYEPFKV EYTAGQPYVG ATLSMFPAPG SHGGMGNYIT
WDAGTGKIVQ SKAEKFSVWS GSLNTAGGLS CYGTLEGYFK CVDAKDISKE LFKFKTPSGI
IGNVFTYEHK GKQYMGVFSG IGGWAGIGMA AGLEKDQDGL GAVGGYKELN QYTELGGSLT
VFALPN
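
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch using only the Python standard library, not the method used to generate this record. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The function names `translate` and `gc_percent` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last seven codons of the gene sequence listed above.
print(translate("ATGAAAGTTT CTAAGCATTC G"))  # MKVSKHS
print(translate("GTCTTTGCAC TGCCGAACTG A"))  # VFALPN
```

Passing the full gene sequence to `translate` should yield the protein sequence shown above, and `gc_percent` on the same input can be compared against the reported GC content.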