Gene Mpe_A0476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0476 
Symbol 
ID4784195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp515914 
End bp517770 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content67% 
IMG OID640089034 
Productquinoprotein alcohol dehydrogenase 
Protein accessionYP_001019673 
Protein GI124265669 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0260666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC ATCCCGTCCG TCACGCCCTG TCCGTCGCAG CGGCGCTGGC CGTCCTGGGA 
CTGTCCCAGG GTGCGCACGC CGTCAAGAAC GTCACCTGGG AAGACATCTC GAACGACGAC
CGCACGAGCA CCGACGTCCT GAGCTACGGT CTGGGCCTGA AGGCCCAGCG CTACAGCCCG
CTGAAGCAGA TATCGACCGC CAACGTGCAG AAGCTCGTCC CGGCATGGAG CCACTCCTTC
GGCGGCGAGA AGCAGCGCGG CCAGGAAGGC CAGGTGCTGG TGCATGACGG CGTGATCTAC
GCGACCAGTT CCTACTCTCG CTTCACGGCG CTCGACGCGA AGACCGGCCG GCAGCTGTGG
ACCTACGAAC ACCGCCTGCC CGACGATATC CGTCCGTGCT GCGACGTCGT CAACCGCGGC
CCCGCCATCT ACGGCGACAA GGTCTACTTC GGTACGCTCG ACGCACGCGT CGTGGCGCTG
GACCGCGCCA CCGGCAAGGT GGTGTGGAAC GAGAAGTTCG GTGACCACAA GGTGGGCTAC
ACGATGACCG GCGCGCCCTT CATCGTGAAG GACAAGAAGT CCGGTCGCGT GCTGCTGATC
CACGGCTCGT CGGGCGACGA GTTCGGCGTC GTCGGATGGC TGTTCGCACG CGATCCCGAC
ACCGGCGCCG AGGTCTGGGC TCGCCCGATG GTCGAGGGCC ACATGGGCCG CCTGAACGGC
AAGGACAGCA CGGTGACCGG CGATGCGAAG GCCCCCTCGT GGCCGCGTGA CAAAGACGGC
AAGCTGGTCG AGGCGTGGCA CCAGGGCGGC GGCGCGCCGT GGCAGACCGC GTCGTTCGAT
GCCGAGAACA ACACCATCGT GATCGGCACC GGCAACCCGG CGCCGTGGAA CACCTGGAAG
CGCACGAAGG AAGGCGACGA CCCGCGCAAC TGGGACAGCC TGTTCACCTC GGGTCAGGCC
TACGTCGATG CGTCGACCGG CGAACTCAAG GGCTTCTTCC AGCACACGCC GAACGATGCC
TGGGACTTCT CGGGCAACAA CTCGATCGTG CTGTTCGAGT ACAAGGACCC GAAGTCCGGC
AAGCTGGTGA AGGCCGGCGC GCATGCCGAC CGCAACGGCT TTTTCTTCGT GACCGACCGC
GAGAAGCTCG CGACCGGCGC CGGCTATCCG AACAAGCCGA CCGCGCTGCT CGGTGCGTGG
CCGTTCGTCG ACGGCATCAC CTGGGCCAAG GGTTTCGACC TGAAGACCGG CAAGCCGATC
GAGAACAACA ACCGTCCGCC GGCCCCCAAG CCCGGCGCCG ACAAGGGCGA GTCGATCTTC
GTGTCGCCGC CGTTCCTGGG CGGCACCAAC TGGATGCCGA TGAGCTACAG CCCGGACACC
GGCCTGTTCT ACATCCCGGC GAACCACTGG GCGATGGACT ACTGGACCGA GCACCTGACC
TACAAGGCCG GCTCGGCCTA CCTCGGCCAG GGCTTCCGCA TCAAGCGGCT GTACGAGGAC
CACGTCGGCA CGCTGCGGGC AATCGACCCG GTGACCGGCA AGATCGCGTG GGAACACAAG
GAGAAGCTGC CGCTGTGGGC CGGCACGATG ACGACGGCCG GCGGCCTGCT GTTCACCGGC
ACCTCCGACG GCTACGTGAA GGCCTTCGAC AGCAAGACCG GCAAGGAACT GTGGAAGTTC
CAGACCGGCT CGGGCGTGGT CTCGGTCCCG GTGACCTGGG AGCAGGACGG CGAGCAGTAC
GTCGGCATCC AGTCGGGCTA CGGCGGCGCC GTGCCCCTGT GGGGCGGTGA CATGGCCGAG
ATGACCAAGA AGGTCACGCA GGGCGGCTCG ATGTGGGTCT TCAAGCTGCC CAAGTAG
 
Protein sequence
MSKHPVRHAL SVAAALAVLG LSQGAHAVKN VTWEDISNDD RTSTDVLSYG LGLKAQRYSP 
LKQISTANVQ KLVPAWSHSF GGEKQRGQEG QVLVHDGVIY ATSSYSRFTA LDAKTGRQLW
TYEHRLPDDI RPCCDVVNRG PAIYGDKVYF GTLDARVVAL DRATGKVVWN EKFGDHKVGY
TMTGAPFIVK DKKSGRVLLI HGSSGDEFGV VGWLFARDPD TGAEVWARPM VEGHMGRLNG
KDSTVTGDAK APSWPRDKDG KLVEAWHQGG GAPWQTASFD AENNTIVIGT GNPAPWNTWK
RTKEGDDPRN WDSLFTSGQA YVDASTGELK GFFQHTPNDA WDFSGNNSIV LFEYKDPKSG
KLVKAGAHAD RNGFFFVTDR EKLATGAGYP NKPTALLGAW PFVDGITWAK GFDLKTGKPI
ENNNRPPAPK PGADKGESIF VSPPFLGGTN WMPMSYSPDT GLFYIPANHW AMDYWTEHLT
YKAGSAYLGQ GFRIKRLYED HVGTLRAIDP VTGKIAWEHK EKLPLWAGTM TTAGGLLFTG
TSDGYVKAFD SKTGKELWKF QTGSGVVSVP VTWEQDGEQY VGIQSGYGGA VPLWGGDMAE
MTKKVTQGGS MWVFKLPK