Gene Mpe_A0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0473 
Symbol 
ID4784192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp512867 
End bp514609 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content66% 
IMG OID640089031 
Productquinoprotein ethanol dehydrogenase, putative 
Protein accessionYP_001019670 
Protein GI124265666 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4993] Glucose dehydrogenase 
TIGRFAM ID[TIGR03075] PQQ-dependent dehydrogenase, methanol/ethanol family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0101481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGC ACCTGCTTTC TTTGTTGGTG GCTGCCGCCT GCGCCAGTGT TGGCGCCCAG 
GCCGCCGGCG TCACCGACGC GATGATCGAC AACGATGCCA AGACCCCGAA CGACGTTCTG
AGCTGGGGCA TCGGCTCGCA GGGGCAGCGC TATTCCCCGC TCAAGCAGGT CAACACCTCG
ACCGTGCAGA AGCTGGTGCC GGCCTGGGCC TTCTCGTTCG GCGGCGAAAA GCAGCGCGGG
CAGCAATCGC AGCCGCTGAT CCATGACGGC AAGATGTTCG TCACGGCGTC TTACTCGCGG
ATCTACGCGC TGGACGTCAA GACCGGCACC AAGCTCTGGA AGTATGAGCA CCGCCTGCCC
GAAGGCATCA TGCCCTGCTG CGATGTGATC AACCGCGGTG CCGCGCTGTA CGACAACCTG
GTCATCTTCG GCACGCTCGA CGCCCAGCTC GTCGCGCTCG ACCAGAAGAC CGGCGACGTG
GTCTGGAAGG AGAAGATCGA CGACTACGCG GCCGGCTACA GCTACACCGC CGCCCCGCTG
ATCGCCGGTG GCCTGCTGCT GACCGGTGTG TCCGGCGGCG AATTCGGCAT CGTCGGACGC
GTCGAGGCGC GCGATCCCAA GACCGGCAAG ATGGTCTGGA TCCGCCCGAC CGTCGAAGGT
CACATGGGCT ACAAGTTCGA CAAGGACGGC AACAAGACCG AAATCGGCAT CTCGGGCACG
ACGGGCAAGA CCTGGCCGGG CGACATGTGG AAGTCGGGCG GCGCGGCCAC CTGGCTGGGC
GGCACCTACG ACGCCAAGAC CGGCCTGGCC TACTTCGGCA CCGGCAACCC CGGCCCGTGG
AACAGCCACC TGCGTCCCGG CGACAACCTG TTCTCGTGCG CGACGGTCGC GATCGACGTC
AAGACCGGCC AGATCAAGTG GCACTATCAG ACCACACCGC ACGACGGCTG GGACTTCGAC
GGCGTGAACG AGTTCGTCAC CTTCGACATG GACGGCAAGC GCGTCGGCGG CAAGGCCGAC
CGCAACGGCT TCTTCTACGT GATCGACGCC GCCAACGGCA AGCTCGAGAA CGCTTTCCCC
TTCGTCAAGA AGATCACCTG GGCCACCGGC ATCGATCTGA AGACCGGCCG TCCCAACTAC
GTGCCGGAGA ACCGTCCGGG CGACCCGACC GCGGGTGCCG ACGGCAAGAA GGGCAACTCG
GTGTTCGCGG CGCCGTCCTT CCTTGGCGGC AAGAACCAGA TGCCGATGGC TTACTCGCCG
GACACCAAGC TGTTCTACGT GCCGTCGAAC GAGTGGGGCA TGGAGATCTG GAACGAGCCC
GTGACCTACA AGAAGGGCGC GGCCTACCTG GGCGCCGGCT TCACGATCAA GACGATCAAC
GACGAGTACA TCGGCGCGAT GCGTGCGATC GACCCGAAGA CCGGCAAGAC CGTCTGGGAA
GTGAAGAACA ACGCGCCGCT GTGGGGCGGT GTGCTGACGA CGGCCGGCAA CCTGGTGTTC
TGGGGCACGC CGGAAGGCTT CCTGAAGGCC GCCGACGCGA AGACCGGCAA GGTCGTCTGG
GAATTCCAGA CCGGTTCGGG CGTCGTGGCG CCGCCGGTCA CCTGGCAGCA GGACGGCGAG
CAGTACGTCT CGGTCGTTTC CGGCTGGGGC GGCGCCGTGC CGCTGTGGGG TGGCGACGTG
GCCAAGAAGG TCAACTTCCT GGAGCAGGGC GGCACCGTGT GGGTGTTCAA GCTGCCGAAG
TGA
 
Protein sequence
MRLHLLSLLV AAACASVGAQ AAGVTDAMID NDAKTPNDVL SWGIGSQGQR YSPLKQVNTS 
TVQKLVPAWA FSFGGEKQRG QQSQPLIHDG KMFVTASYSR IYALDVKTGT KLWKYEHRLP
EGIMPCCDVI NRGAALYDNL VIFGTLDAQL VALDQKTGDV VWKEKIDDYA AGYSYTAAPL
IAGGLLLTGV SGGEFGIVGR VEARDPKTGK MVWIRPTVEG HMGYKFDKDG NKTEIGISGT
TGKTWPGDMW KSGGAATWLG GTYDAKTGLA YFGTGNPGPW NSHLRPGDNL FSCATVAIDV
KTGQIKWHYQ TTPHDGWDFD GVNEFVTFDM DGKRVGGKAD RNGFFYVIDA ANGKLENAFP
FVKKITWATG IDLKTGRPNY VPENRPGDPT AGADGKKGNS VFAAPSFLGG KNQMPMAYSP
DTKLFYVPSN EWGMEIWNEP VTYKKGAAYL GAGFTIKTIN DEYIGAMRAI DPKTGKTVWE
VKNNAPLWGG VLTTAGNLVF WGTPEGFLKA ADAKTGKVVW EFQTGSGVVA PPVTWQQDGE
QYVSVVSGWG GAVPLWGGDV AKKVNFLEQG GTVWVFKLPK