Gene Mpe_A0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0337 
Symbol 
ID4786887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp367484 
End bp370351 
Gene Length2868 bp 
Protein Length955 aa 
Translation table11 
GC content71% 
IMG OID640088892 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001019534 
Protein GI124265530 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAC CGCTGACTTC GAAGCTGCTC GCGCCACAGG CCGCGATGGT GGAGTTCTCG 
CTCGACGGCC GCACGGTCAG CGCCGGCGCT GACGAGACGA TCTGGACCGT CGCGAAGCGC
GAAGGCACGA CCATCCCCCA CCTGTGCCAC AAGGAGGGCC TGACGCCAGC CGGCAACTGC
CGCGCCTGCG TGGTCGAGGT GGAGGGCGAA CGGGCACTGG CCGCCTCGTG CTGCCGCAAC
GTCGCCGCCG GCATGAAGGT GCAGACGCAG AGCCCGCGCG CCGCGTCGGC CCGCAAGATG
GTGGTCGAGC TGCTGGTCAG CGACGCCGGT GCCACCACCG ACACCTACAC GAAGGCCTCG
GAACTCAGCC AATGGGCCGA ACAGCTGGCC GTGCCGCGCG GGCGCCTGCC GCAGCGCGAC
GCGCCGCATT ACGCGGCGGC CGACCCTTCG CACCCGGCGA TCGCGGTGAA CCTGGACGCC
TGCATCCAGT GCACGCGCTG CCTGCGCGCC TGCCGCGACG AGCAGGGCAA CGACGTGATC
GGCCTGGCCT TCCGCGGCGC GCATGCGCAG ATCACCTTCG ATGCCGGTGC GACGCTCGGC
GAGTCGAGCT GCGTGGCCTG CGGCGAATGC GTGCAGGCCT GCCCGACCGG CGCACTGATG
CCGGCCCGCG GCGCCGGGCT GCTGGAAGTG ACGAAGCAGG TGGATTCGGT CTGCCCCTAC
TGCGGCGTAG GCTGCCAGCT CACCTGGAAC GTCGGTCCGA ACGCACAGGG CGAGGAGCGC
ATCCATTTCG TGACCGGCCG CGACGGCCCG GCCAACCACG GGCGGCTGTG CGTGAAGGGT
CGCTATGGCT TCGACTACAT CCACAACCCG CGCCGCCTGA CCACGCCGCT GATCCGACGC
GAAGGCGTGG CCAAGGACCC GGCCGACATC GAGCGCCTCA AGCAGGGCCA GCTGAAGCCG
ACGGACATCT TCCGCACGGC GACCTGGGAC GAGGCGATGG AGCTCGCGGC CGGCGGCCTG
GCCCGGCTGC GCGACGAGGC GCTGGCGGCC GGCCTGCGCG GCAACGACAT CCCGCTCGCG
GGCTTCGGCT CGGCCAAGGG CAGCAACGAG GAGGCCTACC TGTTCCAGAA GCTGGTGCGG
CAGGGCTTCC GGACCAACAA CGTCGATCAC TGCACGCGGC TGTGCCATGC CAGCTCGGTG
GCGGCGCTGC TGGAGGGCAT CGGCTCGGGG GCGGTCAGCA ACCCGGTGGA GGATGTCGCC
CACGCCGACC TGATCTTCCT GATCGGCGCG AACCCGGCGG TGAACCACCC GGTCGCGGCG
AGCTGGATCA AGAACGCGGT CGACCGCGGC GCCCGGCTGG TGATCTGCGA TCCGCGTCAC
ACCGCGCTGA CGCGTCGCGC CACCTGGCAC CTGCAGTTCC GTCCCGACAC CGACGTCGCG
CTGCTCAACG GGCTGCTGCA CGTGATCGTC GCCGAGGGAC TGGTCGATGA GGCCTTCGTC
GCGGCGCGCG TCAACGGCTA CGAGGCACTG AAGGCCTCGG TGGCCGAGGC GACGCCGGAG
CGCATGAGCG AGATCTGCGG CATCGACGCG CAGACGATCC GCGACGTGGC CCGCGCCTAT
GCCACCAGCA AGGGCTCGAT GATTCTCTGG GGCATGGGCG TGAGCCAGCA TGTGCACGGC
ACCGACAACG CGCGCGGGCT GATCGCGTTG GCGATGCTGA CTGGCCAGAT CGGACGTGTC
GGCACCGGCC TGCATCCGCT GCGCGGCCAG AACAACGTGC AGGGCGCCAG CGACGCCGGG
CTGATCCCGA TGATGCTGCC CAACTACCAG CGCGTCATCA ACCCGACGGT GCGCCAGGCC
TTCGAGCGCC TGTGGGCCAC GCCCGAGCCA CTGGATGCGA CGCCCGGCCT GACCGTCGTC
GAGATCATGC ATGCGGCCAG CGAAGGCCGC ATCCGCGGCA TCTACGTCGA GGGCGAGAAC
CCGGCGATGT CGGACCCCGA CCTCAGCCAT GCCCGCCGGG CACTGGCAGG CCTGGAGCAT
CTGGTCGTGC AGGACATCTT CCTCACCGAG ACCGCGATGC TGGCCGACGT GGTGCTGCCG
GCCTCGGCCC ATGCCGAGAA GTGGGGCAGC TACACGAACA CCGACCGGCT GATCCAGATC
GGCCGCCCCG CGCTCGATCC GCCCGAGCTC GCGATGCAGG ATCTGTGGAT CATCGAGCGC
GTCGGCCGGC GTCTGGGCCT GGCCTGGAAC TACTGGCGCG ACGAAGACGG CGGCGGCAAG
CGCGCCTCGC AGGCGGCGGT GGCGCGCGTC TACGAGGAGA TGCGCGTCAG CATGCCGCCG
CTGGCCGGCG TCCCCTGGAG CCGCCTGGTC AAGGCCGACG CGGTGATGAC GCCCGCGGCG
AGCGAGGACG ACCCCGGCGC TGCGGTGGTC TTCATCGATC GCTTCCCGAC GGCCGACGGC
CGAGCGACCG TGGTACCGAC CGTGTTCCGC CCCGGCGCCG AGCAGATCGA CGCCGAGTAC
CCCTTCGTCC TGACCACCGG CCGCGTGCTC GAGCATTGGC ACACCGGCGC GATGACACGG
CACGCCAGCA TGCTGGACGC CATCGCGCCC GAGGCGCTGG TGTCGCTGCA TCCGGCGGAT
GCGCTGACGG TCGGCGTGCG CGACGGCCAG GCGGTGCTGA TGTCGACGCG GCACGGTGCG
GTGCAGGCGC GCGTGCGCGT CAGCACCGAG GTGCAGCCCG GCCAGGTGTT TCTGCCGTTC
GCCTTCTGGG AGGCGGCGGC GAACAAGCTG ACCGGCGACG CCCTGGACGA CGTGGCGAAG
ATCCCTGGCT TCAAGGTCAC GGCCGCCAAG CTCAGCGTGA TCGCCTGA
 
Protein sequence
MNAPLTSKLL APQAAMVEFS LDGRTVSAGA DETIWTVAKR EGTTIPHLCH KEGLTPAGNC 
RACVVEVEGE RALAASCCRN VAAGMKVQTQ SPRAASARKM VVELLVSDAG ATTDTYTKAS
ELSQWAEQLA VPRGRLPQRD APHYAAADPS HPAIAVNLDA CIQCTRCLRA CRDEQGNDVI
GLAFRGAHAQ ITFDAGATLG ESSCVACGEC VQACPTGALM PARGAGLLEV TKQVDSVCPY
CGVGCQLTWN VGPNAQGEER IHFVTGRDGP ANHGRLCVKG RYGFDYIHNP RRLTTPLIRR
EGVAKDPADI ERLKQGQLKP TDIFRTATWD EAMELAAGGL ARLRDEALAA GLRGNDIPLA
GFGSAKGSNE EAYLFQKLVR QGFRTNNVDH CTRLCHASSV AALLEGIGSG AVSNPVEDVA
HADLIFLIGA NPAVNHPVAA SWIKNAVDRG ARLVICDPRH TALTRRATWH LQFRPDTDVA
LLNGLLHVIV AEGLVDEAFV AARVNGYEAL KASVAEATPE RMSEICGIDA QTIRDVARAY
ATSKGSMILW GMGVSQHVHG TDNARGLIAL AMLTGQIGRV GTGLHPLRGQ NNVQGASDAG
LIPMMLPNYQ RVINPTVRQA FERLWATPEP LDATPGLTVV EIMHAASEGR IRGIYVEGEN
PAMSDPDLSH ARRALAGLEH LVVQDIFLTE TAMLADVVLP ASAHAEKWGS YTNTDRLIQI
GRPALDPPEL AMQDLWIIER VGRRLGLAWN YWRDEDGGGK RASQAAVARV YEEMRVSMPP
LAGVPWSRLV KADAVMTPAA SEDDPGAAVV FIDRFPTADG RATVVPTVFR PGAEQIDAEY
PFVLTTGRVL EHWHTGAMTR HASMLDAIAP EALVSLHPAD ALTVGVRDGQ AVLMSTRHGA
VQARVRVSTE VQPGQVFLPF AFWEAAANKL TGDALDDVAK IPGFKVTAAK LSVIA