Gene Mpe_A1170 details


Gene Information

Locus tag: Mpe_A1170
Symbol:
ID: 4785569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 1254608
End bp: 1257574
Gene Length: 2967 bp
Protein Length: 988 aa
Translation table: 11
GC content: 67%
IMG OID: 640089733
Product: formate dehydrogenase large subunit precursor
Protein accession: YP_001020366
Protein GI: 124266362
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
        [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID:
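
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1254608, 1257574)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch gives 2967 bp and 988 aa, matching the Gene Length and Protein Length fields.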


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.970148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCTCA CGCGCAAGTC GCCTGATGCG GGCTCCGGCT CGCCTGCATC GTCGCTGGTC 
TCGAGCCTCT CGCGCGGCCT CTCGCGCGCC ATCCCGACGA TGGACCGTCG GTCCTTCCTG
CGCCGCTCCG GCCTGGGCGT GGGGGCCGGT CTCGCCGCCG GGCAGCTGAC GTTGGTGCGC
AAGGCGCAGG CCTCGGATGC GTCCAAGGCC GAGGCGGGCC CGGGCAAGGT GGAGGTCAGG
CGCACGGTGT GCGGTCACTG CTCGGTGGGC TGCGCGGTCG ACGCGGTGGT GCACAACGGC
GTGTGGGTGC GCCAGGAGCC GGTGTTCGAC TCGCCGATCA ACCTGGGCGC GCACTGCGCC
AAGGGCGCTG CGCTGCGCGA GCACGGCCGC GGCGAATACC GCCTGCGCTA CCCGATGAAG
CTGGTGAACG GCAAGTACGA GCGCATCAGC TGGGACACCG CACTCGACGA GATCAGCGCG
CGCATGCTCG AGCTGCGCAA GCAGAGCGGC CCGGACTCGG TGTTCGTGGT CGGGTCGAGC
AAGCACAACA ACGAGCAGGC CTACCTGCTG CGCAAGTGGA TGAGCCTGTG GGGCAGCAAC
AACTGCGATC ACCAGGCACG CATCTGCCAC AGCACCACGG TCGCGGGCGT CGCGAACACC
TGGGGCTACG GCGCGATGAC CAACTCCTAC AACGACATGC AGAACGCCCG CATGGCGCTC
TACATCGGCT CCAACGCCGC CGAGGCGCAC CCGGTGTCGA TGCTGCACAT GCTGCATGCC
AAGGAGACCG GCTGCAAGAT GATCGTCGTG GACCCGCGCT TCACGCGCAC CGCGGCGCGT
GCCGACGAAC ACGTGCGCAT CCGTTCGGGC TCGGACATCC CCTTCCTGTT CGGCGTGCTG
CACCACATCT TCAAGAACGG CTGGGAAGAC AAGCAGTACA TCAACGACCG CGTCTACGGC
ATGGACAAGG TGCGCGAGGA CGTGCTCGCG AACTGGACGC CGGACAAGGT GCAGGAGGCC
TGTGGCGTCG ACGAGGCCAC CGTGTTCCGC GTGGCCAAGA CCATGGCCGA CAACCGCCCC
GGCACCATCG TCTGGTGCAT GGGCCAGACC CAGCACACCA TCGGCAACGC GATGGTGCGC
GCCTCGTGCA TTCTGCAGCT CGCGCTCGGC AACATCGGCA AGAGCGGCGG CGGCGCCAAC
ATCTTCCGTG GCCACGACAA CGTGCAGGGC GCCACGGACG TGGGCCCCAA CCCCGATTCG
CTGCCTGGCT ACTACGGCCT GGCCGAGGGC TCGTTCAAGC ATTTCGCCAA GACCTGGGGC
GTGGACTTCG AGTGGATCAA GAAGCAGTAC GCGCCGGGCA TGATGACCAA ACCCGGCATG
ACGGTGTCGC GCTGGGTCGA CGGCGTGCTC GAAAAGAACG AGCTGATCGA CCAGGACAGC
AACCTGCGCG GCCTGTTCTT CTGGGGCCAT GCGCCGAACT CGCAGACGCG CGGCCTGGAA
ATGAAGAAGG CGATGGACAA GCTCGACCTG CTGGTGGTCG TCGATCCCTT CCCGTCGGCC
ACCGCGGCGA TGGCGGCGAT GCCCGGCAAG CCCGAGGACC AGAACCCCAA CCGCGCCGTC
TACCTGCTGC CGGCGACGAC GCAGTTCGAG ACCAGTGGCT CGTGCACCGC GTCGAACCGC
TCGATCCAGT GGCGCGAGCA GGTCATCGAG CCGCTGTGGG AGAGCCGCAA CGACCACATG
ATCATGTACC AGCTCGCGCA GAAGCTGGGC TTCGACAAGG AGCTGACGAA GAACTACAAG
CTGGTCGCCG GCAAGGGCGG CATGATGGAG CCCGAGCCCG AATCGATCCT GCGCGAGATC
AACAAGAGCG TCTGGACCAT CGGCTACACC GGCCAGAGCC CCGAGCGCCT GAAGGTGCAC
ATGCGCAACA TGCACGTCTT TGACGTGAAG ACGCTGCGCG CCAAGGGCGG CACCGACAAG
GAGACCGGGT ACTCCCTCGA CGGCGATTAC TTCGGCCTGC CTTGGCCGTG CTACGGCACG
CCGGAAATGA AGCACCCGGG CTCGCCCAAC CTCTACGACA CCTCCAAGCA CGTGATGGAC
GGCGGCGGCA ACTTCCGCGC CAACTTCGGT GTCGAGAAGG ACGGCGTCAA CCTGCTGGCC
GAGGATGGCT CGTTCTCCAA GGGTGCGGAC CTCACCACCG GCTATCCGGA GTTCGACCAC
GTGCTGCTGA AGAAGCTCGG CTGGTGGGAC GAACTGAGCG AGCCCGAGAA GGCCGCGGCC
GAGGGCAAGA ACTGGAAGAC CGATTCGTCG GGCGGGATCA TCCGCGTCGC GATGAAGAAC
CACGGCTGCC ATCCGTTCGG CAACGCGAAG GCGCGGGCCG TGGTGTGGAA CTTCCCCGAT
GCGATCCCGA AGCACCGCGA GCCGCTGTAC TCGACGCGTC CGGACCTGGT GGCCAAGTAC
CCGACGCACG ACGACAAGAA GGCGTTCTGG CGCCTGCCCA CCTTGTTCAA GACCGTGCAG
GACAAGAACA AGGACATCGG CAAGGACTTC CCGCTGATCA TGAGCAGCGG GCGGCTGGTG
GAGTACGAGG GCGGCGGCGA GGAGACCCGC TCCAACCCCT ACCTGGCCGA GCTGCAGCAG
GAGATGTTCG TCGAGATCAA CCCAGCCACC GCCAACGACC GCGGCATCCG CAACGGCGAG
ACGGTCTGGG TCCGCACGCC GACCGGCGCG CGCCTCACGG TCAAGGCGCT GGTCACCGAG
CGAGTGGACC GCGAGACGGT GTGGCTGCCC TTCCACTTCT CGGGCCGCTG GCAGGGCGTC
GATCTCGCGC CCTACTACCC GCAGGGGGCG ATGCCGGTCA TCCGCGGCGA GGCGATCAAC
ACCGCCACCA CCTACGGCTA CGACAGCGTC ACGATGATGC AGGAGACCAA GACCACGGTC
TGCCAGGTCG AGCGCGCCTC TGTGTGA
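
The sequence above is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the GC content field could be re-derived from such text. The helper names are hypothetical, and it is an assumption that the reported 67% is a plain G+C fraction over the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as displayed in this record.
first_line = "ATGTTGCTCA CGCGCAAGTC GCCTGATGCG GGCTCCGGCT CGCCTGCATC GTCGCTGGTC"
print(len(clean_sequence(first_line)), round(gc_content(first_line), 2))
```

Passing the full gene sequence text to `gc_content` gives a value that can be compared with the GC content field.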
 
Protein sequence
MLLTRKSPDA GSGSPASSLV SSLSRGLSRA IPTMDRRSFL RRSGLGVGAG LAAGQLTLVR 
KAQASDASKA EAGPGKVEVR RTVCGHCSVG CAVDAVVHNG VWVRQEPVFD SPINLGAHCA
KGAALREHGR GEYRLRYPMK LVNGKYERIS WDTALDEISA RMLELRKQSG PDSVFVVGSS
KHNNEQAYLL RKWMSLWGSN NCDHQARICH STTVAGVANT WGYGAMTNSY NDMQNARMAL
YIGSNAAEAH PVSMLHMLHA KETGCKMIVV DPRFTRTAAR ADEHVRIRSG SDIPFLFGVL
HHIFKNGWED KQYINDRVYG MDKVREDVLA NWTPDKVQEA CGVDEATVFR VAKTMADNRP
GTIVWCMGQT QHTIGNAMVR ASCILQLALG NIGKSGGGAN IFRGHDNVQG ATDVGPNPDS
LPGYYGLAEG SFKHFAKTWG VDFEWIKKQY APGMMTKPGM TVSRWVDGVL EKNELIDQDS
NLRGLFFWGH APNSQTRGLE MKKAMDKLDL LVVVDPFPSA TAAMAAMPGK PEDQNPNRAV
YLLPATTQFE TSGSCTASNR SIQWREQVIE PLWESRNDHM IMYQLAQKLG FDKELTKNYK
LVAGKGGMME PEPESILREI NKSVWTIGYT GQSPERLKVH MRNMHVFDVK TLRAKGGTDK
ETGYSLDGDY FGLPWPCYGT PEMKHPGSPN LYDTSKHVMD GGGNFRANFG VEKDGVNLLA
EDGSFSKGAD LTTGYPEFDH VLLKKLGWWD ELSEPEKAAA EGKNWKTDSS GGIIRVAMKN
HGCHPFGNAK ARAVVWNFPD AIPKHREPLY STRPDLVAKY PTHDDKKAFW RLPTLFKTVQ
DKNKDIGKDF PLIMSSGRLV EYEGGGEETR SNPYLAELQQ EMFVEINPAT ANDRGIRNGE
TVWVRTPTGA RLTVKALVTE RVDRETVWLP FHFSGRWQGV DLAPYYPQGA MPVIRGEAIN
TATTYGYDSV TMMQETKTTV CQVERASV
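
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows. It relies on the fact that table 11 assigns amino acids to codons the same way as the standard code (the tables differ in permitted start codons), and it does not model selenocysteine recoding or alternative start codons. The function and variable names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown in this record.
print(translate("ATGTTGCTCA CGCGCAAGTC GCCTGATGCG"))
```

The first 30 bases translate to MLLTRKSPDA, the first block of the protein sequence, and the final codons GCC TCT GTG TGA give the C-terminal ASV followed by the stop.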