Gene Mpe_A3710 details

Gene Information

Locus tag: Mpe_A3710
Symbol:
ID: 4786059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 3921181
End bp: 3924114
Gene Length: 2934 bp
Protein Length: 977 aa
Translation table: 11
GC content: 69%
IMG OID: 640092293
Product: NAD-dependent formate dehydrogenase alpha subunit
Protein accession: YP_001022898
Protein GI: 124268894
COG category: [R] General function prediction only
COG ID: [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type
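
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3921181
END_BP = 3924114
GENE_LENGTH_BP = 2934
PROTEIN_LENGTH_AA = 977


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the record is internally consistent: 3924114 - 3921181 + 1 = 2934 bp, and 2934 / 3 - 1 = 977 aa.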


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.749683
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.699855
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCGACC CCCACGCCGA GATCGACTAC GGCACCCCGG CCAGCGCGTC GACGCAGGCC 
GTCACGCTCG AGATCGACGG CCAGGCCGTC ACCGTGCCGG CCGGCACCTC GCTGATGCGC
GCGGCGATCG ACGCCGGCGT GCAGGTGCCC AAGCTGTGCG CGACCGACAG CCTGGAGCCC
TTCGGCTCGT GCCGCCTGTG CCTGGTCGAG ATCGAGGGCC GCCGCGGCTA CCCGGCCTCG
TGCACCACGC CGGCCGAGGC CGGCATGAAG GTGCGCACGC AGAGCCCGAA GCTGCAGGAG
CTGCGCAAGG GGGTGATGGA GCTCTACATC TCCGACCACC CGCTCGACTG CCTGACCTGC
GCCGCCAACG GCGACTGCGA GCTGCAGGAC ATGGCCGGCG TGACCGGCCT GCGCGAGGTC
CGCTACGGCG TCGGTGACCA GTACGGCGGC GCCCACCACC TGAAGAGCGC GAAGGACGAG
TCGAACCCGT ACTTCACCTA CGACCCGAGC AAGTGCATCG TCTGCAACCG CTGCGTGCGC
GCCTGCGAGG AGACCCAGGG CACCTTCGCG CTGACGATCA GTGGCCGCGG CTTCGAGTCG
CGCGTGTCGG CGGGCCAGGA CCAGCCCTTC ATGGAAAGCG AGTGCGTCAG CTGCGGCGCC
TGCGTGCAGG CCTGCCCGAC CGCCACGCTG CAGGAGAAGT CGGTCATCTG GCTGGGCCAG
GCCGAGCACA GCGTCACCAC CACCTGTGCC TACTGCGGCG TCGGCTGCGG CTTCAAGGCC
GAGATGAAGG GCAACGAGGT CGTGCGCATG GTGCCGTGGA AGAACGGCCA GGCCAACGAA
GGCCACTCCT GCGTCAAGGG CCGCTTCGCC TGGGGCTACG CGACCCACAA GGACCGCATT
ACCACGCCGA TGATCCGCAA GCGCATCACC GACCCGTGGC AGGAGGTGAG CTGGGACGAG
GCGATCGGCT ATGCCGCCAG CGAGTTCAAG CGCATCCAGG CGAAGTACGG GCGCGACGCG
ATCGGCGGCA TCGTCTCCTC GCGCTGCACC AACGAGGAAG GCTATCTGGT GCAGAAGCTG
GTACGCGCGG CCTTCGGCAA CAACAACGTC GACACCTGCG CGCGCGTGTG CCACTCACCG
ACCGGCTATG GCCTGAAGCA GACGCTGGGC GAATCGGCCG GCACGCAGAC CTTCAAGTCG
GTGGAGAAGT CGGACGTGAT CATGGTCATC GGTGCCAACC CGACCGACGG CCACCCGGTG
TTCGCCTCGC GCATGAAGAA GCGACTGCGC GAAGGCGCGA AGCTGATCGT CGTCGACCCG
CGCAGGATCG ACCTGGTGAA GTCGCCGCAC GTGAAGGCCG ACCACCATCT TCAGCTGCGT
CCGGGCACCA ACGTCGCGGT GATCACCGCG CTGGCGCACG TGATCGTCAC CGAGGGCTTG
CTCGACGAGG CTTACATCGC CGAACGCTGC GAGGACAAGG CCTTCCGCGA GTGGCGCGAG
TTCGTGTCGC GCGATGCCAA CTCCCCTGAG GCGACCGCCG TCGTCACCGG CGTGCCGGCC
ACCGAGCTGC GCGCCGCCGC CCGACTGTTC ACCACCGGCA GATCCGACGG CACGGCTGCC
ACGCTGCAGG CCCGCGGGGC GCGGCCCAAC GCCGCCATCT ACTACGGCCT GGGCGTGACC
GAGCACAGCC AGGGTTCGAC GATGGTGATG GGCATCGCCA ACCTCGCGAT GGCCACCGGC
AACGTCGGCC GCGAGGGGGT GGGCGTGAAC CCGCTGCGCG GCCAGAACAA CGTGCAGGGT
TCGTGCGACA TCGGCTCCTT CCCGCACGAG TTGCCGGGCT ACCGCCACGT GTCGGACAGC
AGCACGCGGG CGCTGTTCGA GAACGCCTGG AACGTCGAGC TGCAGCCCGA ACCCGGCCTG
CGCATCCCCA ACATGTTCGA CGCCGCGCTG TCGGGCAGCT TCATGGGCCT GTACTGCGAG
GGCGAGGACA TCGTGCAGTC CGACCCGGAC ACGCAGCACG TCGCGCATGC GTTGTCCTCC
ATGGAATGCA TCGTCGTGCA GGACCTGTTC CTGAACGAGA CCGCGAAGTA CGCGCACGTC
TTCCTGCCGG GCTCGTCCTT CCTGGAGAAG GACGGCACCT TCACCAACGC CGAGCGCCGC
ATCTCGCGCG TGCGCAAGGT GATGCCGCCC AAGGCAGGCC TCGCCGACTG GGAGGTGACG
GTGAAGCTGT CGAACGCGCT CGGCTACCCG ATGGACTACA CCCACCCCGA GCAGATCATG
GCCGAGATCG CCGCGCTGAC GCCCACCTTC TCCGGCGTCA GCTACGAGAA GCTCGACCGT
CTGGGCAGCA TCCAGTGGCC GTGCAACGAT GAGGCCCCCG AGGGCACGCC GACGATGCAC
ATCGACCGCT TCGTGCGCGG CAAGGGCAAG TTCTTCATCA CGCAGTACGT CGCCTCCGAC
GAGAAGGTCA CGCGCCGGTT CCCGCTGCTG CTGACGACAG GGCGCATCCT GTCGCAGTAC
AACGTCGGCG CGCAGACGCG CCGCACCGAG AACAACCAGT GGCACAGCGA GGACCGGCTG
GAGCTGCACC CGCACGACGC CGAGGAGCGC GGCATCCGGG ACGGCGACTG GGTCGGCATC
GAGAGCCGCT CGGGCCAGAC CGTGCTGCGC GCGCAGGTCA GCGACCGCAT GCAGGCCGGC
GTGGTCTACA CGACCTTCCA CTTCCCCGAG TCGGGCGCCA ACGTCATCAC CACCGACAGC
TCCGACTGGG CCACCAACTG CCCCGAGTAC AAGGTGACCG CGGTGCAGGT GCTGCCGGTG
ATGCAGCCTT CGGGCTGGCA GAAGGCCTAC AGCCGCTTCA CCGAGGTGCA GGAGCGCCTG
CTCGCCGAAC GGCACAAGGC CGAGCCGGCC CTCGCCGGCG CGAAGAAGCC ATGA
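
The GC content reported for this gene can, in principle, be recomputed from the sequence as displayed. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the gene is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks between 10-base blocks; upper-case."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above (60 bases).
first_row = "ATGCTCGACC CCCACGCCGA GATCGACTAC GGCACCCCGG CCAGCGCGTC GACGCAGGCC"
print(f"{gc_percent(first_row):.1f}% GC over {len(clean_sequence(first_row))} bp")
```

Passing the full 2934 bp sequence instead of `first_row` gives the gene-wide value to compare with the GC content field.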
 
Protein sequence
MLDPHAEIDY GTPASASTQA VTLEIDGQAV TVPAGTSLMR AAIDAGVQVP KLCATDSLEP 
FGSCRLCLVE IEGRRGYPAS CTTPAEAGMK VRTQSPKLQE LRKGVMELYI SDHPLDCLTC
AANGDCELQD MAGVTGLREV RYGVGDQYGG AHHLKSAKDE SNPYFTYDPS KCIVCNRCVR
ACEETQGTFA LTISGRGFES RVSAGQDQPF MESECVSCGA CVQACPTATL QEKSVIWLGQ
AEHSVTTTCA YCGVGCGFKA EMKGNEVVRM VPWKNGQANE GHSCVKGRFA WGYATHKDRI
TTPMIRKRIT DPWQEVSWDE AIGYAASEFK RIQAKYGRDA IGGIVSSRCT NEEGYLVQKL
VRAAFGNNNV DTCARVCHSP TGYGLKQTLG ESAGTQTFKS VEKSDVIMVI GANPTDGHPV
FASRMKKRLR EGAKLIVVDP RRIDLVKSPH VKADHHLQLR PGTNVAVITA LAHVIVTEGL
LDEAYIAERC EDKAFREWRE FVSRDANSPE ATAVVTGVPA TELRAAARLF TTGRSDGTAA
TLQARGARPN AAIYYGLGVT EHSQGSTMVM GIANLAMATG NVGREGVGVN PLRGQNNVQG
SCDIGSFPHE LPGYRHVSDS STRALFENAW NVELQPEPGL RIPNMFDAAL SGSFMGLYCE
GEDIVQSDPD TQHVAHALSS MECIVVQDLF LNETAKYAHV FLPGSSFLEK DGTFTNAERR
ISRVRKVMPP KAGLADWEVT VKLSNALGYP MDYTHPEQIM AEIAALTPTF SGVSYEKLDR
LGSIQWPCND EAPEGTPTMH IDRFVRGKGK FFITQYVASD EKVTRRFPLL LTTGRILSQY
NVGAQTRRTE NNQWHSEDRL ELHPHDAEER GIRDGDWVGI ESRSGQTVLR AQVSDRMQAG
VVYTTFHFPE SGANVITTDS SDWATNCPEY KVTAVQVLPV MQPSGWQKAY SRFTEVQERL
LAERHKAEPA LAGAKKP
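
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch ignores). `translate` is a hypothetical helper name.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by
# translation tables 1 and 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence above.
print(translate("ATGCTCGACC CCCACGCCGA GATCGACTAC"))
# MLDPHAEIDY
print(translate("CTCGCCGAAC GGCACAAGGC CGAGCCGGCC CTCGCCGGCG CGAAGAAGCC ATGA"))
# LAERHKAEPALAGAKKP
```

Both fragments match the start and end of the protein sequence listed above; every row of the gene sequence holds 60 bases, so each row begins in frame.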