Gene Mchl_5043 details

Gene Information

Locus tag: Mchl_5043
Symbol:
ID: 7113646
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 5391054
End bp: 5394023
Gene Length: 2970 bp
Protein Length: 989 aa
Translation table: 11
GC content: 68%
IMG OID: 643527737
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_002423736
Protein GI: 218532920
COG category: [R] General function prediction only
COG ID: [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type
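
The length fields in this record agree with its coordinates. The sketch below is a minimal consistency check in Python. It assumes 1-based inclusive coordinates (the record does not state its convention) and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length_bp = gene_length(5391054, 5394023)
print(length_bp)                  # 2970
print(protein_length(length_bp))  # 989
```

Both results match the Gene Length and Protein Length fields listed above.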


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.331865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.361747
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG GCCCCGAACC GCACGGCAAC AAGATCGAAC AGCCCGAGAT CCGCGCCGAC 
GAACGTCAGG ATGCCGGCGG GCCGGCAAAT GGCGCGCCAT CGACCTCCGG CGGCGCCTAC
TCGCAGGGCG CCAAGTCGGG TGGCCAGGCC GCGCCGGACC CTTCCGGCAC CTACGGCATC
AAGGACGCCC CGGTCGCGCC TGCGACCATC GCCTTCGAGT TCGACGGCCA ACAGGTCGAG
GCCCAGCCCG GCGAGACGAT CTGGGCGGTC GCCAAGCGCC TCGGCACGCA TATTCCGCAT
CTCTGCCACA AGCCCGATCC CGGCTACCGG CCGGACGGCA ATTGCCGCGC CTGCATGGTC
GAGATCGAGG GCGAGCGCGT GCTCGCCGCC TCGTGCAAGC GCACGCCCGC CATCGGGATG
AAGGTGAAGT CGGCCACCGA GCGCGCCACC AAGGCCCGCG CCATGGTGCT CGAACTGCTC
GTGGCCGATC AGCCCGAGCG TGCGACCTCG CATGACCCGT CGTCGCATTT CTGGGTGCAG
GCCGACGTTC TCGACGTGAC CGAGAGCCGC TTCCCGGCGG CCGAGCGCTG GACCAGCGAC
GTCAGCCATC CGGCGATGAG CGTCAACCTC GACGCCTGCA TCCAGTGCAA TCTCTGTGTG
CGCGCCTGCC GCGAGGTTCA GGTCAACGAC GTGATCGGCA TGGCCTACCG CGCCGCGGGC
TCCAAGGTCG TGTTCGACTT CGACGATCCG ATGGGTGGCT CCACCTGCGT CGCCTGCGGC
GAGTGCGTCC AGGCCTGCCC GACCGGCGCG CTGATGCCGG CCGCCTATCT CGACGCCAAC
CAGACCCGGA CGGTCTATCC CGACCGCGAG GTGAAGTCGC TCTGCCCCTA TTGCGGCGTC
GGCTGCCAAG TCTCCTACAA GGTCAAGGAC GAGCGCATCG TCTACGCCGA GGGCGTGAAC
GGACCGGCCA ACCAGAACCG GCTCTGCGTG AAGGGCCGCT TCGGCTTCGA CTACGTCCAC
CACCCCCACC GCCTGACGGT GCCGCTGATC CGCCTGGAGA ACGTGCCCAA GGACGCCAAC
GATCAGGTCG ATCCGGCGAA CCCCTGGACG CATTTCCGCG AGGCGACTTG GGACGAGGCG
CTCGACCGCG CGGCGGGCGG CCTGAAGACG ATCCGTGACA CCAACGGGCG CAAGGCGCTG
GCGGGCTTCG GCTCGGCCAA GGGTTCGAAC GAGGAGGCGT ACCTCTTCCA GAAGCTCGTC
CGCCTCGGCT TCGGCACCAA CAACGTCGAT CACTGCACGC GCCTGTGCCA CGCCTCGTCG
GTGGCCGCGC TGATGGAAGG CCTGAATTCC GGCGCCGTCA CCGCTCCCTT CTCGGCAGCG
CTCGACGCCG AAGTCATCGT CGTCATCGGC GCCAACCCGA CCGTGAACCA TCCGGTCGCG
GCGACCTTCC TCAAGAACGC GGTGAAGCAG CGCGGCGCCA AGCTGATCAT CATGGACCCG
CGGCGCCAGA CGCTCTCGCG CCACGCCTAT CGGCACCTCG CCTTCCGCCC CGGCTCGGAC
GTGGCGATGC TCAACGCGAT GCTCAACGTG ATCGTCACGG AGGGCCTCTA CGACGAGCAG
TACATCGCCG GCTACACCGA GAACTTCGAG GCCCTGCGCG AGAAGATCGT CGACTTCACG
CCGGAGAAGA TGGCTTCGGT CTGCGGCATC GACGCCGAGA CCCTGCGCGA GGTCGCCCGG
CTCTACGCCC GGGCCAAGTC GTCGCTCATC TTCTGGGGCA TGGGCGTCAG CCAGCACGTC
CACGGCACCG ACAACTCTCG CTGCCTGATC GCGCTCGCCC TCATCACCGG CCAGATCGGC
CGGCCCGGCA CCGGCCTGCA CCCGTTGCGC GGTCAGAACA ACGTCCAGGG CGCGTCCGAT
GCCGGCCTGA TCCCGATGGT CTACCCGGAC TATCAGTCGG TCGAGAAGGA CGCGGTGCGC
GAGCTGTTCG AGGAGTTCTG GGGCCAGTCC CTCGATCCGC AGAAGGGCCT CACCGTGGTC
GAGATCATGC GCGCGATCCA CGCGGGCGAG ATCCGGGGCA TGTTCGTCGA GGGCGAGAAC
CCGGCGATGT CCGACCCCGA CCTCAACCAC GCCCGCCACG CGCTGGCGAT GCTCGACCAC
CTCGTGGTAC AGGACCTGTT CCTGACTGAG ACCGCCTTCC ACGCCGACGT GGTGCTGCCG
GCCTCGGCCT TTGCCGAGAA GGCCGGCACC TTCACCAACA CCGACCGGCG CGTGCAGATC
GCCCAGCCCG TCGTCGCCCC TCCGGGCGAT GCGCGCCAGG ATTGGTGGAT CATCCAGGAA
CTGGCCCGAC GCCTCGACCT CGACTGGAAC TACGGCGGCC CGGCCGACAT CTTCGCCGAA
ATGGCGCAGG TGATGCCGTC CTTGAACAAC ATCACCTGGG AGCGTCTGGA GCGCGAGGGG
GCGGTGACCT ATCCGGTCGA TGCCCCGGAC CAGCCTGGCA ACGAGATCAT CTTCTATGCC
GGCTTCCCGA CCGAGAGCGG TCGCGCCAAG ATCGTGCCCG CGGCGATCGT GCCGCCGGAC
GAGGTGCCGG ACGACGAGTT CCCGATGGTG CTCTCGACCG GCCGCGTGCT CGAACACTGG
CACACCGGCT CGATGACCCG GCGCGCGGGC GTGCTCGACG CGCTGGAGCC GGAGGCGGTG
GCGTTCATGG CACCCAAGGA GCTCTACCGG CTCGGTCTCC GGCCCGGCGG GTCGATGCGG
TTGGAAACAC GGCGCGGCGC CGTCGTGTTG AAGGTCCGCT CCGACCGGGA CGTGCCGATC
GGCATGATCT TCATGCCCTT CTGCTACGCG GAAGCCGCTG CCAACCTTCT GACCAACCCC
GCCCTCGACC CCCTCGGAAA GATTCCCGAG TTCAAATTCT GCGCAGCCCG CGTCGTCCCC
GCGGAGGCTG CGCCGATGGC CGCCGAGTAA
 
Protein sequence
MSNGPEPHGN KIEQPEIRAD ERQDAGGPAN GAPSTSGGAY SQGAKSGGQA APDPSGTYGI 
KDAPVAPATI AFEFDGQQVE AQPGETIWAV AKRLGTHIPH LCHKPDPGYR PDGNCRACMV
EIEGERVLAA SCKRTPAIGM KVKSATERAT KARAMVLELL VADQPERATS HDPSSHFWVQ
ADVLDVTESR FPAAERWTSD VSHPAMSVNL DACIQCNLCV RACREVQVND VIGMAYRAAG
SKVVFDFDDP MGGSTCVACG ECVQACPTGA LMPAAYLDAN QTRTVYPDRE VKSLCPYCGV
GCQVSYKVKD ERIVYAEGVN GPANQNRLCV KGRFGFDYVH HPHRLTVPLI RLENVPKDAN
DQVDPANPWT HFREATWDEA LDRAAGGLKT IRDTNGRKAL AGFGSAKGSN EEAYLFQKLV
RLGFGTNNVD HCTRLCHASS VAALMEGLNS GAVTAPFSAA LDAEVIVVIG ANPTVNHPVA
ATFLKNAVKQ RGAKLIIMDP RRQTLSRHAY RHLAFRPGSD VAMLNAMLNV IVTEGLYDEQ
YIAGYTENFE ALREKIVDFT PEKMASVCGI DAETLREVAR LYARAKSSLI FWGMGVSQHV
HGTDNSRCLI ALALITGQIG RPGTGLHPLR GQNNVQGASD AGLIPMVYPD YQSVEKDAVR
ELFEEFWGQS LDPQKGLTVV EIMRAIHAGE IRGMFVEGEN PAMSDPDLNH ARHALAMLDH
LVVQDLFLTE TAFHADVVLP ASAFAEKAGT FTNTDRRVQI AQPVVAPPGD ARQDWWIIQE
LARRLDLDWN YGGPADIFAE MAQVMPSLNN ITWERLEREG AVTYPVDAPD QPGNEIIFYA
GFPTESGRAK IVPAAIVPPD EVPDDEFPMV LSTGRVLEHW HTGSMTRRAG VLDALEPEAV
AFMAPKELYR LGLRPGGSMR LETRRGAVVL KVRSDRDVPI GMIFMPFCYA EAAANLLTNP
ALDPLGKIPE FKFCAARVVP AEAAPMAAE
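
Both sequences are printed in blocks of ten characters separated by spaces, so the layout has to be removed before any downstream use. The sketch below shows one way to do that in Python and to compute a GC fraction, assuming the sequence text has been copied into a string. The helper names are hypothetical and not part of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence above, used as a small example.
first_row = "ATGAGCAACG GCCCCGAACC GCACGGCAAC AAGATCGAAC AGCCCGAGAT CCGCGCCGAC"
dna = clean_sequence(first_row)
print(len(dna))          # 60
print(gc_fraction(dna))  # GC fraction of this row only, not of the whole gene
```

Running the same two helpers on the full gene sequence should give a length of 2970 and a GC fraction close to the 68% listed in the record, if the text was copied without loss.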