Gene Mext_4582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4582 
Symbol 
ID5835114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5117029 
End bp5119998 
Gene Length2970 bp 
Protein Length989 aa 
Translation table11 
GC content68% 
IMG OID641370376 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_001642021 
Protein GI163853978 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.656955 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG GCCCCGAACC GCACGGCAAC AAGATCGAAC AGCCCGAGAT CCGCGCCGAC 
GAACGTCAGG ATGCCGGCGG GCCGGCAAAT GGCGCACCAT CGACCTCCGG CGGCGCCTAC
TCTCAGGGCG CCAAGTCGGG TGGCCAGGCC GCGCCCGATC CTTCGGGCAC CTACGGCATC
AAGGACGCCC CGGTCGCGCC CGCGACCATC GCCTTCGAGT TCGATGGCCA ACAGGTCGAG
GCCCAGCCCG GCGAGACGAT TTGGGCGGTT GCCAAGCGCC TCGGCACGCA TATCCCGCAT
CTCTGCCACA AGCCCGATCC CGGCTACCGC CCGGACGGCA ATTGCCGCGC CTGCATGGTC
GAGATCGAGG GCGAGCGCGT GCTTGCCGCC TCGTGCAAGC GCACGCCCGC CATCGGCATG
AAGGTGAAGT CGGCCACCGA GCGCGCCACC AAGGCCCGCG CCATGGTGCT CGAACTGCTC
GTGGCCGATC AGCCCGAGCG CGCAACCTCG CATGACCCGT CGTCGCATTT CTGGGTGCAG
GCCGACGTTC TCGACGTGAC CGAGAGCCGC TTCCCGGCGG CCGAGCGCTG GACCAGCGAC
GTCAGCCACC CGGCGATGAG CGTCAATCTC GACGCCTGCA TCCAGTGCAA TCTGTGTGTG
CGCGCCTGCC GCGAGGTTCA GGTCAACGAC GTGATCGGCA TGGCCTACCG CGCCGCGGGC
TCCAAGGTCG TGTTCGACTT CGACGATCCG ATGGGTGGCT CCACCTGCGT CGCCTGCGGT
GAGTGCGTCC AGGCCTGCCC GACCGGCGCG CTGATGCCGG CCGCCTATCT CGACGCCAAC
CAGACCCGGA CGGTCTATCC CGACCGCGAG GTGAAGTCGC TCTGCCCCTA TTGCGGCGTC
GGCTGCCAAG TCTCCTACAA GGTCAAGGAC GAGCGCATCG TCTACGCCGA GGGCGTGAAC
GGACCGGCCA ACCAGAACCG GCTCTGCGTG AAGGGCCGCT TCGGCTTCGA CTACGTCCAC
CACCCCCACC GCCTGACGGT GCCGCTGATC CGCCTGGAGA ACGTGCCCAA GGACGCCAAC
GATCAGGTCG ATCCGGCGAA CCCCTGGACG CATTTCCGCG AGGCGACCTG GGACGAGGCG
CTCGACCGCG CGGCGGGCGG CCTGAAGGCG ATCCGTGACA CCAACGGGCG CAAGGCGCTG
GCGGGCTTCG GCTCGGCCAA GGGTTCGAAC GAGGAGGCCT ACCTCTTCCA GAAGCTCGTC
CGCCTCGGCT TCGGCACCAA CAACGTCGAT CACTGCACGC GCCTGTGCCA CGCCTCGTCG
GTCGCTGCGC TGATGGAGGG CTTGAATTCC GGCGCCGTCA CCGCCCCCTT CTCGGCAGCG
CTCGACGCCG AGGTCATCGT CGTCATCGGC GCCAACCCGA CCGTGAACCA TCCGGTCGCG
GCGACCTTCC TCAAGAACGC GGTGAAGCAG CGCGGCGCCA AGCTGATCAT CATGGACCCG
CGGCGCCAGA CGCTCTCGCG CCACGCCTAT CGGCACCTCG CCTTCCGCCC CGGCTCGGAC
GTGGCGATGC TCAACGCGAT GCTCAACGTG ATCGTCACGG AGGGCCTCTA CGACGAGCAG
TACATCGCCG GCTACACCGA GAACTTCGAG GCTCTGCGCG AGAAGATCGT CGACTTCACG
CCGGAGAAGA TGGCTTCGGT CTGCGGCATC GATGCCGAGA CCCTGCGTGA GGTCGCCCGG
CTCTATGCCC GGGCCAAGTC GTCGCTCATC TTCTGGGGCA TGGGCGTCAG CCAGCACGTC
CACGGCACCG ACAACTCGCG CTGCCTGATC GCGCTCGCCC TCATCACCGG CCAGATCGGC
CGGCCCGGCA CCGGCCTGCA CCCGTTGCGC GGCCAGAACA ACGTCCAGGG CGCGTCCGAT
GCCGGCCTGA TCCCGATGGT CTACCCGGAC TATCAGTCGG TCGAGAAGGA TGCGGTGCGC
GAGCTGTTCG AGGAGTTCTG GGGTCAGTCC CTCGATCCGC AGAAGGGCCT CACCGTGGTC
GAGATCATGC GCGCGATCCA CGCGGGCGAG ATCCGGGGCA TGTTCGTCGA GGGCGAGAAC
CCGGCGATGT CCGACCCCGA CCTCAACCAC GCCCGCCACG CGCTGGCGAT GCTCGACCAT
CTCGTGGTGC AGGACCTGTT CCTGACGGAG ACGGCCTTCC ACGCCGACGT GGTGCTGCCG
GCCTCGGCCT TTGCCGAGAA GGCCGGGACC TTCACCAACA CCGACCGGCG TGTGCAGATC
GCCCAGCCCG TCGTCGCCCC TCCGGGCGAT GCGCGCCAGG ATTGGTGGAT CATCCAGGAA
CTGGCCCGAC GCCTCGACCT CGACTGGAAC TACGGCGGCC CGGCCGACAT CTTCGCCGAG
ATGGCGCAGG TGATGCCGTC CTTGAACAAC ATCACCTGGG AGCGGCTGGA GCGCGAGGGG
GCGGTGACCT ATCCGGTCGA TGCTCCGGAC CAGCCCGGCA ACGAGATCAT CTTCTATGCC
GGCTTCCCGA CCGAGAGCGG GCGCGCCAAG ATCGTGCCCG CGGCGATCGT GCCGCCGGAC
GAGGTGCCGG ACGACGAGTT CCCGATGGTG CTCTCGACCG GCCGCGTGCT CGAACACTGG
CACACGGGCT CGATGACCCG GCGCGCGGGC GTGCTCGACG CGCTGGAGCC GGAGGCGGTG
GCCTTCATGG CACCCAAGGA GCTCTACCGG CTCGGTCTCC GGCCCGGCGG GTCGATGCGG
TTGGAAACAC GGCGCGGCGC CGTCGTGTTG AAGGTGCGCT CCGACCGGGA CGTGCCGATC
GGCATGATCT TCATGCCCTT CTGCTACGCG GAAGCCGCCG CCAACCTTCT GACCAACCCC
GCCCTCGACC CCCTTGGAAA GATTCCCGAG TTCAAATTCT GCGCAGCCCG CGTCGTCCCC
GCGGAGGCTG CGCCGATGGC CGCCGAGTAA
 
Protein sequence
MSNGPEPHGN KIEQPEIRAD ERQDAGGPAN GAPSTSGGAY SQGAKSGGQA APDPSGTYGI 
KDAPVAPATI AFEFDGQQVE AQPGETIWAV AKRLGTHIPH LCHKPDPGYR PDGNCRACMV
EIEGERVLAA SCKRTPAIGM KVKSATERAT KARAMVLELL VADQPERATS HDPSSHFWVQ
ADVLDVTESR FPAAERWTSD VSHPAMSVNL DACIQCNLCV RACREVQVND VIGMAYRAAG
SKVVFDFDDP MGGSTCVACG ECVQACPTGA LMPAAYLDAN QTRTVYPDRE VKSLCPYCGV
GCQVSYKVKD ERIVYAEGVN GPANQNRLCV KGRFGFDYVH HPHRLTVPLI RLENVPKDAN
DQVDPANPWT HFREATWDEA LDRAAGGLKA IRDTNGRKAL AGFGSAKGSN EEAYLFQKLV
RLGFGTNNVD HCTRLCHASS VAALMEGLNS GAVTAPFSAA LDAEVIVVIG ANPTVNHPVA
ATFLKNAVKQ RGAKLIIMDP RRQTLSRHAY RHLAFRPGSD VAMLNAMLNV IVTEGLYDEQ
YIAGYTENFE ALREKIVDFT PEKMASVCGI DAETLREVAR LYARAKSSLI FWGMGVSQHV
HGTDNSRCLI ALALITGQIG RPGTGLHPLR GQNNVQGASD AGLIPMVYPD YQSVEKDAVR
ELFEEFWGQS LDPQKGLTVV EIMRAIHAGE IRGMFVEGEN PAMSDPDLNH ARHALAMLDH
LVVQDLFLTE TAFHADVVLP ASAFAEKAGT FTNTDRRVQI AQPVVAPPGD ARQDWWIIQE
LARRLDLDWN YGGPADIFAE MAQVMPSLNN ITWERLEREG AVTYPVDAPD QPGNEIIFYA
GFPTESGRAK IVPAAIVPPD EVPDDEFPMV LSTGRVLEHW HTGSMTRRAG VLDALEPEAV
AFMAPKELYR LGLRPGGSMR LETRRGAVVL KVRSDRDVPI GMIFMPFCYA EAAANLLTNP
ALDPLGKIPE FKFCAARVVP AEAAPMAAE