Gene Mext_4406 details


Gene Information

Locus tag: Mext_4406
Symbol:
ID: 5834743
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4902926
End bp: 4905778
Gene Length: 2853 bp
Protein Length: 950 aa
Translation table: 11
GC content: 67%
IMG OID: 641370199
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_001641845
Protein GI: 163853802
COG category: [R] General function prediction only
COG ID: [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type
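
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 4902926
end_bp = 4905778
gene_length_bp = 2853
protein_length_aa = 950

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is excluded from the protein.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under those assumptions all three checks agree with the listed values: 2853 bp is 951 codons, and 951 minus the stop codon gives 950 aa.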


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTCA TCAAGGAAAT CGACTACGGC ACGCCGATCC GGCTCAATGA GCAGACGGTG 
ACGCTGACCA TCGACGGCGA GAGCGTGACG GTGCCCGCGG GCACCTCGGT CATGGCCGCC
GCGATGCATA TGGGCACCAA GATCCCGAAG CTCTGCGCCA CGGATTCGCT GGAGCCGTTC
GGCTCCTGCC GGATGTGCCT CGTCGAGATC GACGGCCGCC GCGGCACGCC CGCCTCCTGC
ACCACCCCGG CCGAGAATGG CATGGTCGTG CACACGCAGA CCGACAAGCT GCATCGGCTG
CGCAAGGGCG TGATGGAGCT CTACATCTCC GACCACCCGC TCGACTGCCT GACCTGCGCC
GCCAACGGCG ATTGCGAGCT GCAAGATACT GCAGGTCAGG TCGGCCTGCG CGAGGTCCGC
TACGGCTATG ACGGCGACAA CCACGTCAAG CCGGCCTCCG ACCGCTACCT GCCCAAGGAC
GAGTCGAACC CCTACTTCAC CTACGACCCG TCGAAGTGCA TCGTCTGCAA CCGCTGCGTC
CGCGCCTGCG AGGAGACGCA GGGCACCTTC GCGCTGACCA TCGAGGGCCG CGGCTTCGAC
AGTCGCGTGG CGGCGGGCCC GACCAACTTC ATGCAGTCCG AATGCGTGTC CTGCGGCGCC
TGCGTCCAGG CCTGCCCGAC CGCGACGCTG CAGGAGAAGA CGATCCACCA ATACGGCCAG
CCGGACCATT CCGAGGTCAC GACCTGCGCC TATTGCGGCG TCGGCTGCGC CTTCAAGGCC
GAGATGCAGG GCGACAAGGT CGTCCGCATG GTGCCCTACA AGGGCGGCAA GGCGAACGAG
GGACATAGCT GCGTCAAGGG CCGCTTCGCC TACGGCTACG CCACCCACAA GGACCGCATC
ACCAAGCCGA TGATCCGCGC CAAGATCACG GATCCGTGGC GCGAGGTGTC GTGGGAGGAG
GCGATCAATC ACGCCGCCTC CGAGTTCAAG CGCATCCAGG CGACCTACGG CCGCGACTCG
GTCGGCGGCA TCACCTCGTC GCGCTGCACC AACGAGGAAG CCTACCTCGT CCAGAAGCTG
GTGCGCGCCG CCTTCGGCAA CAACAACGTC GATACCTGCG CCCGCGTCTG CCACTCGCCG
ACCGGCTACG GCCTGATGTC CACGCTCGGC ACCTCCGCCG GCACGCAGGA CTTCAAGTCG
GTCGAGGAAT CCGACGTGAT CCTCGTCATC GGCGCCAACC CGACCGACGG CCACCCCGTC
TTCGGCTCGC GGATGAAGAA GCGGCTGCGT GAGGGCGCCC GCCTCATCGT CGCCGACCCG
CGCAAGATCG ACCTCGTGAA GTCGCCCCAC ATCCGGGCCG AGCATCACCT GCCGCTCAAG
CCCGGCTCCA ACGTCGCCTT CATCAACGCC TTCGCCCACG TCATCGTCAC GGAAGGGCTG
ATCGCCGAGG ACTACGTCCG CGAGCGCTGC GATCTGGCCG AGTTCGAGTC CTGGGCCCGC
TTCATTGCCG AGGAGCGCAA CTCGCCGGAA GCCGCGCAGG CCATCACCGG CGTCGATCCA
CAGGAGATCC GCGCCGCGGC CCGGCTCTAC GCCACCGGCG GCAAGGCGGC GATCTACTAC
GGGCTCGGCG TGACCGAGCA CAGCCAGGGC TCGACCATGG TGATGGGCAT GGCCAACATC
GCCATGGCCA CCGGCAATAT CGGCATGGTG GGCGCCGGCG TGAACCCGCT GCGCGGCCAG
AACAACGTGC AGGGCTCCTG CGACATGGGC TCGTTCCCGC ACGAGCTGCC GGGCTACCGC
CACGTCTCGG ACGACGCCAC CCGCGAGAGC TTCGAGGCGA TCTGGGGGGC CAAGCTCGAC
AACGCGCCGG GCCTGCGCAT CACCAACATG CTGGACGAGG CCGTCGGCGG CAGCTTCAAG
GGCATGTACA TCCAGGGCGA GGACATCGCG CAGTCCGACC CCGACACCCA CCACGTCACC
TCCGGCCTCA AGGCCATGGA GTGCATCGTG ATCCAGGACC TGTTCCTGAA CGAGACCGCC
AAATACGCCC ACGTCTTCCT GCCGGGCGCT TCCTTCCTCG AGAAGGACGG CACCTTCACC
AATGCCGAGC GCCGCATCTC CCGCGTGCGC AAGGTGATGG CCCCGATGGG CGGCTACGGC
GATTGGGAGG GCACGGTGCT GCTCGCCAAC GCCCTGGGCT ACAAGATGGA GTACACCCAC
CCGTCCGAGA TCATGGACGA GATCGCGGCG CTCACCCCGA GCTTCGCCGG CGTCTCCTAT
GACAAGCTGG AGGAGCTGGG CTCGATCCAG TGGCCGTGCA ACGAGAAGGC GCCGCTCGGC
ACGCCGATGA TGCACGTCGA CCGGTTCGTG CGCGGCAAGG GCCGCTTCAT GATCACGGAA
TACGTGCCCA CCGACGAGCG GACCACGGGC AAGTTCCCGC TGATCCTCAC CACGGGCCGC
ATCCTCTCGC AGTACAATGT CGGCGCGCAG ACGCGGCGCA CCGAGAACTC CCGCTGGCAC
GAGGAAGACG TGCTGGAGAT CCACCCCTTC GACGCGGAGA TGCGCGGCAT CGTCGATGGC
GACCTCGTCG CCCTGGAGAG CCGCTCGGGC GACATCGCGC TGAAGGCCAA GGTGACCGAG
CGGATGCAGC CGGGGATCGT CTATACGACG TTCCACCACG CCAAGACCGG CGCCAACGTC
ATCACCACGG ACTATTCGGA CTGGGCGACC AACTGCCCCG AATACAAGGT GACGGCGGTG
CAGGTGCGGC GCACGAACCG GCCCTCGGAC TGGCAGGCGA AGTTCTACGA GGAGGACTTC
TCGCTCACCC GCATCGCCGA AGCGGCGGAA TAG
 
Protein sequence
MTLIKEIDYG TPIRLNEQTV TLTIDGESVT VPAGTSVMAA AMHMGTKIPK LCATDSLEPF 
GSCRMCLVEI DGRRGTPASC TTPAENGMVV HTQTDKLHRL RKGVMELYIS DHPLDCLTCA
ANGDCELQDT AGQVGLREVR YGYDGDNHVK PASDRYLPKD ESNPYFTYDP SKCIVCNRCV
RACEETQGTF ALTIEGRGFD SRVAAGPTNF MQSECVSCGA CVQACPTATL QEKTIHQYGQ
PDHSEVTTCA YCGVGCAFKA EMQGDKVVRM VPYKGGKANE GHSCVKGRFA YGYATHKDRI
TKPMIRAKIT DPWREVSWEE AINHAASEFK RIQATYGRDS VGGITSSRCT NEEAYLVQKL
VRAAFGNNNV DTCARVCHSP TGYGLMSTLG TSAGTQDFKS VEESDVILVI GANPTDGHPV
FGSRMKKRLR EGARLIVADP RKIDLVKSPH IRAEHHLPLK PGSNVAFINA FAHVIVTEGL
IAEDYVRERC DLAEFESWAR FIAEERNSPE AAQAITGVDP QEIRAAARLY ATGGKAAIYY
GLGVTEHSQG STMVMGMANI AMATGNIGMV GAGVNPLRGQ NNVQGSCDMG SFPHELPGYR
HVSDDATRES FEAIWGAKLD NAPGLRITNM LDEAVGGSFK GMYIQGEDIA QSDPDTHHVT
SGLKAMECIV IQDLFLNETA KYAHVFLPGA SFLEKDGTFT NAERRISRVR KVMAPMGGYG
DWEGTVLLAN ALGYKMEYTH PSEIMDEIAA LTPSFAGVSY DKLEELGSIQ WPCNEKAPLG
TPMMHVDRFV RGKGRFMITE YVPTDERTTG KFPLILTTGR ILSQYNVGAQ TRRTENSRWH
EEDVLEIHPF DAEMRGIVDG DLVALESRSG DIALKAKVTE RMQPGIVYTT FHHAKTGANV
ITTDYSDWAT NCPEYKVTAV QVRRTNRPSD WQAKFYEEDF SLTRIAEAAE
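
The sequences above are printed in groups of ten characters, so the spaces and line breaks have to be removed before measuring length or GC content. The following is a minimal sketch of that cleanup. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers written for this example and are not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Illustration only: the first two groups and the last two groups of the
# gene sequence listed above, not the full gene.
fragment = """ATGACCCTCA TCAAGGAAAT
AGCGGCGGAA TAG"""

cleaned = clean_sequence(fragment)
print(len(cleaned))
print(round(gc_fraction(fragment), 2))
```

Pasting the complete gene sequence into `fragment` would allow the listed gene length and GC content to be compared against the computed values.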