Gene Mext_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1643 
Symbol 
ID5832584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1833190 
End bp1834152 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content67% 
IMG OID641367441 
Productmalate dehydrogenase, NAD-dependent 
Protein accessionYP_001639113 
Protein GI163851070 
COG category[C] Energy production and conversion 
COG ID[COG0039] Malate/lactate dehydrogenases 
TIGRFAM ID[TIGR01763] malate dehydrogenase, NAD-dependent 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.160131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.329919 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGCA GCAAGATCGC GCTCATCGGC GCCGGACAGA TCGGCGGCAC GCTGGCCCAT 
CTCGCCGGCC TCAAGGAACT CGGTGACGTG GTGCTGTTCG ACATCGTCGA CGGCGTGCCG
CAGGGCAAGG CGCTCGACAT TGCCGAGTCC GCTCCCGTCG ACGGCTTCGA CGCCAAGTAT
TCGGGCGCCA GCGACTATTC CGCGATCGCC GGGGCCGACG TGGTGATCGT GACGGCGGGC
GTGCCGCGCA AGCCCGGCAT GAGCCGTGAC GACCTCATCG GCATCAACCT GAAGGTGATG
GAGGCGGTCG GCGCCGGCAT CAAGGAGCAC GCGCCCGACG CCTTCGTGAT CTGCATCACC
AACCCGCTCG ACGCGATGGT GTGGGCGCTG CAGAAGTTCT CGGGGCTGCC CACCAACAAG
GTCGTCGGCA TGGCCGGCGT GCTCGACTCC GCCCGCTTCC GCCACTTCCT GGCCGAAGAG
TTCGGCGTCT CGGTCGAGGA CGTGACCGCC TTCGTGCTCG GCGGCCACGG CGACGACATG
GTGCCGCTGA CCCGCTACTC GACGGTGGCC GGCGTGCCGC TGACCGATCT GGTCAAGCTC
GGCTGGACCA CCCAGGAGAA GCTCGACGCC ATGGTCGAGC GCACCCGCAA GGGCGGCGGC
GAGATCGTCA ACCTCCTGAA GACCGGCTCG GCCTTCTACG CGCCCGCCGC CTCCGCCATC
GCCATGGCCG AGAGCTACCT GCGCGACAAG AAGCGGGTTC TGCCCTGCGC CGCCTACCTC
GACGGCCAGT ACGGCATCGA CGGCCTCTAT GTCGGCGTGC CGGTGGTGAT CGGCGAGAAC
GGCGTCGAGC GTGTGCTCGA GGTGACCTTC AACGACGACG AGAAGGCGAT GTTCGAGAAG
TCGGTCAATT CGGTGAAGGG CTTGATCGAG GCCTGCAAGA GCGTCAACGA CAAGCTCGCG
TAA
 
Protein sequence
MARSKIALIG AGQIGGTLAH LAGLKELGDV VLFDIVDGVP QGKALDIAES APVDGFDAKY 
SGASDYSAIA GADVVIVTAG VPRKPGMSRD DLIGINLKVM EAVGAGIKEH APDAFVICIT
NPLDAMVWAL QKFSGLPTNK VVGMAGVLDS ARFRHFLAEE FGVSVEDVTA FVLGGHGDDM
VPLTRYSTVA GVPLTDLVKL GWTTQEKLDA MVERTRKGGG EIVNLLKTGS AFYAPAASAI
AMAESYLRDK KRVLPCAAYL DGQYGIDGLY VGVPVVIGEN GVERVLEVTF NDDEKAMFEK
SVNSVKGLIE ACKSVNDKLA