Gene Moth_1954 details


Gene Information

Locus tag: Moth_1954
Symbol:
ID: 3832305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2032098
End bp: 2033087
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 59%
IMG OID: 637829885
Product: D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding
Protein accession: YP_430795
Protein GI: 83590786
COG category: [C] Energy production and conversion
              [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1052] Lactate dehydrogenase and related dehydrogenases
TIGRFAM ID:
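
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the source record.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2032098, 2033087)
print(length)                  # 990
print(protein_length(length))  # 329
```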


Plasmid Coverage information

Num covering plasmid clones: 61
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0249135
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAAAT GGAACGTCTA TGTTACTCGT CTGGTCCCAC AACCGGCCCT GGATCTCCTG 
GCCGAGTACT GCGACCTGGA GATCAACCCT GAAGACCGGG TCCTGACCAG GGCTGAATTG
CTGGAAAAGG TCCGGGGTCG CGACGGCATC CTCTGTCTCC TGACGGACAT CCTGGACGAC
GAGGTCTTTA CCGCAGCTAA AGGGGCCAAG ATCTTCGCCA ACTTAGCCGT CGGCTTTAAT
AACGTCGACC TGGAAGCAGC CACCCGGCAC GGGATCATGA TCACCAATAC CCCGGGCGTC
CTCACCGAAG CCACCGCCGA CATGGCCTGG GCCCTGCTCT TTGCTGTGGC ACGGCGGGTG
GTGGAAGGCG ACAAGTTTAC CCGGGCCGGT AAATACAAGG GCTGGGGCCC CCTGTTGATG
CTCGGCCAGG AAATTACCGG TAAAACCCTG GGCGTCATCG GCGCCGGCCG TATCGGCACC
GCCTTTGCCC GCAAAGCCAG GGGCTTTGAT ATGAAGGTCC TCTACCACGA TGTCCAGCCA
AGCAAGGCTT TCGAAGAAGC CACCGGCGGT CAATTCGTCG ACAAGGAGAC CCTCCTCAAG
GAAGCTGATT TTGTTTCCCT GCACGTTCCC TTAATGCCTT CGACCACCCA CCTCATCAGT
ACTCCGGAAC TAAAACTGAT GAAGAAAACA GCCATCCTCA TTAACACCTC CCGTGGCCCG
GTCGTTGATG AAAAGGCCCT GGTCAAAGCC CTCCGAGAGA AGGAAATCTG GGGCGCCGGC
CTGGACGTCT TCGAAAACGA ACCGGAACTG GCCCCGGGCC TGGCTGACCT GGAGAATGTT
GTTCTCTGCC CCCACATCGC CAGCGCTACC TGGGAAACCC GGACCAATAT GGCCTTAATG
GCCGCCAACA ACCTGCTGGC CGCCCTGCGG GGTGAACTAC CGCCCCAGTG CCTGAACCCC
GAAGTTTACT ACCGGCAACA CGGTAAATAG
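
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The sketch below is one way to do it, assuming the sequence is pasted as a string that may contain spaces and line breaks; `gc_content` is a hypothetical helper name, and the short input is only the first 20 bases of the gene, not the full sequence.

```python
def gc_content(dna):
    """Percentage of G and C bases, ignoring whitespace in the input."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base groups of the gene sequence: 7 of 20 bases are G or C.
print(gc_content("ATGTCTAAAT GGAACGTCTA"))  # 35.0
```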
 
Protein sequence
MSKWNVYVTR LVPQPALDLL AEYCDLEINP EDRVLTRAEL LEKVRGRDGI LCLLTDILDD 
EVFTAAKGAK IFANLAVGFN NVDLEAATRH GIMITNTPGV LTEATADMAW ALLFAVARRV
VEGDKFTRAG KYKGWGPLLM LGQEITGKTL GVIGAGRIGT AFARKARGFD MKVLYHDVQP
SKAFEEATGG QFVDKETLLK EADFVSLHVP LMPSTTHLIS TPELKLMKKT AILINTSRGP
VVDEKALVKA LREKEIWGAG LDVFENEPEL APGLADLENV VLCPHIASAT WETRTNMALM
AANNLLAALR GELPPQCLNP EVYYRQHGK
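
The protein sequence can be derived from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline. It builds the codon table from the compact 64-letter amino-acid string in T, C, A, G base order. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which start codons are allowed; this gene starts with ATG, so no special start handling is included. The name `translate` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGTCTAAAT GGAACGTCTA TGTTACTCGT"))  # MSKWNVYVTR
```

Passing the full gene sequence, spaces included, should reproduce the protein sequence listed above with its spacing removed.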