Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1954 |
Symbol | |
ID | 3832305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2032098 |
End bp | 2033087 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829885 |
Product | D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding |
Protein accession | YP_430795 |
Protein GI | 83590786 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1052] Lactate dehydrogenase and related dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 61 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0249135 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAAAT GGAACGTCTA TGTTACTCGT CTGGTCCCAC AACCGGCCCT GGATCTCCTG GCCGAGTACT GCGACCTGGA GATCAACCCT GAAGACCGGG TCCTGACCAG GGCTGAATTG CTGGAAAAGG TCCGGGGTCG CGACGGCATC CTCTGTCTCC TGACGGACAT CCTGGACGAC GAGGTCTTTA CCGCAGCTAA AGGGGCCAAG ATCTTCGCCA ACTTAGCCGT CGGCTTTAAT AACGTCGACC TGGAAGCAGC CACCCGGCAC GGGATCATGA TCACCAATAC CCCGGGCGTC CTCACCGAAG CCACCGCCGA CATGGCCTGG GCCCTGCTCT TTGCTGTGGC ACGGCGGGTG GTGGAAGGCG ACAAGTTTAC CCGGGCCGGT AAATACAAGG GCTGGGGCCC CCTGTTGATG CTCGGCCAGG AAATTACCGG TAAAACCCTG GGCGTCATCG GCGCCGGCCG TATCGGCACC GCCTTTGCCC GCAAAGCCAG GGGCTTTGAT ATGAAGGTCC TCTACCACGA TGTCCAGCCA AGCAAGGCTT TCGAAGAAGC CACCGGCGGT CAATTCGTCG ACAAGGAGAC CCTCCTCAAG GAAGCTGATT TTGTTTCCCT GCACGTTCCC TTAATGCCTT CGACCACCCA CCTCATCAGT ACTCCGGAAC TAAAACTGAT GAAGAAAACA GCCATCCTCA TTAACACCTC CCGTGGCCCG GTCGTTGATG AAAAGGCCCT GGTCAAAGCC CTCCGAGAGA AGGAAATCTG GGGCGCCGGC CTGGACGTCT TCGAAAACGA ACCGGAACTG GCCCCGGGCC TGGCTGACCT GGAGAATGTT GTTCTCTGCC CCCACATCGC CAGCGCTACC TGGGAAACCC GGACCAATAT GGCCTTAATG GCCGCCAACA ACCTGCTGGC CGCCCTGCGG GGTGAACTAC CGCCCCAGTG CCTGAACCCC GAAGTTTACT ACCGGCAACA CGGTAAATAG
|
Protein sequence | MSKWNVYVTR LVPQPALDLL AEYCDLEINP EDRVLTRAEL LEKVRGRDGI LCLLTDILDD EVFTAAKGAK IFANLAVGFN NVDLEAATRH GIMITNTPGV LTEATADMAW ALLFAVARRV VEGDKFTRAG KYKGWGPLLM LGQEITGKTL GVIGAGRIGT AFARKARGFD MKVLYHDVQP SKAFEEATGG QFVDKETLLK EADFVSLHVP LMPSTTHLIS TPELKLMKKT AILINTSRGP VVDEKALVKA LREKEIWGAG LDVFENEPEL APGLADLENV VLCPHIASAT WETRTNMALM AANNLLAALR GELPPQCLNP EVYYRQHGK
|
| |